Data Mining/Web Extraction/Scraper
I am looking for a person/and or a team with deep experience with software/ and or software development for website extraction/ scraper / harvesting specific data from online sources.
The fields of data I need extracted/scraped are as follows:
Required Data:
Very Helpful:
1. If data is not available for any of the above fields; it should have “Not Available” for that entry.
2. If there are multiple people listed for "Company contact / Owner", listing one name should be sufficient, preferably the owner or the highest position executive person.
3. All data would need to be exportable to a CSV file.
4. Automatically extract and export data from virtually any web-based data source, including HTML, XML, Flash, Excel and PDF.
5. You must be able to build a program to extract data from a many sources.
6. Please list the best potential sources to extract Business to Business data(examples: Manta, Yelp, Yellow Pages, Service Magic, Trade Associations, Chamber of Commerce)
7. Please provide a list of the sources of data that you have previous extraction experience.
8. Have at least 3 year experience (examples of a variety of project results is required) in the field and be available for a long term working relationship if the first project is successful.
9. You must be certain that the extraction program can obtain all data required without detection.
10. The program must be able to import proxies.
Tags:
website data extraction, web content mining, page scrapping, web parsing, web fetching, website scrapping, web scrapping







