scraping task

Closed - This job posting has been filled and work has been completed.
Web & Mobile Development Scripts & Utilities Posted 2 years ago

Fixed Price

Delivery by November 1, 2012

$250.00

Budget

Details

I need an imacro scraping task that will scrape imdb.com.  This information is for personal research purposes only and I do have the right to copy it for those purposes.  You must have the enterprise edition of imacro, which allows you to distribute the imacro player free to me (after I hire you).  Other original scraping program may be acceptable.
The task will scrape all movies for all years, starting here:
http://www.imdb.com/year/
For each movie, the task will scrape all the data available for each movie. To do this, follow the links at the bottom of the page for each movie, under the heading "explore more about . . . ", the links will only be active if there is available data there, so the task will follow the available links and scrape the data behind each one. Do not follow the links under the sub-headings "external links", "related items," or "professional services".   Some of the links are to multiple pages, for example I want to scrape each review (some movies have several hundred reviews).
Data will be saved as many different csv files.  
Again, I am open to alternatives here. I can accept an sql database table, but I will need to be able to produce excel files from it.
Before payment, the task needs to have run on at least 100 movies (at least half of which are from the contemporary era, to assure the task is scraping all available data), and I need to approve the output. Thanks.

---
Skills: research


About the Client

(4.98) 44 reviews

United States
Fresno 05:33 AM

128 Jobs Posted
58% Hire Rate, 4 Open Jobs

Over $10,000 Total Spent
73 Hires, 8 Active

$5.58/hr Avg Hourly Rate Paid
47 Hours

Member Since Oct 11, 2012