Data Mining

Closed - This job posting has been filled and work has been completed.
Web, Mobile & Software Dev Scripts & Utilities Posted 1 year ago

Hourly Job

Less than 30 hrs/week
Less than 1 week
$$

Intermediate Level

I am looking for a mix of experience and value

Details

We need a programmer to write a script that crawls http://answers.yahoo.com/ and extracts the following information:

1. A list of questions asked on the website (originating from the USA).
2. The date each question was posted.
3. If the question has been answered, the number of answers it received.
4. If the question has been 'starred' or 'favorited', the number of favorites it received.

The script should preferably be written in Linux shell or Python. The output should be a text file with four tab-separated columns corresponding to the four items listed above. For many questions, columns three and four may be empty. The script should capture all questions asked (originating from the USA) in the last 4 days.
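
To make the expected output concrete, here is a minimal Python sketch of the last two steps only: keeping questions from the last 4 days and writing the four tab-separated columns. It assumes the crawl and the USA filtering have already produced one record per question. The `Question` record, its field names, and the helper functions are hypothetical and used only for illustration; the posting does not specify a date format, so ISO dates (`YYYY-MM-DD`) are assumed, and a count of zero is written as an empty column on the reading that "not answered" and "not starred" mean empty.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Iterable, Iterator, Optional, TextIO


@dataclass
class Question:
    """One already-crawled question (hypothetical record for illustration)."""
    text: str
    posted: datetime                 # posting time, timezone-aware
    answers: Optional[int] = None    # None if the question is unanswered
    favorites: Optional[int] = None  # None if the question was never starred


def within_last_days(questions: Iterable[Question], now: datetime,
                     days: int = 4) -> Iterator[Question]:
    """Yield only the questions posted in the `days` days before `now`."""
    cutoff = now - timedelta(days=days)
    for q in questions:
        if cutoff <= q.posted <= now:
            yield q


def _count(n: Optional[int]) -> str:
    """Render a count; missing or zero counts become an empty column."""
    return str(n) if n else ""


def write_tsv(questions: Iterable[Question], out: TextIO) -> None:
    """Write one line per question: text, date, answers, favorites."""
    for q in questions:
        fields = [
            " ".join(q.text.split()),       # collapse tabs/newlines inside the text
            q.posted.strftime("%Y-%m-%d"),  # assumed date format
            _count(q.answers),
            _count(q.favorites),
        ]
        out.write("\t".join(fields) + "\n")
```

With an open output file `f` and a timezone-aware `now`, `write_tsv(within_last_days(records, now), f)` would produce the requested file. Collapsing whitespace inside the question text keeps every row at exactly four columns even when a question contains tabs or line breaks.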

We don't need you to run the script; we just need the script.

