You've landed at the right place. oDesk is now Upwork. Learn about the new platform.

Data Scraping Jobs

137 were found based on your criteria {{ paging.total | number:0 }} were found based on your criteria

show all
  • Hourly ({{ jobTypeController.getFacetCount("hourly") | number:0}})
  • Fixed Price ({{ jobTypeController.getFacetCount("fixed") | number:0}})
show all
only
only
only
show all
only
only
only
only
only
show all
only
only
only
Looking for the Team App?
Download the New Upwork Team App
Hourly - Intermediate ($$) - Est. Time: Less than 1 month, 10-30 hrs/week - Posted
Want to get company names (particularly Solar PV installers) from online membership directories from Texas and Hawaii. Then identify two contacts from each company based on their position and then get contact information for both of those contacts. PLEASE SEE ATTACHED INSTRUCTION GUIDE THAT OUTLINES WORK TO IN DETAIL Depending on hourly rates and estimates for number of records that will be collected, will negotiate either a fixed price for set number of leads contacts or maximum amount of hours worked. Expectation is about 200-250 companies between the two markets. My hope is that someone might be able to create a script or a bot to collect this information, however this position will also be posted in the admin section.
Skills: Data scraping Data Entry Data mining Data Science
Hourly - Expert ($$$) - Est. Time: 1 to 3 months, 10-30 hrs/week - Posted
I am looking to build a program that can do web scraping of a single site on a daily basis. I would like this software to be able to run in the cloud and extract data to a spreadsheet. I plan to continually improve and add on to this web scraping platform and integrate the data with other services so there is a strong possibility of continual work if interested. below is a brief video explaining what I need done http://screencast.com/t/OcdWxFR7CgjE Please be honest, dependable, and highly experienced in programming/web scraping.
Skills: Data scraping Web scraping
Hourly - Entry Level ($) - Est. Time: More than 6 months, 10-30 hrs/week - Posted
I am looking for a mix of experience and value. Details: We are a newly formed UK based property investment firm seeking an experienced VA to assist with lead generation and data entry for our telemarketers to chase up. The successful candidate will be required to scrape leads from various sources and enter in to a spreadsheet provided adhering to strict exclusion criteria outlined by us. This job will be offered initially on a trial basis, but for the right candidate this job will become a permanent role. When applying for the role please detail experience in similar roles you have undertaken. We look forward to hearing from you.
Skills: Data scraping Data Entry Email Marketing Google Docs
Hourly - Intermediate ($$) - Est. Time: More than 6 months, Less than 10 hrs/week - Posted
We're looking for someone to scrap twitter for us. Specific keywords / hashtags would be provided to you for e.g. webinar or teleclass etc. You need to follow the link people are talking about using that hashtag and collect data such as the webinar host's name and contact info.
Skills: Data scraping Data Entry Data mining Internet research
Hourly - Entry Level ($) - Est. Time: Less than 1 week, 10-30 hrs/week - Posted
I have downloaded ALL files - text and metadata - http://thomas.loc.gov/home/multicongress/multicongress.html The gov't puts text in one file and metadata in another file. Put these together into one JSON file per bill, nomination, or amendment. General Specifications for THOMAS Context Parse materials from Library of Congress (THOMAS). Convert to JSON. The parser should be optimized for speed. Deliverables - Python code utilizing good programming practices and principles (e.g. use of a good style guide for readability; good OOP principles) - Documentation (how to use the program you’ve written and how the file directory hierarchy is laid out) - JSON files. Further Details http://thomas.loc.gov/home/LegislativeData.php?&n=BSS&c={congress}.format(congress=congress) Congress is the congressional meeting; the latest is the 114th congress Iterate through Bills and Legislative actions Keys of the JSON representation would be under the following structure (see model at the end of this documentation) Important: top level “words” field must be clean, text-only, no extraneous whitespace (no non-ASCII characters), no \n's JSON Data Model { “external_id”: LoCT_<Congress Number>_<Bill number>, “abstract”: “...” # summary “date”: <introduction date of Bill>, “title”: “official title”, “words”: “...”, “meta”: { “latest_title”: “...”, “sponsor”: “...”, “text_of_legislation”: “...”, “titles”: [“title 1”, “title 2”, ...], “related_bills”: [“bill 1”, “bill 2”, ...], “cbo_cost_estimates”: , “text_of_legislation”: “...”, “cosponsor”: [“name 1”, “name 2”], “amendments”: [“amendment 1”, “amendment 2”], “subjects”: [“subject 1”, “subject 2”], “crs summary”: “...”, “committees”: [ { “committee 1”: “...”, “subcommittee”: “...” }, { “committee 2”: “...”, “subcommittee”: “...” } ], “congressional_actions”: [ { “action 1”: [“sub-action 1”, “sub-action 2”, …, “sub-action n”]}, { “action 2”: [“sub-action 1”, “sub-action 2”, …, “sub-action n”]}, …, { “action 3”: [“sub-action 1”, “sub-action 2”, …, “sub-action n”]} ] } }
Skills: Data scraping Python
Hourly - Entry Level ($) - Est. Time: 1 to 3 months, Less than 10 hrs/week - Posted
Hello, I am looking for someone to build me a list of email addresses. You must find the prospects on linkedin. I am looking specifically for users who work in requested ZIP codes (to be provided PM) Must have a proven track record and confidently be able to do this task. Must be able to provide weekly reports that include data in Excel files. Must have excellent Communication Skills. Start you PROPOSAL WITH THE WORD "IN" to verify you have read the description correctly.
Skills: Data scraping Data Entry Data mining Email Handling
Hourly - Entry Level ($) - Est. Time: 1 to 3 months, 10-30 hrs/week - Posted
Looking for a lead generation expert with a premium Linkedin account to scrape a list of leads. Criteria: Current Title: Marketing Director Country: United States Industry: Legal Services Industry: Law Practice Company Size: 11-50 The result should be an excel file with the following info: First name Last name, Title, Email address, Company name, Company size, Linkedin url and phone number if possible. Please recommend the best way you will approach this task. Please reply with "I have read and understood the instructions" at the start of your application. What challenging part of this job are you most experienced in? Have you taken any oDesk tests and done well on them that you think are relevant to this job?
Skills: Data scraping Data mining Internet research Lead generation
Looking for the Team App?
Download the New Upwork Team App
Fixed Price Budget - ${{ job.amount.amount | number:0 }} to ${{ job.maxAmount.amount | number:0 }} Fixed-Price - Est. Budget: ${{ job.amount.amount | number:0 }} Open to Suggestion Hourly - Est. Time: {{ [job.duration, job.engagement].join(', ') }} - Posted
Skills: {{ skill.prettyName }}
Looking for the Team App?
Download the New Upwork Team App