You've landed at the right place. oDesk is now Upwork. Learn about the new platform.

Web Scraping Jobs

162 were found based on your criteria

show all
show all
only
only
only
show all
only
only
only
only
only
show all
only
only
only
Fixed-Price - Est. Budget: $ 70 Posted
The goal of the project is to download the raw texts of all posts of a given blog. In addition to the raw HTML, we need the extracted title, raw content, publication date, name of the author and the list of tags. jsoup should be used to parse the HTML data and extract the relevant data based on CSS selectors. All data need to be stored in a Postgres table. The script should be written either in Java8 or in Kotlin. If you are able to write it in Kotlin, then your substantially more likely to get this job and subsequent job. We have a number of jobs that we would like to outscore. We are looking forward to your application. Please quote a fixed price and start your post with the word "jsoup". We have to ignore all applications that fail to meet these requirements
Fixed-Price - Est. Budget: $ 100 Posted
I am looking for someone to get me a list of all the dance schools in the US: Data to be included in different columns of an excel sheet. 1. Studio Name (ABC Dance School, XYZ Karate Dojo) 2. Owner Name (Jane Doe) 3. Type of studio (Dance-HipHop, Dance-Jazz, Dance-Ballet, etc., Karate) 4. Studio Address (1234 Name Street) 5. Studio City (City Name) 6. Studio State (New York, California, etc.) 7. # of Studio Locations (1, 2, 3, 4, etc.) 8. # of students (100, 25, 500, etc.) 9. Phone Number (777-444-4232) 10. Secondary Phone Number (777-444-4000) 11. Studio Email (admin@abcdance.com) 12. Studio Website (www.abcdance.com) 13. Studio Facebook Page 14. Studio Yelp Page 15. Studio Twitter Page 16. Studio YouTube Page 17. Studio Google+ Page 18. Sources where data was acquired (Google Maps, Yellow pages, yelp, dance school directories, karate school directory, etc.) From initial research there is about 36,000+ dance schools in the United States. Looking for someone...
Fixed-Price - Est. Budget: $ 30 Posted
I need you to format an excel spreadsheet and create new columns: First Name/Last Name. You will have a list of websites. I need you to find the First and Last Name of the business owner. I have attached an example of how I would like you to format the final excel spreadsheet. You will also have to look at the business owners website and grade it as poor, average, premium - based on design. If you have read and understand this job posting please write: "Fast Delivery" at the top of your proposal. Final formatting will be based on the first page of the example spreadsheet in this document.
Hourly - Est. Time: Less than 1 month, 30+ hrs/week - Posted
We are an Australian ground transportation company looking to expand our marketshare. We need someone to find decion makers from Brisbane based companies and executive assistants/personal asssistants from all companies across all sectors .We are looking for quality specific leads go junk. The first week will be a trial to see the quality, from there we are looking to make the right person a permanent member of our team.
Hourly - Est. Time: Less than 1 week, 30+ hrs/week - Posted
1, it can scrape about 100M products info daily or weekly. Each product have 300-1K bytes. If it is hard, scraping 10M product daily or weekly also ok. 2, so it seem a distributed crawl is a must. 3, use proxies to save money and can detect IP ban and rotation or slow scraping to avoid IP ban. 4, I found scrapy cluster project is great buy not try. At moment, scrapy(or custom crawl) + redis is a option. 5, save data to mysql or sql server or even oracle to query products.
Fixed-Price - Est. Budget: $ 20 Posted
I'm looking for someone who can make a script that can scrape a table from a webpage and save it as a csv. I tried using iMarcos but it skips some lines of the table for some reason. The script will need to be something I can reuse on the same webpage because there are several different data sets I need to scrape. The number or columns and their headers are the same for each data set. The only thing that changes is the number of rows. The webpage is not a public url so we will need to use teamviewer and communicate through skype.
Hourly - Est. Time: Less than 1 month, 10-30 hrs/week - Posted
Two jobs: 1) We have a (nearly complete) python web scraping program for facebook and need your help finishing. 2) We have an online directory and can only access 150 contacts daily. We need to run a daily script to collect the information of 150 local contacts each day.