Web Scraping Jobs

152 jobs were found based on your criteria

Hourly - Est. Time: Less than 1 week, 10-30 hrs/week - Posted
This task involves scraping data from a Japanese government website and organizing it into two Excel or OpenOffice spreadsheets. Please note: I am not interested in hiring anyone for a manual copy-and-paste job, as that would take far too long. The data will need to be scraped using grepping methods or specialized software (which is why I've listed Python as a needed skill). It is slightly more cumbersome than a typical web scraping task, since the data sources for each variable/column in the spreadsheets come from multiple pages within the website, and of course it's in Japanese. That said, I will provide detailed instructions on the exact data I'm looking for, with links to each data source, screenshots of the display, and a copy of the HTML source for each variable. You won't need to be able to read Japanese to do it. I'll also include an example of what each spreadsheet needs to look like, pasted at the end of the instructions. To make it more...

Hourly - Est. Time: Less than 1 week, Less than 10 hrs/week - Posted
We will provide you with a guide containing detailed information on how to scrape certain sites. These sites contain lists of bars in different cities. The information you obtain will be entered in a Google Doc with the following fields: name, location, type, phone number, etc.

Fixed-Price - Est. Budget: $ 100 Posted
Newsletters are received as emails from multiple data sources on the web. They contain content and, in some cases, URLs that point to other sources on the Internet. These emails are to be saved in a folder on the local hard drive. The following steps need to be performed: 1. Extract the content (the actual message, not the metadata) of the emails (stored as .MSG, .HTML, or other files) into .XLS or .CSV format. The content should be stored in a suitable structure. 2. Extract the content from the URLs that point to other sources on the Internet into the same .XLS or .CSV file, with a suitable structure for presenting the content. For example, a newsletter with content about the top 10 telecom trends should ideally have 3 columns: Column 1: keywords (top 10 telecom trends); Column 2: short news item; Column 3: the URL from the newsletter that leads to the web.
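As a rough illustration of the first step, the sketch below pulls the visible text and the outbound links out of one newsletter saved as HTML and lays them out as CSV rows. It is a minimal sketch under assumptions, not the client's specification: the names NewsletterParser, newsletter_to_rows, and rows_to_csv are hypothetical, only the .HTML case is handled (not .MSG), the two-column layout is an assumption, and the linked pages themselves are not fetched.

```python
import csv
import io
from html.parser import HTMLParser


class NewsletterParser(HTMLParser):
    """Collects visible text and http(s) link targets from one HTML newsletter."""

    def __init__(self):
        super().__init__()
        self.text_parts = []
        self.urls = []
        self._skip_depth = 0  # greater than 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1
        elif tag == "a":
            href = dict(attrs).get("href")
            if href and href.startswith(("http://", "https://")):
                self.urls.append(href)

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is not script or style content.
        if self._skip_depth == 0 and data.strip():
            self.text_parts.append(data.strip())


def newsletter_to_rows(html_text):
    """Return one (content, url) row per link, or one row with an empty url."""
    parser = NewsletterParser()
    parser.feed(html_text)
    parser.close()
    content = " ".join(parser.text_parts)
    if not parser.urls:
        return [(content, "")]
    return [(content, url) for url in parser.urls]


def rows_to_csv(rows):
    """Serialize rows to CSV text with an assumed two-column header."""
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(["content", "url"])
    writer.writerows(rows)
    return buffer.getvalue()
```

A keywords column, as described in the example above, would probably need either manual tagging or a separate keyword-extraction step, so it is left out here.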

Hourly - Est. Time: Less than 1 week, 10-30 hrs/week - Posted
I need information, including Chief Executive (CEO) name and Chief Executive (CEO) email, to be found for the UK automotive industry. The company names to research and the information to find are in the attached Excel document. The job needs to be done as accurately as possible, as communications will be sent by email to the respective CEO email addresses. The job should be completed using the exact template and company names in the attached Excel document. Under the heading 'Company Employee Size', please use the size brackets 1-249, 250-3499, 3500+. Please ensure that address lines are split using a comma. The first row has been completed as an example. If the organisation does not have a CEO, please find the relevant information for either the Managing Director or the Human Resources Director. If the organisation is globally active, please make sure that the information is relevant to the UK head of the organisation. The job needs someone who is willing to look through...

Hourly - Est. Time: Less than 1 month, Less than 10 hrs/week - Posted
We need to grab PDF links and descriptions from various sites to build a catalog, which will be used to improve our position in search engine results. This catalog should be kept up to date. Please describe your typical approach, your technical stack, and any additional functionality (captcha, proxy support, UFT support, etc.) that might be needed for this project. What are the hardware requirements for your solution (VM size)? Please estimate the baseline implementation and the cost of adding new sites. We'll need a code sample from you to make an informed decision.
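To make the core of this task concrete, here is a minimal sketch of collecting PDF links and their link text from a single page of already-downloaded HTML. It only illustrates the idea under assumptions: PdfLinkParser and find_pdf_links are hypothetical names, the anchor text stands in for the "description", and downloading pages, captcha handling, proxies, and catalog refreshes are all out of scope.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class PdfLinkParser(HTMLParser):
    """Collects (absolute_url, link_text) pairs for anchors that point at PDFs."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []
        self._current_href = None  # set while inside an <a> that targets a PDF
        self._current_text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                # Resolve relative links against the page URL.
                absolute = urljoin(self.base_url, href)
                # Check the path only, so query strings do not hide the extension.
                if urlparse(absolute).path.lower().endswith(".pdf"):
                    self._current_href = absolute
                    self._current_text = []

    def handle_data(self, data):
        if self._current_href is not None:
            self._current_text.append(data.strip())

    def handle_endtag(self, tag):
        if tag == "a" and self._current_href is not None:
            description = " ".join(part for part in self._current_text if part)
            self.links.append((self._current_href, description))
            self._current_href = None


def find_pdf_links(html_text, base_url):
    """Return a list of (pdf_url, description) tuples found in html_text."""
    parser = PdfLinkParser(base_url)
    parser.feed(html_text)
    parser.close()
    return parser.links
```

Matching on the file extension is a simplification; PDFs served from URLs without a .pdf suffix would need a check of the Content-Type response header instead.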

Hourly - Est. Time: Less than 1 week, Less than 10 hrs/week - Posted
Our requirement: extract the necessary data from a website and insert it into an Excel sheet. We will provide an Excel sheet containing the columns of data to be extracted and listed. Please get back to us with the number of hours required and your rate. Thank you, Venu

Fixed-Price - Est. Budget: $ 20 Posted
I need 125 personal email contacts of HR Managers. Each contact and company has to be unique and from the UK. For each contact I need: name, surname, PERSONAL EMAIL (no generic email), telephone number, company name, and some kind of page on the person or company, i.e. website or LinkedIn. All contacts have to be either HR Managers or Managing Directors. THE COMPANIES HAVE TO EMPLOY BETWEEN 10 AND 200 EMPLOYEES. I will need a sample before awarding the contract. PS: If all goes well, I will create a contract every month.