You've landed at the right place. oDesk is now Upwork. Learn about the new platform.

Web Crawler Jobs

59 were found based on your criteria {{ paging.total | number:0 }} were found based on your criteria

show all
  • Hourly ({{ jobTypeController.getFacetCount("hourly") | number:0}})
  • Fixed Price ({{ jobTypeController.getFacetCount("fixed") | number:0}})
show all
only
only
only
show all
only
only
only
only
only
show all
only
only
only
Hourly - Est. Time: Less than 1 week, Less than 10 hrs/week - Posted
I am after a list of as many the contacts contacts (employees) and their positions, emails as possible. Prefer marketing managers and all C-level execs. These guys have something like it but membership required and I don't think I get access to the details I need. http://www.icsc.org/directories/global-shopping-center-directory PLEASE NOTE Your solution must be an automated solution, I am not going to pay for hours and hours of manual work here as it can be completely automated by someone with the skills.
Hourly - Est. Time: 3 to 6 months, Less than 10 hrs/week - Posted
We launched our eCommerce business 3 months ago and our business is growing VERY FAST and we're looking for some help! We are looking for someone to source products for our Amazon business for private labeling and/or products to sell on Amazon through approaches such as purchasing direct from a supplier like (Alibaba). The categories we are interested in are as follows: Health & Beauty, Travel, Home, Tools & Home Improvement, Automotive or Patio, Lawn & Garden, Pet Supply (#1 option). We would like someone that is proficient in at least one or two of these categories. Meaning you are very familiar with and have done searching in these categories extensively. **We will provide the tools to help + Training on this** We are most interested in your experience in dealing with Alibaba. Specifically, we are looking to source a product that we can make our own i.e. (private labeling ) or sell on Amazon without private labeling. When you find a product we ask that you check the top...
Fixed-Price - Est. Budget: $ 500 Posted
We are looking to build a database of information for restaurants in Hong Kong we expect between 50,000 and 60,000 records, . We are looking to have the collect the following data. 1. Name 2. Name in Chinese (if available) 3. Phone Number 4. Address 5. Address in Chinese (if available) 6. Open hours 7. Average Price Range 8. Cuisine 9. Average User rating There could be a number of ways that you could gain this information from web searches, phone directories also OpenRice.com has a large number of records. We would like the data in JSON format.
Fixed-Price - Est. Budget: $ 250 Posted
Require a data list that fits the below criteria... Western Nationality Management Level and Above English Speaking Live in Dubai Work in Dubai Must Provide following details... First Name Surname Mobile/Cell Number Company Name Job Title Nationality Require ongoing data provider for weeks, months, years if quality data is provided. Fee is negotiable dependent on quality. Require a sample list of 50 contacts Ideal quantity would be 200 names per week however this can be across multiple freelancers
Fixed-Price - Est. Budget: $ 200 Posted
We are seeking a perfectionist to scrape 100,000 + records from the MLS that will be broken down in different towns, cities, and areas of Illinois, Wisconsin, Indiana, etc. The information will be in a spreadsheet format that has Listing agent and Buyer Agent Contact information which will include Name, Office/Phone number, Cell number, email, fax, etc. I want different tabs for each area with information on properties closed and the buying and seller realtors contact info. I also want a tab with all areas and realtors under one tab. I also want a cell that tells me how many duplicates there are for any real estate agent. Here is a copy of a video I created to show the process http://screencast-o-matic.com/watch/coi0QAfUxe.
Hourly - Est. Time: 3 to 6 months, Less than 10 hrs/week - Posted
Need A data extraction, web scraping expert needed for ongoing data mining activities. Need to be able to use import.io and or scrapy.org for extracting data. I will give you instructions on the data I need and I need you to build a crawler/extractor that will find the information and present it in csv format I also need you to find information on latest blogs, social media mentions, etc of my topics and people and compile it into data I can use. Respond with what you can do Software you work with work you have completed time frame to start getting data
Fixed-Price - Est. Budget: $ 300 Posted
### Expert Level ### Please reply with real world examples ?? If you have experience with a bloom filter please let me know. Hi I have a version of the application already built but isn't performing as I had hoped. I need a developer to help me improve the application in terms of performance and accuracy. Below are what I want to happen so be clear in your replies that you can perform this type of work. - Web scraping sort of works, but really needs to become multi-threaded and a lot more robust (currently breaks lots), the basic stack is below but I'm not adverse to other technologies being used. - The crawler aims to collect 100's millions of rows of data from numerous content networks, so the crawler needs to be able to manage that number of rows and the complications that bring to the table. - Post content to social networks, current app doesn't post in the correct format (weird categories) so that needs to be fixed. Basic headlines - Create multi-threaded...
Hourly - Est. Time: 1 to 3 months, 10-30 hrs/week - Posted
The task is to implement a focused Web crawler component comprising an API and a Web-based GUI. The focused crawler's objective is to gather user-specified pieces of data from Web pages using two strategies: (1) by carrying out a set of Web search engine queries for user-provided query terms, by downloading Search Engine Result Pages, and optionally following the hyperlinks contained therein; and (2) using a set of user-provided seed URLs, by following the hyperlinks on the Web pages identified by these URLs, using a breadth-first search and a user-specified depth. Rather than implement from scratch, the project should be pursued as wrapping existing open-source components such as crawler libraries and Web search engine APIs with additional layers of functionality, and in some cases, providing a convenience layer that hides complexity. Skills/technologies: Java, Heritrix, WARC, search engine technology, crawling, Yahoo BOSS
Fixed-Price - Est. Budget: $ 150 Posted
I need scrape and parser to xml akomantoso version, of http://www.concejodebucaramanga.gov.co/descargas.php?seccion=NQ==&categoria=MQ== and http://www.concejodebucaramanga.gov.co/descargas.php?seccion=NQ==&categoria=OA== I have 2 scripts templates for that only need create regualar expressions for that council chamber. Examples: https://comision6senado.files.wordpress.com/2013/03/acta-10-12-septiembre-18-de-2012.pdf converted to http://senado.felipeurrego.com/comisi%C3%B3n-sexta-senado/2012/septiembre/acta-no-10-18-09-2012.an I need test experience and skills with http://www.concejodecali.gov.co/documentos.php?id=502
Fixed-Price - Est. Budget: $ {{ job.amount.amount | number:0 }} Open to Suggestion Hourly - Est. Time: {{ [job.duration, job.engagement].join(', ') }} - Posted
{{ job.description }}