You've landed at the right place. oDesk is now Upwork. Learn about the new platform.

Web Crawler Jobs

53 were found based on your criteria {{ paging.total | number:0 }} were found based on your criteria

show all
  • Hourly ({{ jobTypeController.getFacetCount("hourly") | number:0}})
  • Fixed Price ({{ jobTypeController.getFacetCount("fixed") | number:0}})
show all
only
only
only
show all
only
only
only
only
only
show all
only
only
only
Looking for the Team App?
Download the New Upwork Team App
Fixed-Price - Intermediate ($$) - Est. Budget: $100 - Posted
Hello, I'm looking for someone to download all prices for every part, for all vehicle year/make/models. From an automotive parts site such as http://www.rockauto.com/en/catalog/ I would like it to be summarized in an excel spreadsheet, or itemized database that can be downloaded. Please let me know A) How you plan to do this (crawler, manually, etc) B) How long it will take you C) Price you will charge (can be more or less than $100)
Skills: Web Crawler Web scraping
Fixed-Price - Intermediate ($$) - Est. Budget: $250 - Posted
I need help in scrapping university websites sites for email address. This is a low weight work. The code can be written in any language, preferably in Ruby. I will need a csv file in the format, i specify. I will also specify the sites and techniques to scrape as i have done this before. There are close to 30 sites, you will be needing to scrape from. I will give you the university name and you will need to get names from university facebook group or use common names that i can provide. Find the student directory page for that university and use that for searching and fetching the email and other meta information from the results
  • Number of freelancers needed: 2
Skills: Web Crawler Data scraping HTML JavaScript
Hourly - Intermediate ($$) - Est. Time: 1 to 3 months, 10-30 hrs/week - Posted
This job is focused on advancement of the experience that thousands of users get navigating, browsing, searching and comparing the content offered through our proprietary technology platform. The end-result (output of the ontology model) will be a set of intuitive and comprehensive multi-level navigation structures (hierarchical taxonomies, facets) for browsing, searching and tagging the content offered to our clients. The end-task is envisioned to be primarily achieved with the usage of Semantic Web concepts and data (LOD and other available SKOS) as per Semantic Web standards. The task most likely will require knowledge/learning of several RDF-based schemas (Resume RDF, HRM Ontology, HR-XML, FOAF, SCIOC, Schema.org) and usage of the W3C’s Semantic Web technology stack components (SPARQL, Protege, Semantic resoners). Key tasks: - Definition of RDF Schema and ontologies based on several existing RDF Schemas (Resume RDF, HRM Ontology, HR-XML, FOAF, SCIOC, Schema.org, etc.) - linking available LOD and SKOS data sets, building several core multi-level hierarchical taxonomies (magnitude of tens of thousands of elements) comprehensively describing the content in our system - Rule-based processing and linking of multiple existing, as well as obtained sets of data using semantic reasoners - Definition, structuring and optimization of hierarchical data sets, definition and maintenance of hierarchical relationships of particular terms (facets) - Research (independent, as well as guided by management team) on publicly available SKOS and LOD sets related to the content of the platform from public (international standards, patent databases, public and government databases, various organizational, available XML datasets, etc.), as well as acquired proprietary sources - Retrieval and ETL of multiple additional data sets from multiple sources - Tagging, Classification, entity extraction - Working with management team to maintain and advance particular segments of defined taxonomies Optional Stretch-Tasks (Depending on Candidate's Qualifications): - Automatic analysis of content, extraction of semantic relationships - Auto-tagging, auto-indexing - Integration and usage of selected IBM Watson services for content analysis - Integration with Enterprise Taxonomy Management platforms (Mondeca, Smartlogic, PoolParty, or others) This job will initially require commitment of 15-20 hours per week over 3-6 months engagement. Interaction with a responsible manager will be required at least twice a week over Skype and Google Hangouts. Longer-term cooperation is possible based on the results of the initial engagement. Required Experience: - Detailed knowledge of Semantic Web concepts and techniques - Intimate familiarity with W3C’s Semantic Web technology stack (RDF, SPARQL, etc.) - Hands-on experience with LOD (DB Pedia and others) and various SKOS - Experience of modeling data based on various RDF schemas (Resume RDF, HRM Ontology, HR-XML, FOAF, SCIOC, ISO 25964, etc.) - Knowledge of common open-source ontology environments and tools (Mediawiki, Protege, etc.) or other enterprise-grade ontology tools (Synaptica, DataHarmony, PoolParty, Mondeca, Top Braid, etc.) - Experience of work with semantic reasoners - Prior experience of content management and maintenance of taxonomies for consumer or e-commerce applications Additional Preferred Experience: - Background in Library and Information Science (MLIS), Knowledge Management, Information Management, Linguistics or Cognitive Sciences - Familiarity with common classification systems - Experience working with catalog and classification systems and creation of thesauri - Auto-tagging, auto-classification, entity extraction
Skills: Web Crawler Web Crawling Data Analytics Data Entry
Fixed-Price - Expert ($$$) - Est. Budget: $50 - Posted
I'm going to Angola, a Portuguese speaking country, and need someone to conduct some research (within financial services/ banking and telecommunications) on this country, some important individuals and other aspects of these industries. A lot of this information will be in Portuguese and I need someone with excellent research skills - web research/ deep mining - and native (or near native) Portuguese: written, reading and speaking. Ideally the individual will have lived or live in Angola - though this is not completely necessary. I'll provide you with the list of names (approx. 10) , companies/ societies (approx. 10) and specific details on the aforementioned industries. I'd need this work completed by: 16th February. It should be presented in a table either using MS Word or Excel. All to be written in English. Please ask any needed questions. Price given is negotiable depending on skill, expertise, and speed of delivery.
Skills: Web Crawler Data scraping Internet research Market research
Hourly - Intermediate ($$) - Est. Time: Less than 1 week, 10-30 hrs/week - Posted
i will provide company name and potential job titles to look for per company you will find exact name, job title add to columns in existing spreadsheet example of what i will provide: company name: google company domain: google.com title(s) to look for: ceo you will return in two columns (column A = name1, column b = title1): name, title: sundar pichai, ceo this must be done in bulk and automatically as new rows are added to sheet if i have a list of 100 company and/or domains you should be able to return accurate results in minutes or seconds for all results this is not a manual job you can create scraper, crawler, api whatever i just need fast, accurate, consistent results
Skills: Web Crawler Web Crawling Data scraping Scrapy
Hourly - Expert ($$$) - Est. Time: Less than 1 month, 30+ hrs/week - Posted
I am in need of hotel, restaurants lists with Business, Email, Website (optional). Countries: USA,UK, France, Italy, Spain Turkey, UAE, Australia, Japan, Brasil. Email type: Manager's emails or Generic e.g. (reservation@; info@; frontdesk@; reception@; rsv@; etc..) ONLY APPLY IF YOU HAVE OR CAN GET MORE THAN 1000 CONTACTS A DAY. We consider acquiring existing databases too, only large ones.
  • Number of freelancers needed: 3
Skills: Web Crawler Data mining Data scraping Email Marketing
Hourly - Expert ($$$) - Est. Time: Less than 1 month, 30+ hrs/week - Posted
I am in need of hotel database with Hotel Name, Email. Countries: USA,UK, France, Italy, Australia, Japan, Brasil. Email type: generic e.g. (reservation@; info@; frontdesk@; reception@; rsv@; etc..) Existing databases are accepted too.
  • Number of freelancers needed: 3
Skills: Web Crawler Data mining Data scraping Email Marketing
Hourly - Expert ($$$) - Est. Time: Less than 1 month, 10-30 hrs/week - Posted
I need a professional to scrape Commercial Real Estate listings from thousands of websites. I will provide the website lists. The data will be in different levels of the website. I need a dynamic crawl that can locate the data for every website at any level and scrape it. Below are the fields I'm looking for. In addition, I will provide a sample website list (around 500) to test when I hire the appropriate professional. The output will be primarily be CSV Data to Scrape: Property Address price property type (Exclude Residential and Business for Sale) property description Building size (Sq ft) Sale / Lease (rent) Property Image broker name Broker Phone # Broker Email Website copyright (if has copyright need to flag "Y") Privacy (if has privacy need to flag "Y") terms and conditions (if has Terms and Conditions need to flag "Y") Original Scrape Date
Skills: Web Crawler Data mining Data scraping Web scraping
Fixed-Price - Entry Level ($) - Est. Budget: $200 - Posted
We need to develop a PHP script that crawls craiglist website. Must work as a bot. The script must do: 1) The bot must crawl all cities in craiglist on the selected craiglist section (vacation rentals) and retrieve all the data of each apartment (including email, photos, text, features and owner data). 2) The bot will insert the crawled data into our database (mysql). - First creating the user with the data in craiglist and then the apartment data. 3) Send email to inform about the listing to the owner. Our site is developed in CakePHP and the coder can help with the step 2).
  • Number of freelancers needed: 2
Skills: Web Crawler CakePHP Web Crawling MySQL Programming
Looking for the Team App?
Download the New Upwork Team App
Fixed Price Budget - ${{ job.amount.amount | number:0 }} to ${{ job.maxAmount.amount | number:0 }} Fixed-Price - Est. Budget: ${{ job.amount.amount | number:0 }} Open to Suggestion Hourly - Est. Time: {{ [job.duration, job.engagement].join(', ') }} - Posted
Skills: {{ skill.prettyName }}
Looking for the Team App?
Download the New Upwork Team App