Only freelancers located in the U.S. may apply.
Needs to hire 20 Freelancers
Mapzen is looking for help building web scrapers that generate a large list of business listings for All The Places, the open data project. The listings come from websites that have 'store locator' pages, such as those of restaurants, gas stations, and retailers.
The project is built using Scrapy, a Python-based web scraping framework. Each target website gets its own spider, which extracts the interesting details about each location and outputs the results in a useful format.
To scrape a new website for locations, you'll want to create a new spider. You can copy an existing spider or start from scratch, but the result is always a Python class that has a process() function that yields GeojsonPointItems. The Scrapy framework does the work of outputting GeoJSON based on the objects that the spider generates.
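The pattern described above can be sketched roughly as follows. This is a standard-library-only illustration, not the project's actual code: PointItem is a hypothetical stand-in for GeojsonPointItem, ExampleSpider does not subclass a real Scrapy spider, the record field names are invented, and to_feature() only approximates the GeoJSON output step that the framework handles in the real project. Note that Scrapy's own default callback is named parse(); the method name process() here simply follows the description above.

```python
import json
from dataclasses import dataclass, field


@dataclass
class PointItem:
    """Hypothetical stand-in for the project's GeojsonPointItem."""
    ref: str
    lat: float
    lon: float
    properties: dict = field(default_factory=dict)


class ExampleSpider:
    """Stand-in for a spider; a real one would subclass scrapy.Spider."""
    name = "example_store"

    def process(self, records):
        # Yield one item per location found in the (already parsed) records.
        for rec in records:
            yield PointItem(
                ref=rec["id"],
                lat=rec["lat"],
                lon=rec["lng"],
                properties={"name": rec["name"]},
            )


def to_feature(item):
    """Convert an item to a GeoJSON Feature (coordinates are lon, lat)."""
    return {
        "type": "Feature",
        "id": item.ref,
        "geometry": {"type": "Point", "coordinates": [item.lon, item.lat]},
        "properties": item.properties,
    }


def to_feature_collection(items):
    """Wrap all yielded items in a GeoJSON FeatureCollection."""
    return {
        "type": "FeatureCollection",
        "features": [to_feature(item) for item in items],
    }


if __name__ == "__main__":
    # Invented sample data standing in for a scraped store locator page.
    sample = [{"id": "1", "name": "Main St", "lat": 40.0, "lng": -75.0}]
    collection = to_feature_collection(ExampleSpider().process(sample))
    print(json.dumps(collection, indent=2))
```

Keeping the spider as a generator of simple items, with serialization handled elsewhere, is what lets each spider stay focused on one website's layout.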
Full details on the project, including more on the workflow: