Web data scraping from a local directory of office buildings

Web, Mobile & Software Dev Other - Software Development Posted 3 years ago

Fixed Price

Budget: $63
Delivery by July 4, 2013

Start Date

July 2, 2013


Need to scrap an office building directory site which claims to have 50k records. Data are tidily shown up in templated pages. All pages are linked from a rigid 2-tier category structure with pagination. The top tier has only 3 categories.

You will need to write a php scraper to crawl all pages and write the data onto a utf-8 tab-delimited text file. For 50k records your may set the bot to start crawling from each given top-tier category. So the 50k records will be split into 3 txt files.

Sample page: primeoffice dot com dot hk slash hong_kong_office slash building_index slash Building_Profile.asp?B=10422

PS: First image URL of each building is also needed.

  • Other Skills:

Activity on this Job

Last Viewed by Client: 3 years ago

Hired: 1

About the Client

(4.95) 17 reviews

Hong Kong
Kowloon Bay 09:38 AM

28 Jobs Posted
54% Hire Rate, 1 Open Job

$941 Total Spent
25 Hires, 0 Active

$10.00/hr Avg Hourly Rate Paid
9 Hours

Member Since Jun 28, 2012