Web data scraping from a local directory of office buildings

Web & Mobile Development Other - Web & Mobile Development Posted 2 years ago

Fixed Price

Delivery by July 4, 2013

$63.00

Budget

Details

Need to scrap an office building directory site which claims to have 50k records. Data are tidily shown up in templated pages. All pages are linked from a rigid 2-tier category structure with pagination. The top tier has only 3 categories.

You will need to write a php scraper to crawl all pages and write the data onto a utf-8 tab-delimited text file. For 50k records your may set the bot to start crawling from each given top-tier category. So the 50k records will be split into 3 txt files.

Sample page: primeoffice dot com dot hk slash hong_kong_office slash building_index slash Building_Profile.asp?B=10422

PS: First image URL of each building is also needed.

Open Attachment

Skills Required:

Client Activity on this Job

Last Viewed: 2 years ago

Proposals: 29

Hired: 1


About the Client

(4.95) 17 reviews

Hong Kong
Kowloon Bay 10:59 PM

28 Jobs Posted
54% Hire Rate, 1 Open Job

$941 Total Spent
25 Hires, 0 Active

$10.00/hr Avg Hourly Rate Paid
9 Hours

Member Since Jun 28, 2012