We need a web spider (also called a web crawler, web harvesting tool, or web data extraction tool) that we can use to find information about taxi companies.
We need to extract the following data: name of taxi company, address, country, and telephone number. The software should deliver this data in a CSV file. We will host the spider on a Unix server, and the spider should save extracted data to a daily file on the server. We should be able to run many spiders at the same time.
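To illustrate the daily CSV requirement, here is a minimal Python sketch of one way a spider could append records to a per-day file. The column names, the `taxi_<spider_id>_<date>.csv` file name pattern, and the `spider_id` parameter are assumptions made for illustration only. Giving each spider its own daily file is one possible way to let many spiders run at once without writing to the same file.

```python
import csv
import datetime
from pathlib import Path

# Assumed column names; the brief only lists the four data items.
FIELDS = ["name", "address", "country", "telephone"]


def daily_csv_path(output_dir, spider_id, day=None):
    """Build the path of the daily file for one spider (hypothetical naming scheme)."""
    day = day or datetime.date.today()
    return Path(output_dir) / f"taxi_{spider_id}_{day.isoformat()}.csv"


def append_record(output_dir, spider_id, record, day=None):
    """Append one extracted record, writing the header if the file is new or empty."""
    path = daily_csv_path(output_dir, spider_id, day)
    path.parent.mkdir(parents=True, exist_ok=True)
    write_header = not path.exists() or path.stat().st_size == 0
    with path.open("a", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=FIELDS)
        if write_header:
            writer.writeheader()
        writer.writerow(record)
    return path
```

A spider would call `append_record("/var/spider/results", "spider01", {...})` once per company found; the directory and spider name here are placeholders.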
The spider can be written in any language. It must be standalone. It must keep a record of the web pages it has visited, and it should not revisit previously visited sites.
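As a rough sketch of the visited-pages record, the Python example below stores visited URLs in an SQLite database, so the record survives restarts and can be shared by several spiders on the same server. The choice of SQLite, the table layout, and the `VisitedStore` and `normalize_url` names are assumptions for illustration. It tracks individual page URLs; tracking whole sites would mean keying on the host name instead.

```python
import sqlite3
from urllib.parse import urlsplit, urlunsplit


def normalize_url(url):
    """Lower-case scheme and host, drop the fragment, and default the path to '/'."""
    parts = urlsplit(url.strip())
    path = parts.path or "/"
    return urlunsplit((parts.scheme.lower(), parts.netloc.lower(), path, parts.query, ""))


class VisitedStore:
    """Persistent record of visited URLs (hypothetical helper, not a required design)."""

    def __init__(self, db_path):
        self.conn = sqlite3.connect(db_path)
        self.conn.execute(
            "CREATE TABLE IF NOT EXISTS visited ("
            "url TEXT PRIMARY KEY, "
            "visited_at TEXT DEFAULT CURRENT_TIMESTAMP)"
        )
        self.conn.commit()

    def mark_if_new(self, url):
        """Record the URL; return True if it was new, False if already visited."""
        with self.conn:  # commits on success, rolls back on error
            cursor = self.conn.execute(
                "INSERT OR IGNORE INTO visited (url) VALUES (?)",
                (normalize_url(url),),
            )
        return cursor.rowcount == 1
```

A spider would fetch a page only when `mark_if_new(url)` returns `True`, which both records the visit and prevents a second one.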
Deliverables will be the spider software and three days' worth of results files that prove that the spider works.
To be considered, please provide us with an outline of how you will develop the spider, the language you will use, and a timeline that we can use to measure progress.