I am looking for expert , experience python scraper developer with tons of experience in scraping ..
You will be creating script to scrap millions of data , on regular basis .. this will be web based script .. data will be saved in some kind of db ...
Previous experience with amazon , walmart, costco, ebay etc scraping is big plus.
I am not looking for command line or desktop based program. This will be web based program that run on linux AWS or some cloud server.
You should know following advanced techniques to solve scraping issues
1. Able to run multiple scrap / threads in parallel
2. ABle solve ip blocking issue by proxy IP rotation logic
3. Capcha solver
4. Selenium browser automation to login to certain account and do some steps
Here are some idea
1. Logic to accept scraping / browser automation request
2. decode request into scraping / browser request
3. Queue / fifo in case of too many scraping request
4. ip proxy handling logic for scraping request
4. automatically trigger some scraping on daily / timely basis
5. check scraping status, % complete , estimate , check output response/
6. accept request only from cetrain ip .. and ip based request limit
7. Creating API for accepting request and getting data
On average I am looking to pay $50 per scrap / automation website script. And we have 50+ websites that needs to be scraped.
Commitment to deadline and good communication is must . If you are working on too many other projects, dont apply.
This job is for 10 different amazon page scrap / browser automation scripts.
1. write 'warriors' before application
2. write your previous scraping experience. What websites and how much data. Any experience with amazon, walmart ?
3. Have you ever has issue with ip blocking ? how did you handle it ? If you used proxy rotation, from which website did you get proxies.
4. Any experience with selenium or browser automation ?
5. Send me example of previous / complex scrap / browser automation projects.