I am looking for expert , experience python scraper developer with tons of experience in scraping ..
You will be creating script to scrap millions of data , on regular basis .. this will be web based script .. data will be saved in some kind of db ...
Previous experience with amazon , walmart, costco, ebay etc scraping is big plus.
I am not looking for command line or desktop based program. This will be web based program that run on linux AWS or some cloud server.
You should know following advanced techniques to solve scraping issues
1. Able to run multiple scrap / threads in parallel
2. ABle solve ip blocking issue by proxy IP rotation logic
3. Capcha solver
4. Selenium browser automation to login to certain account and do some steps
Here are some idea
1. Logic to accept scraping / browser automation request
2. decode request into scraping / browser request
3. Queue / fifo in case of too many scraping request
4. ip proxy handling logic for scraping request
4. automatically trigger some scraping on daily / timely basis
5. check scraping status, % complete , estimate , check output response/
6. accept request only from cetrain ip .. and ip based request limit
7. Creating API for accepting request and getting data
On average I am looking to pay $50-$100 per scrap / automation website script. And we have 50+ websites that needs to be scraped.
Looking for long time partner.. Once I start making revenue from this, you will also own part of company and generate extra revenue ...
In application please answer following
1. write 'warriors' before application
2. write your previous scraping experience. What websites and how much data. Any experience with amazon ?
3. Have you ever has issue with ip blocking ? how did you handle it ? If you used proxy rotation, from which website did you get proxies.
4. Any experience with selenium or browser automation ?
5. Any experience with creating api for scraped data ?
6. Are you company or individual. If individual , full time freelancer or you have some other full time job ?
7. Send me example of previous / complex scrap / browser automation projects.