You will get data from any websites in excel/json format
Project details
I love doing web scraping and I have almost 5 years of experience. In that time, I have scraped almost all types of websites, Below I've mentioned a few important websites I've scraped till now. I love to take on challenges and would be willing to scrape new and exciting websites.
USE MULTIPROCESSING TO SPEED UP THE SCRAPING
SCRAPED MORE THAN 44 MILLION DATA POINTS
✅* Packages Use for Scraping:
Requests, BeautifulSoup, Selenium, Scrapy, Twint, Instagram-scraper
✅ Experience with Scraping
a. Social Media: Twitter, Instagram, Facebook
b. App Reviews: Google Play, App Store
c. E-commerce: Shopify, Amazon
d. Real Estate: Realtor.com, Har.com, Zillow
e. Business: Yelp.com
f. Restaurant: Getir
g. Crypto:Coinmarketcap, Opensea
h. News: cointelegraph.com
i. Jobs: Indeed, Glassdoor
j. Investors: signal.nfx.com
k. Miscellaneous: Openwheathermap.com
And more...
✅ Data Extraction through Official APIs
✅ Data Cleaning and stored in the required format file.
USE MULTIPROCESSING TO SPEED UP THE SCRAPING
SCRAPED MORE THAN 44 MILLION DATA POINTS
✅* Packages Use for Scraping:
Requests, BeautifulSoup, Selenium, Scrapy, Twint, Instagram-scraper
✅ Experience with Scraping
a. Social Media: Twitter, Instagram, Facebook
b. App Reviews: Google Play, App Store
c. E-commerce: Shopify, Amazon
d. Real Estate: Realtor.com, Har.com, Zillow
e. Business: Yelp.com
f. Restaurant: Getir
g. Crypto:Coinmarketcap, Opensea
h. News: cointelegraph.com
i. Jobs: Indeed, Glassdoor
j. Investors: signal.nfx.com
k. Miscellaneous: Openwheathermap.com
And more...
✅ Data Extraction through Official APIs
✅ Data Cleaning and stored in the required format file.
Data Tool
PythonWhat's included
Service Tiers |
Starter
$30
|
Standard
$50
|
Advanced
$100
|
---|---|---|---|
Delivery Time | 1 day | 1 day | 3 days |
Number of Pages Mined/Scraped | 1 | 20 | 50 |
Number of Sources Mined/Scraped | 100000 | 100000 | 1000000 |
Number of Revisions | Unlimited | Unlimited | Unlimited |
Optional add-ons
You can add these on the next page.
Fast Delivery
+$20
Additional Page Mined/Scraped
+$5
Additional Source Mined/Scraped
+$5Frequently asked questions
66 reviews
(64)
(2)
(0)
(0)
(0)
SG
Shreyans G.
Jul 20, 2022
BI
Borna I.
May 28, 2023
Signal Data Collection
Arjit did a great job and delivered a quality end result! He is very diligent, quick to respond and processing input, clear and pleasant communicator. I would highly recommend Arjit.
PS
Paul S.
Apr 17, 2023
Stata to python code translation
DD
Dragesco D.
Mar 28, 2023
Job Scraping on Website
Great job, every time !
AA
Anthony A.
Mar 7, 2023
Python Web Scraper
SB
Scott B.
Mar 6, 2023
Python Developer - Zillow Scraper
Arjit delivered the work quickly and listened to my feedback and concerns. I will definitely keep Arjit in mind for future Python and web scraping projects
About Arjit
Data Science | Web Scraping | Software Engineer | Data Analyst
90%
Job Success
Allahabad, India - 8:56 am local time
✅ AREAS OF EXPERTISE
Data Science, Data Engineering, Data Visualization, Machine Learning, Deep Learning, Data Mining, Web Scraping
✅* Packages Use for SCRAPING:
Requests, BeautifulSoup, Selenium, Scrapy, Twint, Instagram-scraper
✅ Experience with Scraping
a. Social Media: Twitter, Instgram, Facebook
b. App Reviews: Google Play, App Store
c. E-commerce: Shopify, Amazon
d. Real Estate: Realtor.com, Har.com, zillow
e. Business: Yelp.com
f. Restaurant: Getir
g. Crypto:Coinmarketcap, Opensea
h. News: cointelegraph.com
i. Jobs: Indeed, Glassdoor
j. Investors: signal.nfx.com
k. Miscellaneous: Openwheathermap.com
✅ TECHNICAL SKILLS
Programming Languages - Python, C/C++, Java, R
Framework- Flask, Django
✅ Packages and tools - Pandas, NumPy, SciPy, Scikit-Learn, NLTK, matplotlib, Seaborn, ggplot2, Pyspark, Keras, Tensorflow, BeutifulSoup, Selenium, Scrapy, Requests
Good with API development
✅ Machine Learning - Dimensionality reduction - Principal Component Analysis(PCA), K-Nearest Neighbors(KNN), Support Vector Machines(SVM), Naïve Bayes(NB), Decision Trees(DT), Random Forest(RF), Gradient Boosting Machines(GBM), XGBoost, Deep Learning – Deep Neural Networks & Convolution Neural Networks (CNN), Hierarchical & K-Means clustering, RFM, Market Basket Analysis - Association Rule Mining
✅ Statistical Modelling - Linear Regression, Logistic Regression, Multi-nominal Logistic Regression, Regularization - Ridge Regression, Lasso Regression, Time Series forecasting
✅ Text Mining - Text Pre-Processing, Information Retrieval, Text Classification, Topic Modeling (Latent Dirichlet Allocation(LDA), Non-negative Matrix Factorization(NMF)), Text Clustering, Sentiment Analysis
✅ DATABASE
SQL, PostgresAQl, MongoDB, DynamoDB
Steps for completing your project
After purchasing the project, send requirements so Arjit can start the project.
Delivery time starts when Arjit receives requirements from you.
Arjit works on your project following the steps below.
Revisions may occur after the delivery date.
Inspecting Website
This step involves reviewing websites, checking their HTML code, requests, and some javascript code. If they're using APIs to get data from their database then I'll intercept that API otherwise checks the HTML classes that contain required data.
Writing Script
I'll write the Python script in which By Using API: I'll request the API for the specific information. By HTML: Request the HTML inside bs4 and extract the data by classes. The above 2 functions will work on multiprocessing to speed up the process.