You will get PubMed to Excel extraction with Python NLP (clean & deduped)


Project details
I turn PubMed search results into clean, analysis-ready Excel/CSV. You’ll get exactly the fields you need—PMID, title, authors, journal, year, DOI, abstract, MeSH, affiliations—plus optional NLP tags (drug/disease/outcome). I work API-first with Python (pandas, spaCy/scispaCy) so the process is reproducible, deduped, and documented.
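As a rough illustration of the API-first approach, here is a minimal sketch of turning PubMed-style XML into flat rows. It assumes the records arrive in the `PubmedArticleSet` XML layout returned by NCBI's efetch endpoint; the sample record, the `text_of` and `parse_pubmed_xml` helpers, and the column names are hypothetical, and the sketch uses only the standard library so it runs without a network call. It is not necessarily the exact pipeline used for delivery.

```python
import xml.etree.ElementTree as ET

# Hand-written sample in the shape of PubMed efetch XML (illustrative, not a real record).
SAMPLE_XML = """<PubmedArticleSet>
  <PubmedArticle>
    <MedlineCitation>
      <PMID>00000001</PMID>
      <Article>
        <Journal>
          <JournalIssue><PubDate><Year>2020</Year></PubDate></JournalIssue>
          <Title>Example Journal</Title>
        </Journal>
        <ArticleTitle>An example title</ArticleTitle>
        <Abstract><AbstractText>Example abstract text.</AbstractText></Abstract>
        <AuthorList>
          <Author><LastName>Doe</LastName><ForeName>Jane</ForeName></Author>
          <Author><LastName>Roe</LastName><ForeName>Richard</ForeName></Author>
        </AuthorList>
      </Article>
    </MedlineCitation>
    <PubmedData>
      <ArticleIdList>
        <ArticleId IdType="pubmed">00000001</ArticleId>
        <ArticleId IdType="doi">10.0000/example.1</ArticleId>
      </ArticleIdList>
    </PubmedData>
  </PubmedArticle>
</PubmedArticleSet>"""


def text_of(node, path):
    """Return the stripped text under `path`, or "" when the element is absent."""
    found = node.find(path)
    return "".join(found.itertext()).strip() if found is not None else ""


def parse_pubmed_xml(xml_text):
    """Flatten each PubmedArticle element into one dict per record."""
    rows = []
    for art in ET.fromstring(xml_text).iter("PubmedArticle"):
        authors = [
            f"{text_of(a, 'ForeName')} {text_of(a, 'LastName')}".strip()
            for a in art.iter("Author")
        ]
        # Structured abstracts have several AbstractText sections; join them.
        abstract = " ".join(
            "".join(t.itertext()).strip() for t in art.iter("AbstractText")
        )
        rows.append({
            "pmid": text_of(art, "MedlineCitation/PMID"),
            "title": text_of(art, ".//ArticleTitle"),
            "authors": "; ".join(authors),
            "journal": text_of(art, ".//Journal/Title"),
            # Some records carry MedlineDate instead of Year; those stay blank here.
            "year": text_of(art, ".//JournalIssue/PubDate/Year"),
            "doi": text_of(art, "PubmedData/ArticleIdList/ArticleId[@IdType='doi']"),
            "abstract": abstract,
        })
    return rows


rows = parse_pubmed_xml(SAMPLE_XML)
```

The resulting list of dicts can be handed to `pandas.DataFrame(rows)` and written out with `to_excel` or `to_csv`.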
Typical flow: you share a topic or PMIDs and the columns you want → I send a small sample for sign-off → I run the full batch and deliver the dataset (and code/README in higher tiers). A light QA pass (spot checks & summary stats) is included so you can trust the numbers.
No meetings required—clear inputs are enough. Privacy respected; I only use open sources or files/credentials you provide. Great for literature reviews, evidence maps, and internal dashboards.
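The light QA pass mentioned above could be as simple as the following sketch: it counts empty fields, flags repeated PMIDs, and reports the year range. The `qa_summary` function, its default field list, and the row layout (one dict per article) are assumptions for illustration only.

```python
from collections import Counter


def qa_summary(rows, required=("pmid", "title", "year", "doi", "abstract")):
    """Summarise completeness and duplication for a list of article dicts."""
    # Count rows where each required field is missing, None, or blank.
    missing = {
        field: sum(1 for r in rows if not str(r.get(field) or "").strip())
        for field in required
    }
    pmid_counts = Counter(r.get("pmid") for r in rows if r.get("pmid"))
    duplicate_pmids = sorted(p for p, n in pmid_counts.items() if n > 1)
    # Only purely numeric years take part in the range check.
    years = [int(r["year"]) for r in rows if str(r.get("year") or "").isdigit()]
    return {
        "rows": len(rows),
        "missing": missing,
        "duplicate_pmids": duplicate_pmids,
        "year_range": (min(years), max(years)) if years else None,
    }


# Illustrative rows, not real records.
example_rows = [
    {"pmid": "1", "title": "A", "year": "2019", "doi": "10.0000/a", "abstract": "x"},
    {"pmid": "2", "title": "B", "year": "2021", "doi": "", "abstract": ""},
    {"pmid": "1", "title": "A", "year": "2019", "doi": "10.0000/a", "abstract": "x"},
]
summary = qa_summary(example_rows)
```

A summary like this can be pasted into the README so the counts in the delivered file are easy to verify.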
Machine Learning Tools
BERT, Google Sheets, Microsoft Excel, NLTK, NumPy, pandas, Python, PyTorch, scikit-learn, Scrapy, SQL, Tableau, Tesseract OCR

What's included

| Service Tiers | Starter ($79) | Standard ($229) | Advanced ($499) |
|---|---|---|---|
| Delivery Time | 2 days | 4 days | 7 days |
| Number of Revisions | 1 | 2 | 3 |
| Number of Model Variations | 0 | 0 | 0 |
| Number of Scenarios | 0 | 0 | 0 |
| Number of Graphs/Charts | 1 | 0 | 0 |
| Model Validation/Testing | | | |
| Model Documentation | | | |
| Data Source Connectivity | - | - | |
| Source Code | | | |

Optional add-ons
| Add-on | Extra time | Price |
|---|---|---|
| Fast Delivery | | +$49 - $219 |
| Additional Revision | | +$25 |
| Additional Model Variation | + 2 Days | +$120 |
| Additional Scenario | + 1 Day | +$80 |
| Additional Graph/Chart | + 1 Day | +$45 |
| Data Source Connectivity | + 2 Days | +$150 |
| Python code + README | + 1 Day | +$90 |
| Extra data source | + 2 Days | +$150 |

Frequently asked questions
About Sara
Expert Web Scraper | Python, Scrapy & Selenium | Clean Data Extraction
London, United Kingdom
Whether you need to monitor competitor prices, generate business leads, or gather data for market research, I have the skills to get the job done accurately and efficiently.
My core services include:
✅ Custom Web Scraping & Crawling Solutions
✅ Data Extraction from Dynamic & JavaScript-heavy websites
✅ Bypassing Anti-Scraping Measures (CAPTCHA, IP Blocks)
✅ Data Cleaning and Structuring (CSV, JSON, Excel, SQL)
✅ Automated Data Entry & API Integration
My Tech Stack:
▶️ Python
▶️ Scrapy, Selenium, Playwright, Beautiful Soup, Requests
Why choose me?
🔹 I deliver 100% accurate, ready-to-use data.
🔹 I ensure clear communication and provide regular updates on the project's progress.
🔹 I respect website terms and ethical scraping practices.
Click the "Invite" button to send me a message. I'd be happy to discuss your project needs in a free consultation. Let's turn raw web data into valuable insights!
Steps for completing your project
After purchasing the project, send requirements so Sara can start the project.
Delivery time starts when Sara receives requirements from you.
Sara works on your project following the steps below.
Revisions may occur after the delivery date.
Data Understanding & Cleaning
I will explore the dataset, clean missing values, remove duplicates, and prepare the data for analysis.
Data Analysis / Modeling
I will perform statistical analysis, build models (if needed), and generate insights.
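For the cleaning step above, one possible way to handle missing values and duplicates is sketched below. It assumes each record is a dict with `pmid` and `doi` keys; `normalize_doi` and `clean_and_dedupe` are hypothetical helper names, and the rule "same PMID or same normalized DOI means duplicate, keep the first" is an illustrative choice rather than a fixed policy.

```python
DOI_PREFIXES = (
    "https://doi.org/",
    "http://doi.org/",
    "https://dx.doi.org/",
    "http://dx.doi.org/",
    "doi:",
)


def normalize_doi(doi):
    """Lowercase a DOI and strip common URL or 'doi:' prefixes."""
    doi = (doi or "").strip().lower()
    for prefix in DOI_PREFIXES:
        if doi.startswith(prefix):
            doi = doi[len(prefix):]
            break
    return doi.strip()


def clean_value(value):
    """Turn None into "" and trim whitespace on strings; leave other types alone."""
    if value is None:
        return ""
    return value.strip() if isinstance(value, str) else value


def clean_and_dedupe(rows):
    """Normalize fields, then keep the first row for each PMID or DOI."""
    seen_pmids, seen_dois, kept = set(), set(), []
    for row in rows:
        clean = {key: clean_value(value) for key, value in row.items()}
        clean["doi"] = normalize_doi(clean.get("doi"))
        pmid, doi = clean.get("pmid", ""), clean["doi"]
        if (pmid and pmid in seen_pmids) or (doi and doi in seen_dois):
            continue  # already seen under the same PMID or DOI
        if pmid:
            seen_pmids.add(pmid)
        if doi:
            seen_dois.add(doi)
        kept.append(clean)
    return kept


# Illustrative rows, not real records.
raw_rows = [
    {"pmid": "1", "doi": "10.0000/A", "title": " T "},
    {"pmid": "1", "doi": "", "title": "T"},
    {"pmid": "", "doi": "https://doi.org/10.0000/a", "title": "T"},
    {"pmid": "2", "doi": None, "title": None},
]
cleaned = clean_and_dedupe(raw_rows)
```

Matching on DOI as well as PMID catches records that were exported from different sources with one identifier missing.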