Data Scraping & Research Specialist
Worldwide
Job Description

We are seeking an experienced data scraping and research professional to build a comprehensive statewide database of California Regional Center service providers. The goal is to identify, organize, and analyze all vendorized providers across California’s 21 Regional Centers. This project requires web scraping, data extraction, data cleaning, deduplication, and spreadsheet/database organization.

Scope of Work

Research and collect provider/vendor information from all 21 California Regional Centers. Extract and organize:
* Provider/Vendor Name
* Vendor Number
* Service Code(s)
* Service Description
* Regional Center(s) Served
* Contact Name (if available)
* Email Address (if available)
* Phone Number (if available)
* Website (if available)
* Physical Address (if available)

Required Deliverables

Deliverable 1 – Master Vendor Database
Excel workbook containing:
* All vendorized providers statewide
* All associated service codes
* Regional center association
* Contact information

Deliverable 2 – Service Code Analysis
Create a report showing:
* Number of vendors by service code
* Number of vendors by regional center
* Number of vendors statewide
* Top service categories
* Vendors with multiple service codes

Deliverable 3 – Marketing Lists
Separate Excel sheets containing email lists grouped by:
* Independent Living Services (ILS)
* Supported Living Services (SLS)
* Respite Services
* Transportation Services
* Day Programs
* Residential Services
* Employment Services
* Behavioral Services
Each list should indicate:
* Company Name
* Email
* Regional Center
* Service Code

Deliverable 4 – Deduplication
Many providers may appear in multiple directories. Please:
* Identify duplicate providers
* Merge records where appropriate
* Create a unique statewide vendor count

Technical Requirements

Preferred experience with:
* Python
* BeautifulSoup
* Scrapy
* Playwright
* Selenium
* Pandas
* Excel Data Analysis
Experience scraping government, healthcare, or provider directories is highly preferred.

Project Success Criteria

The final deliverable should allow us to answer questions such as:
* How many ILS providers exist statewide?
* How many SLS providers exist statewide?
* How many providers exist by service code?
* Which regional centers have the highest concentration of providers?
* What contact information is available for providers by service category?

Proposal Requirements

Please include:
1. Examples of similar scraping or database projects.
2. Estimated completion timeline.
3. Estimated accuracy rate.
4. Whether you can automate updates in the future.
5. Total fixed-price bid.

Budget

Open to proposals. Preference will be given to applicants who can demonstrate large-scale scraping, data cleaning, and deduplication experience.
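
As a rough illustration of the deduplication and service code counts described in Deliverables 2 and 4, the merge step might be sketched as below. Everything here is an assumption made for illustration: the field names (`vendor_name`, `service_code`, `regional_center`), the sample rows, the placeholder codes `C01`/`C02`, the placeholder centers `RC-A`/`RC-B`, and the rule that two rows describe the same provider when their normalized names match. Real directories would likely also need vendor numbers, addresses, or fuzzy matching.

```python
import re
from collections import defaultdict

# Hypothetical suffixes to ignore when comparing provider names.
CORPORATE_SUFFIXES = {"inc", "llc", "corp", "co"}


def normalize_name(name: str) -> str:
    """Lowercase a name, drop punctuation and common corporate suffixes."""
    cleaned = re.sub(r"[^a-z0-9 ]", " ", name.lower())
    tokens = [t for t in cleaned.split() if t not in CORPORATE_SUFFIXES]
    return " ".join(tokens)


def merge_vendors(rows):
    """Merge rows whose normalized names match (an assumed matching rule)."""
    merged = {}
    for row in rows:
        key = normalize_name(row["vendor_name"])
        record = merged.setdefault(
            key,
            {
                "vendor_name": row["vendor_name"],  # keep the first spelling seen
                "service_codes": set(),
                "regional_centers": set(),
            },
        )
        record["service_codes"].add(row["service_code"])
        record["regional_centers"].add(row["regional_center"])
    return merged


def vendors_by_service_code(merged):
    """Count unique vendors per service code after merging."""
    counts = defaultdict(int)
    for record in merged.values():
        for code in record["service_codes"]:
            counts[code] += 1
    return dict(counts)


# Made-up sample rows; real rows would come from the scraped directories.
SAMPLE_ROWS = [
    {"vendor_name": "Acme Care, Inc.", "service_code": "C01", "regional_center": "RC-A"},
    {"vendor_name": "ACME CARE INC", "service_code": "C02", "regional_center": "RC-B"},
    {"vendor_name": "Bright Days LLC", "service_code": "C01", "regional_center": "RC-A"},
]

merged = merge_vendors(SAMPLE_ROWS)
unique_vendor_count = len(merged)
code_counts = vendors_by_service_code(merged)
multi_code_vendors = sorted(
    key for key, record in merged.items() if len(record["service_codes"]) > 1
)
```

Merging on a normalized name keeps the sketch short, but it can both over-merge (two different providers with similar names) and under-merge (one provider listed under different trade names), so any real count would need manual review of borderline matches.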
$650.00 (Fixed-price)
- Experience Level: Expert
- Remote Job
- Project Type: One-time project
Activity on this job
- Proposals: 20 to 50
- Last viewed by client: last week
- Interviewing: 0
- Invites sent: 0
- Unanswered invites: 0
About the client
- USA, Fremont (3:01 AM local time)