Software Developer for Physician Database

Posted 5 days ago

Worldwide

Summary

Project: Build a Concierge Physician Lead Database

We are looking for a freelance software developer/data engineer to create a structured spreadsheet of concierge, membership-based, executive health, and direct primary care physicians in the United States. The goal is to build a high-quality lead database for physician outreach. The final deliverable should be a spreadsheet that includes physician and practice-level information, source URLs, and confidence scoring.

Scope of Work:

The developer will identify concierge physicians and related practices using publicly available online sources, including physician practice websites, concierge network directories, NPI data, and compliant search tools/APIs. The ideal candidate should already have access to commercial search APIs or compliant web search services (e.g., Google Search APIs, SERP APIs, Bing APIs, DuckDuckGo-compatible search providers, DataForSEO, SerpAPI, Bright Data, Oxylabs, or similar) that allow legitimate, terms-compliant retrieval of search results. We are not looking for automated scraping of Google search result pages in violation of Google's Terms of Service.

Target Physician Types:

Include physicians or practices that appear to offer one or more of the following:
- Concierge medicine
- Membership medicine
- Retainer-based primary care
- Executive health
- Personalized preventive medicine
- Direct primary care
- Integrative functional medicine
- Longevity

Data Fields to Collect:

For each physician, collect as many of the following fields as possible:
- Physician name
- Credentials (e.g., MD, DO)
- Specialty (Family Medicine, Internal Medicine, etc.)
- Practice name
- Practice website
- Practice address: street
- Practice address: city
- Practice address: state
- Practice address: ZIP code
- Phone number
- Email address, if publicly available
- Concierge network affiliation, if any
- NPI number
- Practice type/category
- Source URL(s)
- Evidence phrases found on the website, such as "concierge," "membership," "annual fee," "limited patient panel," or "same-day access"
- Confidence score that the physician is concierge/membership-based
- Notes/comments

Desired Scoring Logic:

Each physician/practice should be assigned a confidence score based on visible evidence, such as:
- Explicit use of "concierge medicine" or "membership medicine"
- Affiliation with known concierge networks
- Annual or monthly membership fee
- Limited patient panel
- Executive health or comprehensive wellness positioning
- Preventive, longevity, or lifestyle medicine focus

Deliverables:
- A clean spreadsheet in CSV or Excel format.
- A brief data dictionary explaining each field.
- Source URLs for verification.
- A short summary of the methodology used.
- The scripts/code used to collect and score the data (preferred).
- Documentation so the process can be rerun and updated periodically.
- Clear notes on any limitations, incomplete fields, or uncertain records.
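
As an illustration of the desired scoring logic, here is a minimal Python sketch that turns the evidence types listed above into a 0 to 1 confidence score and records the matched evidence phrases. The weights, the regular expressions, and the names SIGNALS, NETWORK_WEIGHT, and score_practice are assumptions made for illustration; the posting does not specify how each signal should be weighted.

```python
import re

# Hypothetical weights and patterns: the posting names the evidence types
# but not how much each one should count.
SIGNALS = [
    ("explicit_concierge", 0.40, r"concierge|membership medicine"),
    ("membership_fee", 0.25, r"(annual|monthly) (membership )?fee|retainer"),
    ("limited_panel", 0.15, r"limited (patient )?panel|same-day access"),
    ("executive_health", 0.10, r"executive health|comprehensive wellness"),
    ("preventive_focus", 0.10, r"longevity|preventive|lifestyle medicine"),
]
NETWORK_WEIGHT = 0.30  # assumed bonus for a known concierge network affiliation


def score_practice(page_text: str, network: str = ""):
    """Return (confidence score capped at 1.0, list of evidence phrases)."""
    text = page_text.lower()
    score = 0.0
    evidence = []
    for _name, weight, pattern in SIGNALS:
        match = re.search(pattern, text)
        if match:
            score += weight
            evidence.append(match.group(0))  # keep the phrase for the spreadsheet
    if network:
        score += NETWORK_WEIGHT
        evidence.append(f"network: {network}")
    return round(min(score, 1.0), 2), evidence


if __name__ == "__main__":
    sample = "We are a concierge medicine practice with an annual fee and same-day access."
    print(score_practice(sample))
```

Keeping the matched phrases next to the score means each record can be spot-checked against its source URL, which fits the requirement to prioritize data quality over volume.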

Preferred Technical Skills:
- Python
- BeautifulSoup, Scrapy, Playwright, or similar web crawling frameworks
- pandas
- Search APIs (Google, Bing, DuckDuckGo-compatible providers, SerpAPI, DataForSEO, Bright Data, Oxylabs, etc.)
- Experience building large lead-generation databases and deduplicating records

Important Requirements:
- Use compliant search APIs or licensed search providers rather than scraping Google search result pages directly.
- Respect website terms of service and robots.txt where applicable.
- Deduplicate physicians and practices.
- Include source URLs for every record.
- Prioritize data quality over volume.

Initial Milestone:

As a first milestone, we'd like a pilot dataset of 300 concierge physicians in Florida (excluding MDVIP and SignatureMD physicians), including confidence scores and source URLs. If the quality is strong, the project will expand to a nationwide database of concierge physicians.

See the attached file, Sales Pipeline Master Sheet, for an example of the output. Note that we have already completed the spreadsheet for concierge physicians in MDVIP and SignatureMD in FL (and other states). See Concierge ecosystem for a starting list of other concierge networks and whether they have physician locators on their websites that might be scraped.
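
One way to approach the deduplication requirement, sketched in plain Python: key each record on its NPI number when one is present, otherwise on a normalized physician name plus 5-digit ZIP code, and merge source URLs so every surviving record keeps all of its sources. The field names (physician_name, zip, npi, source_urls) and the helper functions are assumptions for illustration, not a schema taken from the posting or its attachments.

```python
import re


def normalize_name(name: str) -> str:
    """Lowercase, drop anything after a comma (credentials), and a leading 'Dr'."""
    name = name.lower().split(",")[0].replace(".", "")
    name = re.sub(r"[^a-z ]", " ", name)   # punctuation and digits become spaces
    name = " ".join(name.split())          # collapse repeated whitespace
    return re.sub(r"^dr\s+", "", name)


def record_key(rec: dict) -> tuple:
    """Prefer the NPI as the identity; fall back to name + 5-digit ZIP."""
    npi = (rec.get("npi") or "").strip()
    if npi:
        return ("npi", npi)
    name = normalize_name(rec.get("physician_name", ""))
    return ("name_zip", name, (rec.get("zip") or "")[:5])


def deduplicate(records: list) -> list:
    """Merge records sharing a key, keeping all source URLs and filling blanks.

    Simplification: a record with an NPI and another without one for the
    same physician will not merge; a real pipeline would need a second pass.
    """
    merged = {}
    for rec in records:
        key = record_key(rec)
        if key not in merged:
            merged[key] = dict(rec, source_urls=list(rec.get("source_urls", [])))
            continue
        kept = merged[key]
        for url in rec.get("source_urls", []):
            if url not in kept["source_urls"]:
                kept["source_urls"].append(url)
        for field, value in rec.items():
            if field != "source_urls" and value and not kept.get(field):
                kept[field] = value  # fill fields the first record lacked
    return list(merged.values())


if __name__ == "__main__":
    rows = [
        {"physician_name": "Jane Doe, MD", "zip": "33101", "npi": "",
         "source_urls": ["https://example.com/a"]},
        {"physician_name": "Dr. Jane Doe", "zip": "33101-1234", "npi": "",
         "phone": "555-0100", "source_urls": ["https://example.com/b"]},
    ]
    print(deduplicate(rows))
```

Credentials are stripped only when they follow a comma, so that a surname such as "Do" is not mistaken for the DO credential; names written without a comma ("Jane Doe MD") would need extra handling.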

  • $250.00
    Fixed-price
  • Intermediate
    Experience Level
  • Remote Job
  • Ongoing project
    Project Type
Skills and Expertise
Mandatory skills
Database Design
Database Programming
Nice-to-have skills
MySQL
Claris FileMaker
Activity on this job
  • Proposals: 20 to 50
  • Last viewed by client: 4 days ago
  • Interviewing: 18
  • Invites sent: 31
  • Unanswered invites: 10
About the client
Member since Feb 13, 2019
  • United States
    Santa Fe 6:08 PM
  • $150K total spent
    89 hires, 4 active
  • 2,418 hours
  • Mid-sized company (10-99 people)
