Python Data Engineer Needed for Sports Performance Analytics (Web Scraping & PostgreSQL)
Worldwide
I am the founder of an elite sports performance and coaching brand. I am looking for a skilled Backend Developer / Data Engineer to build an automated data pipeline and PDF generation engine.

We are building a tool that scrapes public race results, processes the raw data through our proprietary mathematical logic (calculating percentiles, variance, and specific performance penalties), stores the data in a database, and automatically generates a custom, highly visual PDF report for the athlete.

Note: You do not need to build a complex front-end web application or mobile app. This is purely a backend data pipeline, database architecture, and automation project.

Core Technical Scope:
- Mass Web Scraping: Build a Python scraper (using Selenium, Playwright, or similar) to extract tabular race data from a specific sports database. You must have experience bypassing rate limits and using rotating proxy networks (e.g., BrightData, Smartproxy) to safely scrape thousands of rows weekly without getting IP-banned. There is an ongoing requirement to scrape new data every week and store it in the bespoke database.
- Database Architecture: Set up and structure a cloud PostgreSQL database (e.g., Supabase or AWS RDS) to store the scraped data cleanly so I can run my own SQL queries directly.
- Logic Engine: Translate proprietary mathematical formulas and conditional logic gates into Python to evaluate the athlete's data.
- Automated PDF Generation: Use a data visualization library (matplotlib/seaborn) or an HTML-to-PDF webhook to generate a clean, premium report containing chronological line charts and clustered bar charts based on the athlete's data.
- Integration: Link the backend to a front-end intake form (Typeform) and an email delivery system.

The Ideal Candidate:
- Deep expertise in Python, web scraping, and database management.
- A strong grasp of mathematics and statistics.
- Bonus: A personal interest in sports, fitness, or "Moneyball" style data analytics. If you understand athletic performance data, you will grasp the vision of this project much faster.

Next Steps & How to Apply:
To protect proprietary coaching logic, the full scope and exact mathematical blueprints will be shared during a discovery call. If you are interested, please reply with:
- A brief overview of your experience with web scraping (specifically handling anti-bot protections/proxies) and PDF generation.
- The word "COACH" at the top of your proposal so I know you read this entire description.

From there, we will schedule a brief video call to run through the blueprint document. Once you understand the exact logic flow, you can provide an accurate, fixed-price quote for the build.
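For applicants gauging the Logic Engine work, here is a minimal, generic sketch of the kind of statistics mentioned above (percentiles and variance). It is not the proprietary logic, which is only shared on the discovery call. The function names `percentile_rank` and `pacing_variance`, the sample times, and the "lower time is better" convention are all assumptions made for illustration.

```python
from statistics import pvariance


def percentile_rank(finish_times, athlete_time):
    """Return the percentage of the field that finished slower than the athlete.

    Assumes times are in seconds and that a lower time is better
    (hypothetical convention; the real scoring rules are not public).
    """
    if not finish_times:
        raise ValueError("finish_times must not be empty")
    slower = sum(1 for t in finish_times if t > athlete_time)
    return 100.0 * slower / len(finish_times)


def pacing_variance(split_times):
    """Population variance of an athlete's split times, in seconds squared."""
    return pvariance(split_times)


if __name__ == "__main__":
    # Made-up example data, not real race results.
    field = [3120.0, 3255.5, 3300.0, 3410.2, 3600.0]
    print(percentile_rank(field, 3255.5))
    print(pacing_variance([300.0, 310.0, 305.0, 320.0]))
```

In the real build, the inputs would presumably come from the PostgreSQL tables rather than in-memory lists, and the penalty rules would follow the blueprint document.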
- Hourly: Not sure
- Duration: < 1 month
- Experience Level: Intermediate
- Remote Job
- Project Type: One-time project
Activity on this job
- Proposals: 20 to 50
- Last viewed by client: 3 days ago
- Interviewing: 0
- Invites sent: 0
- Unanswered invites: 0
About the client
- United Kingdom, 2:50 AM