๐ AI Football Event Spotting Engine โ Milestone-Based Contract (3 Months)
Worldwide
๐ AI Football Event Spotting Engine โ Milestone-Based Contract (3 Months) โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ ๐ PROJECT OVERVIEW โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ We are building Roots & Rise IQ โ an AI-powered Football Event Spotting Engine that detects, classifies, and timestamps 15 discrete football action events from single-camera grassroots match footage in near real-time. The camera setup is a single pitch-side PTZ (pan-tilt-zoom) unit similar to Veo โ it continuously tracks the ball, producing a stabilized cropped tracking view. The system must process a 30-second clip and return a structured JSON event stream within 25 seconds. This is NOT a research project. We need a working, deployable system with measurable performance scores at each milestone. Every milestone is independently evaluated using a VisionScore metric on 100 held-out private test clips. โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ โฝ WHAT THE SYSTEM MUST DO โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ Detect and timestamp 15 action event classes from video: โข pass, pass_received, recovery, tackle, interception โข ball_out_of_play, clearance, take_on, substitution โข block, aerial_duel, shot, save, foul, goal Output format (JSON array, sorted by timestamp): [ "action": "shot", "timestamp": 14.40, "team": "home", "action": "save", "timestamp": 14.84, "team": "away" ] Hard constraints: โข Processing latency: โค 25 seconds per 30-second clip (non-negotiable) โข Inference must be incremental โ not buffered until clip end โข Output must pass schema validation with zero violations โข Deployed on NVIDIA T4 GPU (16GB VRAM) โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ ๐ง REQUIRED SKILLS & EXPERIENCE โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ This role requires genuine hands-on experience โ not theoretical knowledge. You must have: โ MANDATORY โข PyTorch 2.x โ model training, custom loss functions, TorchScript export โข Video processing โ PyAV / OpenCV / FFmpeg-based frame pipelines โข Object detection โ YOLOv8 fine-tuning (player, ball, pose keypoints) โข Temporal action spotting โ SoccerNet, E2E-Spot, T-DEED, or Dense Anchor familiarity โข Multi-object tracking โ BoT-SORT, ByteTrack, or DeepSORT with camera motion compensation โข ONNX Runtime + TensorRT FP16 deployment pipeline โข FastAPI async inference server โข Docker containerization with NVIDIA CUDA runtime โข GPU cloud (AWS EC2/GCP) โ provisioning, CUDA setup, training runs โญ STRONGLY PREFERRED โข Direct experience with SoccerNet-v2 or Ball Action Spotting datasets โข Optical flow + homography-based camera motion compensation โข Pose estimation (YOLOv8-Pose or ViTPose) for contact event disambiguation โข DBSCAN jersey-color clustering for team attribution โข Focal loss, balanced mixup, and curriculum training for class imbalance โข T-DEED, ASTRA, or E2E-Spot architecture implementation โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ ๐ MILESTONE PLAN โ 3 MONTHS, 3 DELIVERIES โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ This is a strict milestone-based contract. Payment is released only upon verified milestone completion. โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ MILESTONE 1 โ Deadline: June 28, 2026 Target: VisionScore โฅ 0.40 / 1.0 โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ Deliverables: โข Full pipeline: ingestion โ camera compensation โ detection โ tracking โ spotting โ JSON output โข 8 core event classes active: pass, pass_received, recovery, ball_out_of_play, clearance, tackle, shot, aerial_duel โข Architecture: ResNet-50 / RegNet-Y + Dense Detection Anchor head with dual classification + timestamp displacement regression โข Soft-NMS post-processing, DBSCAN team attribution โข Latency: โค 25s per clip on T4 GPU โข Zero JSON schema violations โข Working FastAPI endpoint โ submit clip URL โ receive JSON event stream โข VisionScore โฅ 0.40 on 100-clip Cluster A (independent evaluation) โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ MILESTONE 2 โ Deadline: July 28, 2026 Target: VisionScore โฅ 0.50 / 1.0 โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ Deliverables: โข Upgrade to T-DEED end-to-end architecture (EfficientNetV2 + GSF + temporal encoder-decoder) โข All 15 action classes active with valid predictions โข YOLOv8-Pose integration โ 17-point skeleton for foul/tackle and block/save disambiguation โข 5-pair disambiguation heads deployed (tackle/foul, block/save, shot/clearance, interception/recovery, pass/clearance) โข Goalkeeper identity module (position prior + kit colour classifier) โข Balanced mixup training (ASTRA-style) for class imbalance โข Mean absolute timestamp error below 0.8s for goal, foul, save, shot โข Foul precision above 0.68 (false foul penalty = 7.7 weight units โ cannot afford FPs) โข VisionScore โฅ 0.50 on 100-clip Cluster B (independent evaluation) โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ MILESTONE 3 โ Deadline: August 28, 2026 Target: VisionScore โฅ 0.60 / 1.0 โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ Deliverables: โข Two-model ensemble: T-DEED + ASTRA Transformer encoder-decoder with late fusion โข TensorRT FP16 export and optimization โ latency โค 20s per clip on T4 โข Curriculum training on full proprietary Roots & Rise dataset (expected mid-July) โข Goal recall above 0.80, foul + save combined precision above 0.68 โข Docker production container (nvcr.io/nvidia/pytorch base) โ runs with single command on NVIDIA GPU server โข Complete API documentation (endpoint spec, request/response schema, code examples) โข Model card โ capabilities, known limitations, class-level metrics โข VisionScore โฅ 0.60 on 100-clip Cluster C (independent evaluation) โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ โ ๏ธ CONTRACT RESTRICTIONS & CANCELLATION POLICY โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ This project has firm, non-negotiable delivery conditions. Please read carefully before applying. 1. MILESTONE DEADLINE IS FINAL Each milestone has a fixed evaluation submission date (June 28 / July 28 / August 28). Late submissions will NOT be accepted. The evaluation clusters are released only upon submission โ there is no extension mechanism. 2. FAILED MILESTONE = CONTRACT TERMINATION If the official VisionScore falls below the milestone target (M1: 0.40, M2: 0.50, M3: 0.60) on the independent evaluation, the contract will be terminated at that milestone. Payment for that milestone will NOT be released. 3. LATENCY FAILURE = DISQUALIFICATION If end-to-end processing of a 30-second clip exceeds 25 seconds on T4 GPU, the submission is automatically disqualified โ regardless of accuracy. This is an evaluation system rule, not our policy. 4. SCHEMA VIOLATIONS = REJECTION Any output that fails JSON schema validation is treated as a non-submission. The output array must be sorted by timestamp, all field types must match exactly, and team values must be exactly "home" or "away". 5. PAYMENT TERMS โข Milestone 1 payment: released only after VisionScore โฅ 0.40 is confirmed โข Milestone 2 payment: released only after VisionScore โฅ 0.50 is confirmed โข Milestone 3 payment: released only after VisionScore โฅ 0.60 is confirmed AND Docker package is delivered โข No payment is made for partial milestone completion 6. WEEKLY PROGRESS UPDATES ARE MANDATORY You must share a written progress update every 7 days. Failure to communicate for more than 5 consecutive working days without prior notice will be treated as project abandonment and the contract will be closed. 7. CODE OWNERSHIP All code, models, weights, and documentation produced under this contract are the full intellectual property of Roots & Rise IQ. No proprietary dataset content may be retained, shared, or reused after contract completion. โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ ๐ฆ RESOURCES WE PROVIDE โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ โข Public reference dataset: https://huggingface.co/datasets/amandeepds1/roots_and_riseIQ โข SoccerNet-v2 and Ball Action Spotting datasets (publicly available โ you source these) โข Proprietary Roots & Rise grassroots footage dataset โ expected mid-July 2026 (delivered for M2/M3 training) โข Full Technical Requirements Document (TRD) with exact architecture specifications, scoring formulas, and evaluation methodology โข Shared project board (Notion) for task tracking โข GPU cloud access: T4 GPU for evaluation (you provision your own training compute) โข Our team is available for daily async communication and weekly sync calls โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ ๐ฌ HOW TO APPLY โ REQUIRED IN YOUR PROPOSAL โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ Generic proposals will be ignored. To be considered, your proposal must include: 1. RELEVANT EXPERIENCE โ Describe a specific project where you built a video-based action detection or temporal spotting system. What dataset, architecture, and metric did you use? What was the result? 2. ARCHITECTURE CHOICE FOR M1 โ How would you approach the Dense Detection Anchor model for milestone 1? Which backbone would you use and why? (2โ3 sentences โ we are testing domain knowledge, not asking for a full plan.) 3. LATENCY STRATEGY โ How would you ensure the full pipeline (detection + tracking + spotting + JSON) completes within 25 seconds on a T4 GPU for a 30-second clip? What would you profile and optimize first? 4. DISAMBIGUATION APPROACH โ The tackle/foul pair has a combined error cost of 10.2 VisionScore units if misclassified. In one or two sentences, how would you reduce this risk? 5. YOUR BID โ Provide your total bid broken down by milestone. Bids without milestone breakdown will not be considered. 6. TIMELINE CONFIRMATION โ Confirm explicitly that you accept all three deadline dates: June 28, July 28, and August 28, 2026, and understand the cancellation policy. โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ ๐ฏ WHO THIS IS FOR โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ This project is ideal for: โข Computer vision engineers who have worked on sports analytics or temporal video understanding โข ML engineers with SoccerNet or action spotting research/competition experience โข AI engineers who are comfortable owning an end-to-end system โ training, optimization, and deployment This project is NOT for: โข Generalist ML engineers with no video understanding experience โข Freelancers who will subcontract the work without disclosure โข Anyone who cannot commit to all three milestone deadlines We are looking for one focused engineer or a small team (max 2) who can take full ownership and deliver. โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ ๐ SCORING REFERENCE (For Your Information) โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ VisionScore = (matched scores total minus false positive penalties total) divided by ground truth weights total High-stakes events: โข goal: weight 10.9, tolerance ยฑ3.0s โข foul: weight 7.7, tolerance ยฑ2.5s โข save: weight 7.3, tolerance ยฑ2.0s โข shot: weight 4.7, tolerance ยฑ2.0s A missed goal costs 10.9 units. A false foul prediction costs 7.7 units. The scoring system heavily penalizes high-weight class errors โ your model must be precision-calibrated, not just high-recall. Good luck โ we look forward to hearing from engineers who are serious about this challenge. ๐ฏ
$580.00
Fixed-price- ExpertExperience Level
- Remote Job
- Ongoing projectProject Type
Skills and Expertise
Activity on this job
- Proposals:Less than 5
- Last viewed by client:2 weeks ago
- Interviewing:2
- Invites sent:0
- Unanswered invites:0
About the client
- IndiaGorakhpur7:43 AM
- $10 total spent3 hires, 3 active
- Tech & ITSmall company (2-9 people)
Explore similar jobs on Upwork
How it works
Create your free profileHighlight your skills and experience, show your portfolio, and set your ideal pay rate.
Work the way you wantApply for jobs, create easy-to-by projects, or access exclusive opportunities that come to you.
Get paid securelyFrom contract to payment, we help you work safely and get paid securely.
About Upwork
- 4.9/5(Average rating of clients by professionals)
- G2 2021#1 freelance platform
- 49,000+Signed contract every week
- $2.3BFreelancers earned on Upwork in 2020
Find the best freelance jobs
Growing your career is as easy as creating a free profile and finding work like this that fits your skills.
Trusted by