AI/ML Engineer Needed for OCR and Document Parsing Pipeline

Posted 2 weeks ago

Worldwide

Summary

We are looking for an experienced AI/ML engineer to design and implement an OCR and document parsing pipeline that can reliably extract structured data from scanned documents and PDFs. The goal is to automate data extraction from semi-structured and unstructured documents (invoices, forms, reports, etc.) into a clean, machine-readable format that can feed our backend systems. Project Scope Build an end-to-end OCR and document parsing pipeline (from file upload to structured JSON output or database insertion). Handle a variety of document layouts (scanned PDFs, images, multi-page documents, rotated/cropped pages, noisy scans). Support at least 100 document types initially (e.g., invoices, receipts, ID documents, medical reports), with a design that makes adding new templates straightforward. Responsibilities - Analyze our sample documents and define the optimal OCR and parsing strategy (rules-based, ML-based, or hybrid). - Implement OCR using tools such as Tesseract, EasyOCR, Google Vision, AWS Textract, or similar. - Design and train ML models for layout analysis and field extraction (e.g., key-value pairs, tables, named entities) where needed. - Build document preprocessing steps (deskewing, denoising, contrast/thresholding, language selection). - Implement robust parsing logic to map extracted text to structured fields (JSON schema or database model). - Set up evaluation metrics (precision/recall/accuracy per field) and iterate to reach agreed quality thresholds. - Package the solution as an API or microservice that our backend can call (REST/GraphQL, etc.). Required Skills and Experience - Strong experience with Python for AI/ML and backend scripting. - Hands-on experience with OCR and document AI (e.g., Tesseract, EasyOCR, PaddleOCR, Google Vision, AWS Textract, Azure Form Recognizer). - Solid understanding of computer vision and NLP techniques for document understanding (layout analysis, entity extraction, text normalization). - Experience with one or more ML/DL frameworks: PyTorch, TensorFlow, or similar. - Familiarity with data preprocessing, feature engineering, and model evaluation best practices. - Experience exposing models as APIs or integrating ML pipelines into production systems (Docker, FastAPI, Flask, etc.). - Strong communication skills and ability to document code and decisions clearly. Nice-to-Have Experience with document layout models (LayoutLM, Donut, TrOCR, or similar). Prior work on invoice/receipt parsing, ID document extraction, or healthcare document workflows. Knowledge of MLOps practices (model versioning, logging, monitoring, CI/CD for ML). Experience with major cloud platforms (AWS/GCP/Azure) and their document AI services. Deliverables - A working OCR and document parsing pipeline integrated with our backend (or exposed as a well-documented API). - Clean, well-structured code repository (Git) with clear setup instructions. - Configuration to support at least 100 initial document types, with documentation on how to add more. - Evaluation report (test set, metrics per document type, known limitations and edge cases). - Optional: Basic dashboard or logs to monitor parsing performance and errors over time. How to Apply Please start your proposal with the word "OCR-PIPELINE" so we know you read the post. Include the following: A brief summary of your experience with OCR and document parsing (1-2 paragraphs). Links or descriptions of similar projects you've completed (especially OCR / document AI work). Your preferred tech stack for this project (OCR engine, ML framework, API framework, cloud provider if any). A rough outline of your approach to building a robust OCR + parsing pipeline for mixed document types. Your estimated timeline and total cost or hourly rate Looking forward to your proposals.

  • $200.00

    Fixed-price
  • Expert
    Experience Level
  • Remote Job
  • One-time project
    Project Type

Contract-to-hire opportunity

This lets talent know that this job could become full time.
Learn more
Skills and Expertise
Mandatory skills
AI Development
Artificial Intelligence
Activity on this job
  • Proposals:20 to 50
  • Last viewed by client:last week
  • Hires:
    1
  • Interviewing:
    4
  • Invites sent:
    21
  • Unanswered invites:
    12
About the client
Member since Jul 3, 2025
  • Ukraine
    Bila Tserkva12:13 PM
  • $743 total spent
    22 hires, 1 active
  • 3 hours
  • Tech & IT
    Small company (2-9 people)

Explore similar jobs on Upwork

Build Agentic AI EngineHourly‐ Posted 9 months ago
Artificial Intelligence
turn 2D pose detection to 3dFixed-price‐ Posted 4 weeks ago
Computer Vision
3D Modeling
OpenCV

How it works

  • Post a job icon
    Create your free profile
    Highlight your skills and experience, show your portfolio, and set your ideal pay rate.
  • Talent comes to you icon
    Work the way you want
    Apply for jobs, create easy-to-by projects, or access exclusive opportunities that come to you.
  • Payment simplified icon
    Get paid securely
    From contract to payment, we help you work safely and get paid securely.
Want to get started? Create a profile

About Upwork

  • Rating is 4.9 out of 5.
    4.9/5
    (Average rating of clients by professionals)
  • G2 2021
    #1 freelance platform
  • 49,000+
    Signed contract every week
  • $2.3B
    Freelancers earned on Upwork in 2020

Find the best freelance jobs

Growing your career is as easy as creating a free profile and finding work like this that fits your skills.

Trusted by

  • Microsoft Logo
  • Airbnb Logo
  • Bissell Logo
  • GoDaddy Logo