AI Developer (Claude/Gemini) for Automated PDF Extraction from HVAC Blueprints

Posted 4 weeks ago

Worldwide

Summary

Project Overview: I am looking for an AI automation and data extraction expert to build a reliable pipeline that extracts complex sheet metal HVAC ductwork sequences and specifications from architectural blueprints. The goal is to automatically process these blueprints and output the data into a clean, structured master duct-schedule in Google Sheets/Excel so I can clearly understand the system layout and fabrication requirements.

The Problem: I currently use Claude and Gemini for this, but I consistently hit roadblocks. The files are large and visually complex. The AI frequently "forgets" the file, drops context, or refuses to process the PDF due to limitations in the standard chat interface. I need a robust API or scripted solution to bypass these web-interface limits.

File Formats: We are primarily dealing with large PDF blueprints. However, I can often provide the original AutoCAD (.dwg/.dxf) files. If you know how to extract the raw metadata/layers from AutoCAD before passing it to the LLM, that is a massive bonus.

What You Will Do:
  • Build a script/pipeline (likely Python) to parse complex HVAC blueprints (PDF or AutoCAD).
  • Integrate the Claude or Gemini API to accurately interpret the extracted data (identifying duct sizes, lengths, tags, and sequences).
  • Engineer the perfect prompt/system instructions to handle construction terminology without hallucinating.
  • Format the final output so it easily ports into a structured spreadsheet.

Ideal Candidate:
  • Expertise in Python, API integration (Anthropic/Google), and handling large context windows.
  • Strong experience with OCR (for PDFs) and programmatic CAD extraction (for .dwg files).
  • Experience dealing with complex tables, schematics, or blueprints.
  • Clear communicator who can build this so it is easy for a non-programmer to run on their own machine.

Budget Structure: I am setting a fixed price for the initial successful setup, testing, and delivery of the working script. Once the pipeline is built, I would like to transition to an hourly contract for ongoing support, edge-case troubleshooting, and fine-tuning as I process different types of blueprints.
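
To illustrate the final deliverable (output that ports cleanly into Google Sheets/Excel), here is a minimal sketch of the last pipeline step only. It assumes the model has been instructed to return a JSON array of duct records; the column names in COLUMNS, the helper names, and the sample record are hypothetical and are not part of this posting. Fields the model did not return are left blank rather than guessed.

```python
import csv
import io
import json

# Hypothetical columns for the master duct schedule; adjust to the real schema.
COLUMNS = ["tag", "sequence", "shape", "size", "length_ft", "sheet"]


def rows_from_model_json(raw: str) -> list[dict]:
    """Parse a JSON array of duct records and keep only the known columns."""
    records = json.loads(raw)
    if not isinstance(records, list):
        raise ValueError("expected a JSON array of duct records")
    # Missing fields become empty cells instead of invented values.
    return [{col: rec.get(col, "") for col in COLUMNS} for rec in records]


def to_csv(rows: list[dict]) -> str:
    """Serialize rows to CSV text that Google Sheets or Excel can import."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=COLUMNS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


# Made-up sample standing in for one API response.
sample = (
    '[{"tag": "SA-1", "sequence": 1, "shape": "rectangular", '
    '"size": "24x12", "length_ft": 18.5, "sheet": "M-201"}, '
    '{"tag": "SA-2", "sequence": 2, "size": "18x12"}]'
)
csv_text = to_csv(rows_from_model_json(sample))
print(csv_text)
```

Writing plain CSV keeps the script dependency-free for a non-programmer; a direct Google Sheets or .xlsx export could replace to_csv later without touching the parsing step.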

  • $400.00

    Fixed-price
  • Expert
    Experience Level
  • Remote Job
  • Ongoing project
    Project Type
Skills and Expertise
Mandatory skills
Python
API Integration
Activity on this job
  • Proposals: 50+
  • Last viewed by client: 4 weeks ago
  • Interviewing: 0
  • Invites sent: 0
  • Unanswered invites: 0
About the client
Member since Nov 3, 2012
  • United States
    Orangeburg 4:29 AM
  • $60K total spent
    87 hires, 6 active
  • 8,244 hours
  • Art & Design
    Small company (2-9 people)
