You will get a custom OCR solution for document, receipt, or invoice data extraction

Name: You will get a custom OCR solution for document, receipt, or invoice data extraction
Availability: InStock

Md Faruk A. Md Faruk A.

4.9

Md Faruk A. Md Faruk A.

4.9

Project details

Have documents, receipts, or invoices that need data extracted automatically? I build custom OCR solutions that read text from images, PDFs, and scanned documents with high accuracy.

What I can build:
- Invoice and receipt data extraction (amount, date, vendor, line items)
- ID card and passport text recognition
- Handwritten text recognition and digitization
- Number plate and license plate reading
- Custom document parsing with structured output (JSON, CSV, Excel)

My Tech Stack:
- OCR Engines: Tesseract, EasyOCR, PaddleOCR, Google Vision API, AWS Textract
- AI/ML: OpenCV, PyTorch, TensorFlow, custom trained models
- Output: JSON, CSV, Excel, database integration
- Deployment: Desktop app, web app, or API
- Languages: Python, C++

Message me with a sample document and I will tell you what approach works best.

Programming Languages

PHP, Python, TypeScript

Coding Expertise

Cross Browser & Device Compatibility, Performance Optimization, Design

What's included

Service Tiers	Starter $300	Standard $1,600	Advanced $4,000
Delivery Time	3 days	10 days	30 days
Number of Revisions	1	1	1
Design Customization	-
Content Upload	-
Responsive Design	-
Source Code

Frequently asked questions

4.9

9 reviews

89% Complete

(8)

11% Complete

(1)

1% Complete

(0)

1% Complete

(0)

1% Complete

(0)

Consultant Needed for Roboflow and YOLO Object Detection Training

Machine learning tool for material identification The project went smoothly and was delivered ahead of schedule.

3D Reconstruction Pipeline Great communication, skills stuck to the deadlines and offered great ideas to improve the project.

Facial recognition to a web application that uses the camera and recognizes with AI skin problems. Faruk is a great person, willing to help and solve the needs of what is needed, thanks for everything.

AI/ML Prototype Developer for UKEY App Faruk was a pleasure to work with, very skilled at what he does and Impeccable communication. I would be happy to work with him again

About Md Faruk

View profile

View portfolio

Computer Vision, Machine Learning, Jetson, C++ | AI Agent, Claude Code

83% Job Success

4.9 (9 reviews)

Rangamati, Bangladesh - 1:37 pm local time

I'm a Senior Computer Vision Engineer with 7+ years of professional experience delivering enterprise-grade Computer Vision, Edge AI, Deep Learning & Machine Learning Solutions. I have a solid foundation in state-of-the-art deep learning models and machine learning algorithms, applying them to build, deploy, and optimize production-ready systems across edge devices, cloud platforms, and agentic AI pipelines.

✅ What I Build
▸ Computer Vision: object detection, tracking, segmentation, pose estimation, image classification, counting, OCR, anomaly detection
▸ Edge AI: NVIDIA Jetson (Nano, Orin, Xavier), Raspberry Pi, model optimization with TensorRT, ONNX, and TFLite for real-time inference
▸ Image Generation: Stable Diffusion (SDXL, SD 3.5), Flux, DALL·E 3, ControlNet, LoRA fine-tuning, ComfyUI workflows
▸ Video Generation: Kling, Runway Gen-3, Minimax Hailuo, Veo, Pika, automated cinematic and product video pipelines
▸ AI Agents: autonomous multi-step agents using LangChain, LangGraph, CrewAI, and AutoGen with RAG, memory, and tool use
▸ Voice AI: conversational voice agents using Vapi, Retell AI, ElevenLabs, Bland AI, and Deepgram for inbound/outbound calling and automation
▸ MCP Servers: custom Model Context Protocol servers connecting Claude and other agents to your APIs, databases, and internal tools
▸ Claude Code: agentic software engineering with subagent architectures, skill-based pipelines, and autonomous multi-step coding workflows

✅ Tech Stack
▪ Computer Vision: OpenCV, YOLO, MediaPipe, Detectron2, SAM, Vision Transformers
▪ Deep Learning: PyTorch, TensorFlow, Keras, ONNX
▪ Tracking & Optimization: DeepSORT, ByteTrack, TensorRT, OpenVINO
▪ Deployment: DeepStream, Triton Inference Server, TFLite, FastAPI, Flask, Docker
▪ Generative AI: Stable Diffusion, Flux, ComfyUI, Replicate, Fal.ai
▪ Agents & Voice: LangChain, LangGraph, CrewAI, Vapi, Retell AI, ElevenLabs, n8n
▪ MCP & Agentic: Claude Code, MCP SDK, custom tool servers
▪ Cloud: AWS, GCP, Azure
▪ Languages: Python, C++, CUDA

🚀 Send me a message with what you are trying to build, and let's discuss.

Steps for completing your project

After purchasing the project, send requirements so Md Faruk can start the project.

Delivery time starts when Md Faruk receives requirements from you.

Md Faruk works on your project following the steps below.

Revisions may occur after the delivery date.

Build OCR Pipeline

Set up preprocessing, text extraction, and field parsing.

Train or Fine-tune

Train custom model if standard OCR is not accurate enough.

Review the work, release payment, and leave feedback to Md Faruk.

Select service tier

Starter$300

Standard$1,600

Advanced$4,000

Basic OCR Setup

Basic OCR setup for single document type with text extraction to JSON or CSV

Delivery Time 3 days
Number of Revisions 1
- Source Code

3 days delivery — Jun 25, 2026

Revisions may occur after this date.

Upwork Payment Protection

Fund the project upfront. Md Faruk gets paid once you are satisfied with the work.

You will get a custom OCR solution for document, receipt, or invoice data extraction

Let a pro handle the details

Let a pro handle the details

Project details

Programming Languages

Coding Expertise

What's included

Frequently asked questions

IN

RP

YB

AL

NR

About Md Faruk

Computer Vision, Machine Learning, Jetson, C++ | AI Agent, Claude Code

Steps for completing your project

After purchasing the project, send requirements so Md Faruk can start the project.

Md Faruk works on your project following the steps below.

Build OCR Pipeline

Train or Fine-tune

Review the work, release payment, and leave feedback to Md Faruk.

Select service tier

Basic OCR Setup