You will get a custom OCR solution for document, receipt, or invoice data extraction


Project details
Have documents, receipts, or invoices that need data extracted automatically? I build custom OCR solutions that read text from images, PDFs, and scanned documents with high accuracy.
What I can build:
- Invoice and receipt data extraction (amount, date, vendor, line items)
- ID card and passport text recognition
- Handwritten text recognition and digitization
- Number plate and license plate reading
- Custom document parsing with structured output (JSON, CSV, Excel)
My Tech Stack:
- OCR Engines: Tesseract, EasyOCR, PaddleOCR, Google Vision API, AWS Textract
- AI/ML: OpenCV, PyTorch, TensorFlow, custom trained models
- Output: JSON, CSV, Excel, database integration
- Deployment: Desktop app, web app, or API
- Languages: Python, C++
Message me with a sample document and I will tell you what approach works best.
What I can build:
- Invoice and receipt data extraction (amount, date, vendor, line items)
- ID card and passport text recognition
- Handwritten text recognition and digitization
- Number plate and license plate reading
- Custom document parsing with structured output (JSON, CSV, Excel)
My Tech Stack:
- OCR Engines: Tesseract, EasyOCR, PaddleOCR, Google Vision API, AWS Textract
- AI/ML: OpenCV, PyTorch, TensorFlow, custom trained models
- Output: JSON, CSV, Excel, database integration
- Deployment: Desktop app, web app, or API
- Languages: Python, C++
Message me with a sample document and I will tell you what approach works best.
Programming Languages
PHP, Python, TypeScriptCoding Expertise
Cross Browser & Device Compatibility, Performance Optimization, DesignWhat's included
| Service Tiers |
Starter
$300
|
Standard
$1,600
|
Advanced
$4,000
|
|---|---|---|---|
| Delivery Time | 3 days | 10 days | 30 days |
Number of Revisions | 1 | 1 | 1 |
Design Customization | - | ||
Content Upload | - | ||
Responsive Design | - | ||
Source Code |
Frequently asked questions
9 reviews
(8)
(1)
(0)
(0)
(0)
This project doesn't have any reviews.
IN
Ihor N.
Nov 2, 2025
Consultant Needed for Roboflow and YOLO Object Detection Training
RP
RJ P.
Apr 11, 2025
Machine learning tool for material identification
The project went smoothly and was delivered ahead of schedule.
YB
Yusuf B.
Nov 29, 2024
3D Reconstruction Pipeline
Great communication, skills stuck to the deadlines and offered great ideas to improve the project.
AL
Alex L.
Jul 8, 2024
Facial recognition to a web application that uses the camera and recognizes with AI skin problems.
Faruk is a great person, willing to help and solve the needs of what is needed, thanks for everything.
NR
Nils R.
Jul 5, 2024
AI/ML Prototype Developer for UKEY App
Faruk was a pleasure to work with, very skilled at what he does and Impeccable communication. I would be happy to work with him again
About Md Faruk
Computer Vision, Machine Learning, Jetson, C++ | AI Agent, Claude Code
83%
Job Success
Rangamati, Bangladesh - 1:37 pm local time
✅ What I Build
▸ Computer Vision: object detection, tracking, segmentation, pose estimation, image classification, counting, OCR, anomaly detection
▸ Edge AI: NVIDIA Jetson (Nano, Orin, Xavier), Raspberry Pi, model optimization with TensorRT, ONNX, and TFLite for real-time inference
▸ Image Generation: Stable Diffusion (SDXL, SD 3.5), Flux, DALL·E 3, ControlNet, LoRA fine-tuning, ComfyUI workflows
▸ Video Generation: Kling, Runway Gen-3, Minimax Hailuo, Veo, Pika, automated cinematic and product video pipelines
▸ AI Agents: autonomous multi-step agents using LangChain, LangGraph, CrewAI, and AutoGen with RAG, memory, and tool use
▸ Voice AI: conversational voice agents using Vapi, Retell AI, ElevenLabs, Bland AI, and Deepgram for inbound/outbound calling and automation
▸ MCP Servers: custom Model Context Protocol servers connecting Claude and other agents to your APIs, databases, and internal tools
▸ Claude Code: agentic software engineering with subagent architectures, skill-based pipelines, and autonomous multi-step coding workflows
✅ Tech Stack
▪ Computer Vision: OpenCV, YOLO, MediaPipe, Detectron2, SAM, Vision Transformers
▪ Deep Learning: PyTorch, TensorFlow, Keras, ONNX
▪ Tracking & Optimization: DeepSORT, ByteTrack, TensorRT, OpenVINO
▪ Deployment: DeepStream, Triton Inference Server, TFLite, FastAPI, Flask, Docker
▪ Generative AI: Stable Diffusion, Flux, ComfyUI, Replicate, Fal.ai
▪ Agents & Voice: LangChain, LangGraph, CrewAI, Vapi, Retell AI, ElevenLabs, n8n
▪ MCP & Agentic: Claude Code, MCP SDK, custom tool servers
▪ Cloud: AWS, GCP, Azure
▪ Languages: Python, C++, CUDA
🚀 Send me a message with what you are trying to build, and let's discuss.
Steps for completing your project
After purchasing the project, send requirements so Md Faruk can start the project.
Delivery time starts when Md Faruk receives requirements from you.
Md Faruk works on your project following the steps below.
Revisions may occur after the delivery date.
Build OCR Pipeline
Set up preprocessing, text extraction, and field parsing.
Train or Fine-tune
Train custom model if standard OCR is not accurate enough.