You've landed at the right place. oDesk is now Upwork. Learn about the new platform.

Ocr Tesseract Jobs

4 were found based on your criteria

show all
show all
only
only
only
show all
only
only
only
only
only
show all
only
only
only
Fixed-Price - Est. Budget: $ 250 Posted
Hi, We have 2 application running at client side. We need to integrate either 'tesseract' or any better OCR open source application which can convert 2 documents (PAN CARD and VOTER ID card) into text. We would like to have it as real time thing. In case of some 5-10% error in processing team will handle. Need suggestion plus integration so the person or team has handle such issue in past can revert. We do have some code with us, which gives a better idea. Our web platform is in Python and DB we are using MongoDB. Best
Fixed-Price - Est. Budget: $ 200 Posted
We want a script that will run on a Linux server and read a single long PDF document containing documents similar to those in link below and return a JSON with extracted data. Data to extract: - Violation Number - License Plate - Date - Amount / Toll Due Sample PDFs: http://104.236.66.8/buggy/pdf_tickets.zip The uploaded documents will be the same in the link but will have different data for Violation Number, License Plate etc. Script should preferably be in PHP and use Tesseract OCR engine. Here is PHP class for Tesseract: https://github.com/thiagoalessio/tesseract-ocr-for-php If this doesn't work well then another language would be fine. You would be provided root access to a Ubuntu server and be responsible for setting everything up.
Fixed-Price - Est. Budget: $ 200 Posted
You have experience in training tesseract. I want to OCR Identity Cards which uses different fonts on each row. You will take sample Scanned Cards, isolate the different fonts used within them, and run enough training on samples of each that accuracy will be improved on future Cards, and return me the tessdata folder, along with whatever else I need to run tesseract. If it matters, this will be an Windows installation of tesseract 3.02 with C# Wrapper I can provide you with unlimited samples, you don't have to train on all pages of them, of course, but you do need to go through a sampling and identify and isolate the crucial fonts, and train on enough to improve accuracy. Important Notes: 1. the cards contains both Arabic and English text with numbers, I m interested in Arabic Fonts. 2. Fonts and Sample will be delivered once approval. 3. Arabic Characters has different forms based on the location of the work( start, between, end, isolated) I will help you to identify different...