OCR Tesseract Jobs

11 were found based on your criteria

Fixed-Price - Expert ($$$) - Est. Budget: $1,000 - Posted
I need a website where people can easily convert images and files from one format to another: image to PDF, PDF to DOC, PDF to HTML, PDF to XLS, PDF to TXT, and vice versa. I like the website http://onlineocr.net and I want mine to work like it. Please apply if you are confident and have previous experience building this type of website. Regards, Waris
Skills: OCR Tesseract C# C++ OCR algorithms
Fixed-Price - Expert ($$$) - Est. Budget: $4,000 - Posted
We are looking for a developer who has experience with OCR software such as Tesseract, with mining data from OCR output, and with mining data from PDFs that contain extractable text. We need to extract structured data from invoices, which arrive both scanned and text-based, using an in-house solution. Invoices will come from many different suppliers, and this will be an ongoing project. The budget will be adjusted based on the agreed scope of work.
Skills: OCR Tesseract Data mining
Hourly - Intermediate ($$) - Est. Time: Less than 1 month, 10-30 hrs/week - Posted
I need an OCR solution to convert the domain names published as images at the following link into text: https://www.afnic.fr/en/products-and-services/services/daily-list-of-registered-domain-names/ Please let me know: 1. How accurate the results can be (I need close to 100% accuracy). 2. What language stack you are going to use.
Skills: OCR Tesseract OCR algorithms
Hourly - Intermediate ($$) - Est. Time: 3 to 6 months, Less than 10 hrs/week - Posted
Hello, I am looking for a solution that will analyze a JPEG image file, extract the text, and dump it into a database in unstructured form. The next step would be to analyze that unstructured data and pull out the key pieces of information to create a structured database with specific fields. USE CASE: The idea is to take the daily image files put out by the county, which contain information such as deeds, debts, and liens, scan those images, extract all of the text within each document, and then add certain pieces of information to a database. For example, suppose an image file contains a notice of a mortgage for a property. The image would include information such as the "folio number" (the county's unique identifier for the property), the mortgage amount, and possibly the terms of the mortgage. I would like the solution to extract that data and load it into a database table so that I can link it with another table about the property. I was thinking of using Apache Tika to extract the data and then Pig to parse it. However, if you are an expert in this, maybe you have a better way.
Skills: OCR Tesseract Machine learning OCR algorithms OpenCV
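For the county-records posting above, the second stage (pulling fields out of OCR output and loading them into linked tables) might be sketched roughly as follows. This is only an illustration under assumptions: the OCR step is left out and the text is assumed to arrive as a plain string from whichever engine is chosen, SQLite stands in for the unspecified database, and the label patterns, table names, and the `extract_fields` and `store_document` helpers are hypothetical, not part of the posting.

```python
import re
import sqlite3

# Hypothetical label patterns; real county documents will need their own.
FOLIO_RE = re.compile(
    r"folio\s*(?:number|no\.?|#)?\s*[:\-]?\s*([\d\-]+)", re.IGNORECASE
)
AMOUNT_RE = re.compile(
    r"mortgage\s+amount\s*[:\-]?\s*\$?\s*([\d,]+(?:\.\d{2})?)", re.IGNORECASE
)


def extract_fields(text):
    """Pull the folio number and mortgage amount out of raw OCR text."""
    folio = FOLIO_RE.search(text)
    amount = AMOUNT_RE.search(text)
    return {
        "folio_number": folio.group(1) if folio else None,
        "mortgage_amount": (
            float(amount.group(1).replace(",", "")) if amount else None
        ),
    }


def store_document(conn, source_file, text):
    """Save the unstructured text, then the structured fields linked to it."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS raw_documents "
        "(id INTEGER PRIMARY KEY, source_file TEXT, body TEXT)"
    )
    conn.execute(
        "CREATE TABLE IF NOT EXISTS mortgages "
        "(document_id INTEGER REFERENCES raw_documents(id), "
        "folio_number TEXT, mortgage_amount REAL)"
    )
    cur = conn.execute(
        "INSERT INTO raw_documents (source_file, body) VALUES (?, ?)",
        (source_file, text),
    )
    fields = extract_fields(text)
    conn.execute(
        "INSERT INTO mortgages VALUES (?, ?, ?)",
        (cur.lastrowid, fields["folio_number"], fields["mortgage_amount"]),
    )
    conn.commit()
    return fields


if __name__ == "__main__":
    # Made-up sample text standing in for OCR output.
    sample = (
        "NOTICE OF MORTGAGE\n"
        "Folio Number: 01-2345-678-9012\n"
        "Mortgage Amount: $250,000.00\n"
        "Term: 30 years"
    )
    conn = sqlite3.connect(":memory:")
    print(store_document(conn, "notice.jpg", sample))
```

Keeping the raw text in its own table means the extraction rules can be revised and re-run later without repeating the OCR pass, and the folio number column gives a natural key for joining against a separate property table.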
Fixed-Price - Expert ($$$) - Est. Budget: $3,500 to $10,000 - Posted
Looking for someone who has worked on an OCR project using any open-source or proprietary engine. The data has to be extracted from images containing tabular data. The format is not fixed; the approach would be to identify label and value field pairs or, in some cases, columnar data.
Skills: OCR Tesseract Machine learning OCR algorithms
Hourly - Intermediate ($$) - Est. Time: 3 to 6 months, Less than 10 hrs/week - Posted
I'm looking for someone with experience performing Optical Character Recognition (OCR) on scanned PDFs. I have many thousands of scanned PDFs whose text I need for an internal project. The scanned PDFs contain blocks of text as well as tables that require OCR. The nature of this project requires the OCR to be as close to 100% accurate as possible. Any technology is acceptable (Tesseract, ABBYY, etc.) as long as that accuracy target is met. I will provide the files in PDF format via Dropbox, and the deliverables should be in .txt format (no formatting other than line breaks required). Proficiency in English (written and spoken) is a must-have requirement for this job, as you will need to communicate status updates and issues. There is a short-term need to digitize 2,000 files, with potential for follow-on work of up to 500 files a month thereafter.
Skills: OCR Tesseract Adobe PDF Computer vision English Spelling