OCR Tesseract Jobs

12 were found based on your criteria

Fixed-Price - Intermediate ($$) - Est. Budget: $6 - Posted
I am looking for someone with experience using OCR frameworks and libraries. AIM: EXTRACT TEXT FROM PDF IMAGE SCANS AND SAVE THE DATA TO A DATABASE. WHAT IS TO BE DONE: We want someone with knowledge of optical character recognition to implement a program that will take a batch of PDF files, scan them, extract the text from the images, and send the extracted data to a database with the required fields. The scanned documents come in varying degrees of quality; however, we still require accurate text capture. The critical first stage is a POC showing the ability to extract data from the different ID images. DO NOT APPLY IF YOU DO NOT HAVE EXPERIENCE WITH OCR.
Skills: OCR Tesseract, OCR algorithms
Fixed-Price - Expert ($$$) - Est. Budget: $200 - Posted
Hi, pdftowordconvert.com is my website. I recently hired a developer who completed 70% of the site and then disappeared. I need an experienced developer who has built file conversion websites before. Please apply only if you can fix the conversions that are not working. I have attached a txt file with details about the site's current condition. Waiting for the best developer! Thanks
Skills: OCR Tesseract, CSS, FileMaker, HTML5
Fixed-Price - Intermediate ($$) - Est. Budget: $1,000 - Posted
Hello, I need a program that can monitor a link, put the shoe in the cart, and then check out. There is a lot of competition from other people's software, so I need a program that can automatically solve/bypass the Adidas captcha in less than 0.62 seconds while also grabbing the shoe and checking out as quickly as possible. I also need a server that is close to the Adidas server, so that ping time is low. I currently have programs that do this, if you want an example; however, I need it on a much larger scale.
Skills: OCR Tesseract, Bot Development, C, C#
Hourly - Expert ($$$) - Est. Time: 1 to 3 months, Less than 10 hrs/week - Posted
Looking for a proven professional in the field of OCR. The project includes developing / setting up existing OCR to support a text extraction application. Relevant applicants will be provided additional details.
Skills: OCR Tesseract, OCR algorithms
Hourly - Expert ($$$) - Est. Time: Less than 1 week, Less than 10 hrs/week - Posted
We have two books: one of 1200 pages in Rashi font, and a second of 1200 pages in Niqqud-Hebrew. We have attached only one page of each as a 5MB sample. Each page needs its header, margin, and footnotes electronically trimmed off. The margin (verse and chapter numbers) and these trimmings also need to be saved under the corresponding page/file number. Some pages may need fine Tesseract alignment (deskewing). Select all 27 individual letter shapes, plus the 15 Niqqud points and the cantillation marks, and convert these ligatures from raster to vector for each book. We hope to get 99.99% of the lines of text in good order and put into a spreadsheet line by line. We need to keep the Niqqud points from interfering with the correctly sequenced lines of letters. We expect the accuracy of the Niqqud points to be lower. Select for us a collection of the 3 best of each ligature, which will provide 3 sets (good/better/best) for us to adjust later to a keyboard or for reprinting. The best of the three (or a combination of them) is what must work in the tuned Tesseract product. We expect the complete software pipeline, with README instructions, to be sent to us in addition to the two spreadsheet outputs of the two Tesseract operations. Each book's proper page number must correspond to the page number in the spreadsheet, and we should be able to reproduce the OCR output with the open source software you provide.
Skills: OCR Tesseract, Data Entry, Editing, Internet research
Hourly - Expert ($$$) - Est. Time: Less than 1 month, 30+ hrs/week - Posted
I'm looking for someone to create an OCR system using OpenCV. The app must recognize my card images. The cards appear in blurry images captured with a web camera, and some of the images contain overlapping cards. The candidate must be an expert in C++ and OpenCV with more than 5 years of experience. If you can finish this task on time, please reply to me. Thanks. Regards.
Skills: OCR Tesseract, C#, C++, Data Science
Fixed-Price - Expert ($$$) - Est. Budget: $1,000 - Posted
In this project we need to implement an OCR tool that can analyze an image of a standard document (for example, an identity card or salary bill), extract specific values from the image (identity number, net salary, name, address, etc.), and compare them to the information in our DB for the same user who uploaded the images, in order to find a match or alert on a mismatch.
Skills: OCR Tesseract, OCR algorithms
Hourly - Expert ($$$) - Est. Time: Less than 1 week, Less than 10 hrs/week - Posted
This is a proof-of-concept project. Candidates must be experts in Tesseract OCR. If we can extract the test data reliably, then we plan to start work on a large project. Ultimately we want to hire a developer, full or part time, to be our lead on the OCR project. We will be setting up a VPS.
- Need you to install the Linux OS you recommend.
- Need you to install Tesseract.
- Need consultant to install any other tools needed for this software to work.
- Need consultant to install the recommended database. Probably MySQL.
- We will give a sample set of 3 documents. See attached examples.
This test program will:
- read these three files from a folder.
- extract approximately 20 data fields from the three test files.
- write the data to a database.
- create a tabular output with fields and the field contents.
Skills: OCR Tesseract
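The test program described in this posting could be sketched roughly as follows. This is only an illustration under stated assumptions: the `extract` callable is a hypothetical stand-in for whatever Tesseract-based field extraction is eventually written, the `extracted` table layout is invented for the example, and SQLite from the Python standard library stands in for the MySQL database the posting mentions.

```python
import sqlite3
from pathlib import Path
from typing import Callable, Dict


def process_folder(folder: str,
                   extract: Callable[[Path], Dict[str, str]],
                   conn: sqlite3.Connection) -> int:
    """Run `extract` on every file in `folder` and store the returned fields.

    `extract` is a hypothetical function mapping a file to {field: value};
    the real OCR step would live there. Returns the number of rows written.
    """
    # Assumed schema: one row per (file, field) pair.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS extracted "
        "(file TEXT, field TEXT, value TEXT)"
    )
    count = 0
    for path in sorted(Path(folder).iterdir()):
        if not path.is_file():
            continue
        for field, value in extract(path).items():
            conn.execute(
                "INSERT INTO extracted (file, field, value) VALUES (?, ?, ?)",
                (path.name, field, value),
            )
            count += 1
    conn.commit()
    return count


def tabular_output(conn: sqlite3.Connection) -> str:
    """Return the stored fields as tab-separated lines with a header row."""
    rows = conn.execute(
        "SELECT file, field, value FROM extracted ORDER BY file, field"
    ).fetchall()
    lines = ["file\tfield\tvalue"]
    lines += ["\t".join(row) for row in rows]
    return "\n".join(lines)
```

Keeping the extraction step behind a plain callable means the folder reading, database writing, and tabular output can be checked with a stub before any OCR tuning is done.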
Fixed-Price - Intermediate ($$) - Est. Budget: $1,000 - Posted
I am looking for an experienced developer to build a customised OCR system for internal use.
About the System
We receive original PDF documents (not scanned) from about 50 different providers via email. There are approximately 16 pieces of information we require from each PDF document; however, the location of the 16 pieces of information varies from provider to provider. Each provider has a unique ID, which may help with identifying the data locations. There will be validation rules, e.g. Field 1 + Field 2 must equal Field 3. The validation rules will be provided. There must be a user interface that enables our internal users to check the data that has been OCR'd; if exceptions (missing data or a formula that does not balance) are present, a user must be able to enter the data manually while viewing the PDF. There is a specific export process that must be adhered to. The original PDF must be renamed (post OCR) according to a specific naming protocol. The data that has been extracted must also be exported in a specific csv format; specs for each will be provided.
Key Features of the system
1. Must be automated and able to detach a PDF document from an email and then OCR the image.
2. User interface must allow a user to
  a. View the data that has been OCR'd per PDF, with the ability to overwrite the data if needed.
  b. Manage exceptions
  c. Export data both manually and automatically (at a set time)
3. Ability to easily add another PDF source.
4. Source code will be owned by us; the system must be documented and handed over to our own internal developers.
Skills: OCR Tesseract, .NET Framework, OCR algorithms
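As an illustration of the kind of validation rule this posting describes (Field 1 + Field 2 must equal Field 3, with missing data flagged as an exception), here is a minimal sketch. The field names `field_1`, `field_2`, `field_3` and the message wording are hypothetical; the actual rules are to be supplied by the client.

```python
from decimal import Decimal, InvalidOperation
from typing import Dict, List

# Hypothetical field names; the posting only gives the rule's general shape.
REQUIRED = ["field_1", "field_2", "field_3"]


def find_exceptions(record: Dict[str, str]) -> List[str]:
    """Return exception messages for one extracted record (empty if it passes)."""
    # Missing or empty values are exceptions a user would have to key in.
    problems = [f"missing: {name}" for name in REQUIRED if not record.get(name)]
    if problems:
        return problems
    try:
        # Decimal avoids binary floating-point rounding on currency-like values.
        f1, f2, f3 = (Decimal(record[name]) for name in REQUIRED)
    except InvalidOperation:
        return ["non-numeric value"]
    if f1 + f2 != f3:
        problems.append(f"formula failed: {f1} + {f2} != {f3}")
    return problems
```

A record for which the function returns a non-empty list would be routed to the manual review screen rather than exported automatically.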