Hourly - Expert ($$$) - Est. Time: 1 to 3 months, Less than 10 hrs/week
Looking for a proven professional in the field of OCR. The project involves developing a new OCR setup, or configuring an existing one, to support a text extraction application. Relevant applicants will be provided with additional details.
Skills: OCR Tesseract OCR algorithms
Hourly - Expert ($$$) - Est. Time: Less than 1 week, Less than 10 hrs/week
We have two books. One is 1,200 pages in Rashi font, and the second is 1,200 pages of Hebrew with Niqqud. We have attached only one page of each as a 5 MB sample. Each page needs its header, margin, and footnotes electronically trimmed off. The margin content (verse and chapter numbers) and the other trimmed material also need to be saved under the corresponding page/file number. Some pages may need fine alignment (deskewing) before Tesseract is run. Select all 27 individual letter shapes, plus the 15 Niqqud points and the cantillation marks, and convert these ligatures from raster to vector for each book. We hope to get 99.99% of the lines of text in good order and entered into a spreadsheet line by line. The Niqqud points must be kept from interfering with the correctly sequenced lines of letters. We expect the accuracy of the Niqqud points to be lower. Select for us the 3 best samples of each ligature, giving 3 sets (good/better/best) that we can later adapt to a keyboard or use for reprinting. The best of the three (or a combination of them) is what must work in the tuned Tesseract product. We expect the complete software pipeline with README instructions, in addition to the two spreadsheet outputs from the two Tesseract runs. Each book's actual page number must correspond to the page number in the spreadsheet, and we should be able to reproduce the OCR output with the open-source software you provide.
Skills: OCR Tesseract Data Entry Editing Internet research
Hourly - Expert ($$$) - Est. Time: Less than 1 month, 30+ hrs/week
I'm looking for someone to create an OCR system using OpenCV. The app must recognize my card images. The cards appear in blurry images captured with a web camera. The candidate must be an expert in C++ and OpenCV with more than 5 years of experience. Some images contain overlapping cards. If you can finish this task on time, please reply to me. Thanks and regards.
Skills: OCR Tesseract C# C++ Data Science
Hourly - Expert ($$$) - Est. Time: Less than 1 week, Less than 10 hrs/week
This is a proof-of-concept project. Candidates must be experts in Tesseract OCR. If we can extract the test data reliably, we plan to start work on a large project. Ultimately we want to hire a developer, full or part time, to be our lead on the OCR project.
We will be setting up a VPS:
- Need you to install the Linux OS you recommend.
- Need you to install Tesseract.
- Need consultant to install any other tools needed for this software to work.
- Need consultant to install the recommended database, probably MySQL.
- We will give a sample set of 3 documents. See attached examples.
This test program will:
- read these three files from a folder.
- extract approximately 20 data fields from the three test files.
- write the data to a database.
- create a tabular output with the fields and the field contents.
Skills: OCR Tesseract
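For applicants scoping the proof-of-concept posting above, the four steps of its test program (read files from a folder, extract fields, write to a database, produce a tabular output) could be sketched roughly as follows. This is only an illustration under several assumptions that are not in the posting: the field names and regular expressions are hypothetical placeholders for the roughly 20 real fields, SQLite from the Python standard library stands in for the MySQL database the posting mentions, and the OCR step shells out to the `tesseract` command-line tool, which must be installed separately.

```python
import re
import sqlite3
import subprocess
from pathlib import Path

# Hypothetical field patterns; the real fields depend on the sample documents.
FIELD_PATTERNS = {
    "invoice_number": re.compile(r"Invoice\s*(?:No\.?|#)\s*:?\s*(\S+)", re.I),
    "date": re.compile(r"Date\s*:?\s*(\d{4}-\d{2}-\d{2})"),
    "total": re.compile(r"Total\s*:?\s*\$?([\d,]+\.\d{2})"),
}


def ocr_with_tesseract_cli(path):
    """Run the tesseract CLI on one image and return the recognized text."""
    result = subprocess.run(
        ["tesseract", str(path), "stdout"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout


def extract_fields(text):
    """Apply each pattern to the text; fields that are not found map to None."""
    fields = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(text)
        fields[name] = match.group(1) if match else None
    return fields


def process_folder(folder, conn, ocr=ocr_with_tesseract_cli):
    """OCR every file in the folder and store one row per (file, field)."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS extracted (file TEXT, field TEXT, value TEXT)"
    )
    for path in sorted(Path(folder).iterdir()):
        if not path.is_file():
            continue
        fields = extract_fields(ocr(path))
        conn.executemany(
            "INSERT INTO extracted VALUES (?, ?, ?)",
            [(path.name, name, value) for name, value in fields.items()],
        )
    conn.commit()


def tabular_report(conn):
    """Return the stored fields as a plain-text table."""
    rows = conn.execute(
        "SELECT file, field, value FROM extracted ORDER BY file, field"
    ).fetchall()
    lines = [f"{'file':<20}{'field':<20}value"]
    for file, field, value in rows:
        lines.append(f"{file:<20}{field:<20}{value if value is not None else ''}")
    return "\n".join(lines)
```

The `ocr` parameter is a deliberate seam: passing a different callable lets the extraction and database steps be exercised without Tesseract installed, and swapping `sqlite3` for a MySQL driver would mainly change the connection setup and the placeholder syntax.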