I am looking for someone with experience using OCR frameworks and libraries.
AIM: EXTRACT TEXT FROM SCANNED PDF IMAGES AND SAVE THE DATA TO A DATABASE
WHAT IS TO BE DONE
We want someone with knowledge of optical character recognition (OCR) to implement a program that takes a batch of scanned PDF files, extracts the text from the page images, and sends the extracted data to a database with the required fields.
The scanned documents vary in quality, but we still require accurate text capture.
The first and most critical stage is a proof of concept (POC) that demonstrates the ability to extract data from the different ID images.
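To make the expected workflow concrete, the batch process described above could look roughly like the Python sketch below. It is an illustration only, not a required design. It assumes the pdf2image and pytesseract packages (with Poppler and Tesseract installed) for the OCR step and uses SQLite as a stand-in database. The table name id_records, the field names id_number and full_name, and the regular expressions are all hypothetical, since the actual required fields and ID layouts are not specified here.

```python
import re
import sqlite3
from typing import Callable, Dict, Iterable

# Hypothetical field patterns; real ID layouts will need their own rules.
FIELD_PATTERNS = {
    "id_number": re.compile(r"ID\s*(?:No\.?|Number)\s*[:\-]?\s*([A-Z0-9]+)", re.I),
    "full_name": re.compile(r"Name\s*[:\-]?\s*([A-Za-z][A-Za-z .'\-]*)", re.I),
}


def ocr_pdf(path: str) -> str:
    """OCR every page of a scanned PDF and return the combined text.

    Assumes pdf2image and pytesseract are installed; they are imported
    lazily so the rest of the sketch runs without them.
    """
    from pdf2image import convert_from_path
    import pytesseract

    pages = convert_from_path(path, dpi=300)  # one PIL image per page
    return "\n".join(pytesseract.image_to_string(page) for page in pages)


def parse_fields(text: str) -> Dict[str, str]:
    """Pull the (hypothetical) required fields out of raw OCR text."""
    fields = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(text)
        fields[name] = match.group(1).strip() if match else ""
    return fields


def process_batch(
    pdf_paths: Iterable[str],
    conn: sqlite3.Connection,
    ocr: Callable[[str], str] = ocr_pdf,
) -> int:
    """OCR each PDF, parse its fields, and store one row per file."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS id_records ("
        "source_file TEXT, id_number TEXT, full_name TEXT, raw_text TEXT)"
    )
    count = 0
    for path in pdf_paths:
        text = ocr(str(path))
        fields = parse_fields(text)
        conn.execute(
            "INSERT INTO id_records VALUES (?, ?, ?, ?)",
            (str(path), fields["id_number"], fields["full_name"], text),
        )
        count += 1
    conn.commit()
    return count
```

The OCR step is passed in as a function so that a different engine, or an image clean-up step for poor-quality scans, can be swapped in without touching the database code. The raw text is stored alongside the parsed fields so that records with empty fields can be reviewed later.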
DO NOT APPLY IF YOU DO NOT HAVE EXPERIENCE WITH OCR.