I have installed Alfresco Community 5.0.d on Ubuntu 14.04 Desktop and am looking to integrate it with an OCR engine such as Tesseract, so that scanned PDF images get a text layer and become searchable. Currently my scanned images go into Alfresco as PDF files, but Alfresco
does not differentiate between scanned and standard PDF files, so OCR is needed.
Tesseract must process my uploaded scanned PDF files, and any image files as well.
I will need a script and an XML bean file.
You can either do it for me remotely or send me the files with installation instructions.
I am just starting out with Alfresco and am looking for a good, reliable freelancer for long-term work.
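
To make the script part of the request more concrete, here is a rough sketch of the kind of command planning such a script might do. It is only an illustration under assumptions: the function names, the `/tmp/ocr` working directory, and the `_ocr` / `_page` suffixes are hypothetical, and it assumes the `tesseract` binary (with its `pdf` output config) and `pdftoppm` from poppler-utils are installed. It only builds the command lines and does not run them, and it does not cover the Alfresco XML bean wiring.

```python
import posixpath

# Image types handed straight to Tesseract (assumed list, adjust as needed).
IMAGE_EXTS = {".png", ".jpg", ".jpeg", ".tif", ".tiff"}


def rasterize_command(pdf_path, page_prefix, dpi=300):
    """Build a pdftoppm call that renders each PDF page to a PNG file.

    Tesseract does not read PDF input directly, so scanned PDFs are
    rasterized first.
    """
    return ["pdftoppm", "-r", str(dpi), "-png", pdf_path, page_prefix]


def ocr_command(image_path, out_base, lang="eng"):
    """Build a Tesseract call that writes a searchable PDF to out_base + '.pdf'."""
    return ["tesseract", image_path, out_base, "-l", lang, "pdf"]


def plan(path, work_dir="/tmp/ocr"):
    """Return the first command(s) to run for an uploaded file.

    Images get a single OCR command. PDFs get only the rasterize step here;
    the page images it produces would each need ocr_command() afterwards,
    and the per-page PDFs would then have to be merged (not shown).
    """
    base, ext = posixpath.splitext(posixpath.basename(path))
    ext = ext.lower()
    if ext in IMAGE_EXTS:
        return [ocr_command(path, posixpath.join(work_dir, base + "_ocr"))]
    if ext == ".pdf":
        return [rasterize_command(path, posixpath.join(work_dir, base + "_page"))]
    raise ValueError("unsupported file type: " + ext)
```

Each returned list could be passed to `subprocess.check_call` once the tools are confirmed to be present on the server.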