We are looking for a contractor to build data extraction programs that extract text and numeric data from similarly formatted PDF files and output it to Excel. The requirements are:
1. Develop working PDF extraction programs that output data to Excel in accordance with the template provided
2. The extraction program must be able to process PDFs in bulk, for as many files as needed (for example, 20,000 PDF files)
3. The program must locate the needed data within each PDF document (the fields are not all on the same page) and extract it accordingly
4. The program must be user-friendly; the job ends when the user is able to run the program without assistance
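To illustrate requirements 2 and 3, here is a minimal Python sketch of one possible approach, not a prescribed design. It assumes the text of each page has already been extracted by a PDF library of the contractor's choice (passed in as the hypothetical `read_pages` function), and that each schedule starts with a heading line such as "Schedule A". The heading patterns and all function names are illustrative assumptions; the real wording must be checked against the sample PDFs.

```python
import re
from typing import Callable, Dict, Iterable, List, Tuple

# Hypothetical heading patterns; confirm the actual wording in the sample PDFs.
SECTION_PATTERNS = {
    "Schedule A": re.compile(r"^\s*SCHEDULE\s+A\b", re.IGNORECASE | re.MULTILINE),
    "Schedule B": re.compile(r"^\s*SCHEDULE\s+B\b", re.IGNORECASE | re.MULTILINE),
}


def locate_sections(pages: List[str]) -> Dict[str, int]:
    """Return the 0-based page index where each needed section first appears."""
    found: Dict[str, int] = {"Page 1": 0} if pages else {}
    for index, text in enumerate(pages):
        for name, pattern in SECTION_PATTERNS.items():
            if name not in found and pattern.search(text):
                found[name] = index
    return found


def process_batch(
    paths: Iterable[str],
    read_pages: Callable[[str], List[str]],
) -> Tuple[List[dict], List[Tuple[str, str]]]:
    """Process many files; a failure in one file is recorded, not fatal."""
    results: List[dict] = []
    failures: List[Tuple[str, str]] = []
    for path in paths:
        try:
            sections = locate_sections(read_pages(path))
        except Exception as exc:  # keep going so one bad PDF cannot stop the run
            failures.append((path, str(exc)))
            continue
        results.append({"file": path, **sections})
    return results, failures
```

Recording failures separately, rather than stopping at the first unreadable file, is what makes a run over something like 20,000 PDFs practical: the user gets a list of files to review by hand at the end.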
The needed fields are highlighted in the file “Extraction Example Highlighted” at the link below.
The data to be extracted is on Page 1, Schedule A, and Schedule B.
More examples are available at the same link.
PDF examples link: https://www.dropbox.com/sh/bfs01n2zcpk8p7n/AADWo-jqU_FS0UcpZotoL_cra?dl=0
The attached Excel file is an example of the output template, filled in using the “Extraction Example Highlighted” PDF from the above link.
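As a rough illustration of writing output that follows a fixed template, the Python sketch below writes one row per PDF in a fixed column order. It is only a sketch under stated assumptions: the column names are hypothetical placeholders (the real ones come from the attached Excel template), and it writes CSV with the standard library as a stand-in, whereas the delivered program would need to produce an actual Excel workbook.

```python
import csv
import io
from typing import Iterable, List, Mapping, TextIO

# Hypothetical column names; replace with the headers in the Excel template.
TEMPLATE_COLUMNS = ["File", "Page 1 Field", "Schedule A Field", "Schedule B Field"]


def write_rows(
    records: Iterable[Mapping[str, str]],
    out: TextIO,
    columns: List[str] = TEMPLATE_COLUMNS,
) -> int:
    """Write records in template column order and return the row count.

    Missing fields are left blank and fields not in the template are dropped,
    so every output row has exactly the template's columns.
    """
    writer = csv.DictWriter(
        out,
        fieldnames=columns,
        restval="",
        extrasaction="ignore",
        lineterminator="\n",
    )
    writer.writeheader()
    count = 0
    for record in records:
        writer.writerow(record)
        count += 1
    return count


if __name__ == "__main__":
    buffer = io.StringIO()
    write_rows([{"File": "a.pdf", "Page 1 Field": "Acme"}], buffer)
    print(buffer.getvalue())
```

Leaving a missing field blank instead of raising an error keeps the row aligned with the template and makes gaps easy to spot when reviewing the output.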
Qualifications:
1. Significant experience developing automated programs that extract text and numeric data from PDFs and output it to Excel
2. Bachelor’s degree in Computer Science, IT, Data Science, Statistics, or a related field
In your application, please include at least 3 examples of your PDF data extraction work, along with the output data results.