We need to test conversion of PDF to HTML.
We will provide:
1. Source PDF
2. Zipped file output HTML.
Compare HTML with PDF to see if the content is extracted accurately. Items to look for:
1. Is all content extracted on all pages
2. Does the format of the HTML match the source PDF
Your output will be in the form of an excel spreadsheet that has the following:
a. File name
b. Error description
c. Page number for error.
The initial test will be for 100 documents.