I have a data set of about 15 000 annual reports in PDF formats. I Have get about 1200 new reports/week. I need to do three things with this data set. This may be solved by using existing open source software, licensed soft ware or build something new. 1. Build a process tool to convert PDF->Word+Excel. Find a way to automise this. 2. Build a tool to search through PDF and find all annual reports that contain a specific word or phrase 3. Find and extract KPI:s from income statement and balance sheet and extract to a database.