We are looking for an experienced web developer for a scraping task. We need a script that can be run automatically to scrape the content of a web page. The URLs sit behind a login on the platform, which the script must pass using credentials we provide.
The content consists of several text paragraphs and images whose sources are loaded by an external JavaScript widget. The output (an XML file) should represent the content and its structure so that individual parts of the document can be integrated into any other CMS without great effort. Additionally, it would be nice to have different output formats available, such as PDF, DOC, CSV or XML. Please contact us for a detailed description and to discuss your pricing options.
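To give a rough idea of the kind of structured XML output we have in mind, here is a minimal sketch in Python. It is only an illustration: the function name `build_document`, the element names (`document`, `paragraph`, `image`), the `position` attribute and the shape of the input list are all assumptions made for this example, not a fixed schema. Logging in and rendering the JavaScript widget are not covered here.

```python
import xml.etree.ElementTree as ET


def build_document(url, blocks):
    """Serialize already-scraped content blocks into an XML string.

    `blocks` is a hypothetical intermediate format: a list of dicts such as
    {"type": "paragraph", "text": "..."} or
    {"type": "image", "src": "...", "alt": "..."}.
    """
    # Root element records which page the content came from.
    root = ET.Element("document", {"source": url})

    # Keep the original order so the page structure can be rebuilt in a CMS.
    for position, block in enumerate(blocks, start=1):
        if block["type"] == "paragraph":
            node = ET.SubElement(root, "paragraph", {"position": str(position)})
            node.text = block["text"]
        elif block["type"] == "image":
            ET.SubElement(
                root,
                "image",
                {
                    "position": str(position),
                    "src": block["src"],
                    "alt": block.get("alt", ""),
                },
            )

    # ElementTree escapes special characters such as "&" on serialization.
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    sample = [
        {"type": "paragraph", "text": "First paragraph of the article."},
        {"type": "image", "src": "https://example.com/image.png", "alt": "Example"},
    ]
    print(build_document("https://example.com/article", sample))
```

Keeping each paragraph and image as its own element with an explicit position is one way to let a target CMS import only selected parts of the document; the actual schema is open for discussion.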
References: Escenic Content Syndication,