I teach an introductory course in Decision Support using Prolog, and I have assigned my students a project on tokenization in Prolog. I would like a developer to write a model solution to the problem, which will allow me to establish a baseline for grading the students' solutions.
The key concepts to use are: (1) tokenization, (2) stopword removal, (3) stemming, (4) inverted index, (5) PageRank, (6) entity identification, and (7) tf-idf.
Since this is an introductory course, your solution should not use advanced Prolog concepts. The key Prolog concepts we have taught are: (1) lists, (2) relationships, (3) predicates, and (4) unification.
Please apply only if you know Prolog well. This program should be easy for someone who knows Prolog, and I have assigned a budget that I believe is fair. It would not be fair to either of us if you had to learn the language first and ended up spending far more time on this project than the budget covers.
Successful completion of the project may make you eligible for follow-up projects.