We have web search engine project to complete.
The search engine will run on Fastcgi and in C language.
We need an organized programmer , skilled ,creative and really interested by a search engine p2p project. We value creativity,freedom,open-source and innovative.. Thinking outside of box.
We encourage you to use the available open source codes pertaining to reduce production time as lowest.
We need an efficient, optimal resource usage and fast engine. It will be hosted on freebsd server.
1- Server-side Web search engine platform run on fastcgi C connected to P2P network. so it will distribute their server index to p2p network.
2- Search ranking algorithm based on user-usage + relevance determined by algorithm,data mining and machine learning. Our crawler will determine subject of the indexed pages through artificial intelligence that will determine relevancy ranking.
3- Crawler robot that will index all the domains and their pages.. Date mining, algorithm, machine learning could be used to summarize the main content of page and used in search ranking. *** Crawler could extract list of domains from TLD zone file. After
After all the different tld domain list will be crawled. it will check back on oldest tld zone file server checked..if any new tld domain added... We could compress data to optimized ressource usage and you could give you input about resource usage.
4- Automated bot prevention mechanism to avoid any bot to use search function of site. ( we could discuss about the different type of prevention before project beginning)
5- P2P client available for the different os. ( we could build on yacey p2p project or we could start more scratch a P2P search engine client for the different os., You could evaluate http://yacy.net here and let me know what do you recommend to achieve the requirements.
We will submit an heuristic map of the main elements of project.
Please include reference DD-LE-F-12
Daily report of your progression by email.
We will provide project management tool on our server.
Include your cv updated with your cover page.
We will consider carefully every candidate.