Design of a Personalized Domain Specific Web Crawler

  • U.K.Balajisaravanan,K.Karthick,S.Rajkumar,M.Murali,N.Selvanathan

Abstract

The World Wide Web (WWW) contains a huge amount of information which is easily accessible to the users. The users should make use of some tools for gathering information from the Web. Search engines are used to collect information and present them as search results. But they generate millions of Web pages for a single keyword. Users will not be able to find their specific information which is clubbed with that millions of Web pages which are gathered by search engines and also searching for specific data is cumbersome. Due to this drawback, a personalized tool is to be implemented. In this work, a crawler design is proposed to gather pages that are relevant to a particular user or group of users. A graphical user interface is designed for the users to interact with the crawler. This interface integrates with the topic taxonomy to provide more reasonable seed URLs. The search system with this URL suggestion module enhances the relevancy of URLs collected. The empirical results provide enough proof for the quality of results obtained.

Published
2020-07-01
How to Cite
U.K.Balajisaravanan,K.Karthick,S.Rajkumar,M.Murali,N.Selvanathan. (2020). Design of a Personalized Domain Specific Web Crawler. International Journal of Advanced Science and Technology, 29(7), 12162 - 12167. Retrieved from http://sersc.org/journals/index.php/IJAST/article/view/27907
Section
Articles