A Novel Architecture for Search Engine using
[摘要] Search engines, an information retrieval tool are the main source of information for users’ information need now aday. For every query, the search engine explores its repository and/or indexer to find the relevant documents/URLs for thatquery. Page ranking algorithms rank the Uniform Resource Locator in abstract section (URLs) according to its relevancy withrespect to users’ query. It is analyzed that many of the queries fired by users on search engines are duplicate. There is a scopeto improve the performance of search engine to reduce its efforts for duplicate queries. In this paper a proxy server is createdthat keep store the search results of user queries in web log. The proposed proxy server uses this web log to find results fasterfor duplicate queries fired next time. The proposed scheme has been tested and found prominent. The proposed architecturetested for ten duplicate user queries. it return all relevant web pages for duplicate user query (if query is found in web log atproxy server) from a particular domain instead of entire database. It reduces the perceived latency for duplicate query andalso improves the value of precession and accuracy up to 81.8% and 99% respectively for all duplicate user queries.
[发布日期] [发布机构]
[效力级别] [学科分类] 计算机科学(综合)
[关键词] Search engine;information retrieval;web usage mining;content mining [时效性]