Knowledge and Data Management White Papers

Improving Encarta Search Engine Performance by Mining User Logs

Overview This paper proposes a data-mining approach that produces generalized query patterns (with generalized keywords) from the raw user logs of the Microsoft Encarta search engine (http://encarta.msn.com). Those query patterns can act as cache of the search engine, improving its performance. The cache of the generalized query patterns is more advantageous than the cache of the most frequent user queries since the patterns are generalized, covering more queries and future queries - even those not previously asked. The method discussed is unique since query patterns discovered reflect the actual dynamic usage and user feedbacks of the search engine, rather than the syntactic linkage structure of web pages (as Google does).

Further White Paper Details
PublisherMicrosoft File FormatPDF
Date PublishedNovember 2002
FormatWhite Papers   
Topics

Quick Sitemap Links: