White Papers

Collaborative Web Crawling: Information Gathering/Processing Over Internet

Overview The main objective of the IBM Grand Central Station (GCS) is to gather information of virtually any type of formats (text, data, image, graphics, audio, video) from the cyberspace, to process/index/summarize the information, and to push the right information to the right people. Because of the very large scale of the cyberspace, parallel processing in both crawling/gathering and information processing is indispensable. This paper presents a scalable method for collaborative web crawling and information processing.

Further White Paper Details
PublisherIBM File FormatPDF
Date PublishedNovember 2004
FormatWhite Papers   
Topics
    N/A
E4 embraces web 2.0 audience

E4 embraces web 2.0 audience

Case study: How the Channel 4's teen channel put its mind to building a community website... more

Cheat Sheet: Cloud computing

Cheat Sheet: Cloud computing

A tech storm is brewing...  more


Quick Sitemap Links: