Content Management White Papers
Mirror, Mirror on the Web: A Study of Host Pairs with Replicated Content
Overview What exactly is mirroring? The term is often used in the Web research literature but a crisp definition is hard to come by. At one extreme one can say that two sites are mirrored if their content is byte-wise identical. In practice this definition is too restrictive: even on successive accesses to the same URL the fetched content may differ slightly because of dynamic components, timestamps, transaction-ids, etc. At the other extreme, one can say that two sites are mirrors if enough pages on one are very similar to pages on the other. This definition does not address the issue of structure. When two sites have different structures, the proof offered by content similarity is not compelling. For instance, consider the web sites of New York Times and Washington Post (two major US newspapers). On any given day, they are likely to have some syntactically similar pages, e.g., because they draw articles from common sources, or because they publish official documents. Nevertheless, the two sites can hardly be called mirrors.
| Publisher | World Wide Web Conference | File Format | HTML |
|---|---|---|---|
| Date Published | August 2003 | ||
| Format | White Papers | ||
| Topics | |||
Web Content Management Powers Highly Successful Partner, Customer, and Employee Portals
McDATA is the expert provider of hardware, software, and services that enable partners and customers around the world. Prior to leaving EMC, McDATA focused exclusively on engineering and "OEMing" its...
Podcast - Chinwag Live: Wobble 2.0 - 6th February 2007
When Web 2.0 collides with Bubble 2.0 is the result real, sustainable business or faddish pipe dreams? This was the question posed at Chinwag Live's lively inaugural event, and debated...
How to Use Lotus Domino to Publish Policies and Procedures Online
If you notice that people tend to use workarounds - such as making local copies of documents and exchanging these copies by email instead of reading documents on the corporate...
Beyond Web Conferencing—Web Collaboration Solutions
Web collaboration technologies are advancing at a rapid pace. New features and interfaces are appearing almost daily as vendors race to capture a piece of this multi-billion dollar market. As...
Lifecycle Fixed Content Manager 100 Series Solution: Archival WORM Storage on Magnetic Disks
The Lifecycle Fixed Content Manager 100 Series solution provides an easy to use, well controlled and safe storage system optimized for large-scale and long-term disk-based storage of archival data. Individual...


