Knowledge and Data Management White Papers
Provenance as Data Mining: Combining File System Metadata With Content Analysis
Overview Provenance describes how an object came to be in its present state. Thus, it describes the evolution of the object over time. Prior work on provenance has focussed on databases and the file system. The database or file system is enhanced or augmented in order to capture additional information about the historical evolution of document collections, and thus answer the provenance question. The paper addresses the question of provenance for unstructured information (i.e., document corpii from file systems) but without any enhancements to the file system. To provide a solution in this setting, the paper models the provenance problem in such a setting as a problem of data mining. The paper shows that data mining can provide provenance information for repositories of unstructured information, including chains of historical evolution.
| Publisher | Hewlett-Packard (HP) | File Format | |
|---|---|---|---|
| Date Published | February 2009 | ||
| Format | White Papers | ||
| Topics | |||
Accelerating Enterprise Data Governance Part 1
In the first of this series of three white papers, Mike Ferguson of Intelligent Business Strategies defines what data governance is and then looks at the requirements that need to...
Data Governance for Master Data Management and Beyond
There is growing interest on behalf of both data management professionals and senior business managers to understand the motivations, mechanics, and benefits of instituting data governance within an organization. This...
Getting Started with Master Data Management
Master data management forms part of an overall enterprise governance program that aims to establish trusted data throughout the enterprise. This white paper from Mike Ferguson of Intelligent Business Strategies...
Five Steps to More Valuable Enterprise Data
Companies worldwide struggle with inconsistent, inaccurate or unreliable data - and often don't know how to build more useful corporate information. This white paper examines a five-step method for...
The Journey Along an Information-Led Transformation
A shift is underway from simple automation to business optimization, and information is at the center of it. Information, when aligned with your business strategy, holds the key to driving profitable...



