Data Mining - Analysis White Papers
Record Linkage: A Machine Learning Approach, a Toolbox, and a Digital Government Web Service
Overview Data cleaning is a vital process that ensures the quality of data stored in real-world databases. Data cleaning problems are frequently encountered in many research areas, such as knowledge discovery in databases, data warehousing, system integration and eservices. The process of identifying the record pairs that represent the same entity (duplicate records), commonly known as record linkage, is one of the essential elements of data cleaning. This paper addresses the record linkage problem by adopting a machine learning approach.
| Publisher | Hewlett-Packard | File Format | |
|---|---|---|---|
| Date Published | July 2003 | Downloads | 2 |
| Format | White Papers | ||
| Topics | |||


