Data Mining - Analysis White Papers

Record Linkage: A Machine Learning Approach, a Toolbox, and a Digital Government Web Service

Overview Data cleaning is a vital process that ensures the quality of data stored in real-world databases. Data cleaning problems are frequently encountered in many research areas, such as knowledge discovery in databases, data warehousing, system integration and eservices. The process of identifying the record pairs that represent the same entity (duplicate records), commonly known as record linkage, is one of the essential elements of data cleaning. This paper addresses the record linkage problem by adopting a machine learning approach.

Further White Paper Details
PublisherHewlett-Packard File FormatPDF
Date PublishedJuly 2003 Downloads2
FormatWhite Papers   
Topics

Quick Sitemap Links: