Knowledge and Data Management White Papers

SAS Data Quality - Cleanse: Techniques for Merge/Purge on Very Large Datasets

Overview

The SAS System can perform merge/purge - the process of combining lists and removing duplicates - with the functionality provided by SAS Data Quality-Cleanse ("SAS-DQ"). In developing SAS programs for monthly merge/purge on hundreds of millions of records, the authors encountered various challenges, including how to significantly decrease processing time and how to combine partially overlapping duplicate groups based on various criteria.
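To illustrate what combining partially overlapping duplicate groups involves, here is a minimal Python sketch. It is not SAS-DQ code and not the authors' method: it assumes each matching criterion has already produced groups of record IDs, and it uses a generic union-find structure to join any groups that share a record. The function name `merge_overlapping_groups` and the sample IDs are hypothetical.

```python
def merge_overlapping_groups(groups):
    """Merge duplicate groups that share at least one record ID.

    `groups` is an iterable of iterables of hashable record IDs
    (a hypothetical input shape, chosen for illustration).
    """
    parent = {}

    def find(x):
        # Register unseen IDs as their own root, then walk up to the root.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving keeps trees shallow
            x = parent[x]
        return x

    def union(a, b):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    # Link every member of a group to the group's first member.
    for group in groups:
        members = list(group)
        if not members:
            continue
        find(members[0])
        for other in members[1:]:
            union(members[0], other)

    # Collect records by their final root.
    merged = {}
    for record_id in list(parent):
        merged.setdefault(find(record_id), set()).add(record_id)
    return sorted(sorted(ids) for ids in merged.values())


# Groups found by two hypothetical criteria (e.g. name match, address match).
by_name = [[1, 2], [3, 4]]
by_address = [[2, 3], [5, 6]]
combined = merge_overlapping_groups(by_name + by_address)
print(combined)
```

Records 1 to 4 end up in one group because the address match on records 2 and 3 bridges the two name-based groups, while records 5 and 6 stay separate.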

Further White Paper Details
Publisher: SAS Institute
File Format: PDF
Date Published: April 2004
Downloads: 1
Format: White Papers