Knowledge and Data Management White Papers

Ranking Interesting Subspaces for Clustering High Dimensional Data

Overview Application domains such as life sciences, e.g. molecular biology produce a tremendous amount of data which can no longer be managed without the help of efficient and effective data mining methods. One of the primary data mining tasks is clustering. However, traditional clustering algorithms often fail to detect meaningful clusters because of the high dimensional, inherently sparse feature space of most real-world data sets. Nevertheless, the data sets often contain clusters hidden in various subspaces of the original feature space. The paper presents a pre-processing step for traditional clustering algorithms, which detects all interesting subspaces of high-dimensional data containing clusters.

Further White Paper Details
PublisherSpringer Science+Business Media File FormatPDF
Date PublishedJanuary 2007
FormatWhite Papers   
Topics

Quick Sitemap Links: