Artificial Intelligence White Papers

Feature Preparation in Text Categorization

Overview Text categorization is an important application of machine learning to the field of document information retrieval. Most machine learning methods treat text documents as a feature vectors. The authors report text categorization accuracy for different types of features and different types of feature weights. The comparison of these classifiers shows that stemmed or un-stemmed single words as features give better classifier performance compared with other types of features, and LOG(tf)IDF weight as feature weight gives better classifier performance than other types of feature weights.

Further White Paper Details
PublisherOracle File FormatPDF
Date PublishedApril 2003 Downloads1
FormatWhite Papers   
Topics
  • Featured White Papers
Thin clients switch on digitally excluded

Thin clients switch on digitally excluded

Case study: Digital inclusion project tackles social exclusion in Liverpool more

Renault goes multilingual

Renault goes multilingual

Case study: Translation tech turns docs into 23 languages… more


Quick Sitemap Links: