
The Feature Difference Coefficient: Classification by Means of Feature Distributions
Authors: Ulli Waltinger and Alexander Mehler
Pub/Conf: Text Mining Services (TMS), March 23-25, Leipzig, 2009
Abstract:
This paper presents a model of text classification based on feature frequency distributions. The proposed algorithm is sensitive not only to linguistic but also to structural features, and it calculates a unified fingerprint for each category. Classification is done by finding the closest match among the previously learned category models using a simple distance metric. The approach is evaluated in three classification scenarios: language identification, text classification based on the Reuters corpus, and web genre classification.
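The general idea described in the abstract (one frequency-based fingerprint per category, then a nearest-match lookup) can be illustrated with a minimal sketch. This is not the paper's Feature Difference Coefficient, whose definition is not given here. The feature type (character bigrams), the L1 distance, and the function names `profile`, `train`, `distance`, and `classify` are all assumptions chosen for illustration.

```python
from collections import Counter


def profile(text, n=2):
    """Relative frequency distribution of character n-grams (assumed feature type)."""
    grams = [text[i:i + n] for i in range(len(text) - n + 1)]
    total = len(grams)
    if total == 0:
        return {}
    return {gram: count / total for gram, count in Counter(grams).items()}


def train(labeled_texts, n=2):
    """Build one fingerprint per category by pooling that category's texts."""
    pooled = {}
    for label, text in labeled_texts:
        pooled.setdefault(label, []).append(text)
    return {label: profile(" ".join(texts), n) for label, texts in pooled.items()}


def distance(p, q):
    """L1 distance between two distributions (a stand-in, not the paper's metric)."""
    return sum(abs(p.get(k, 0.0) - q.get(k, 0.0)) for k in set(p) | set(q))


def classify(text, fingerprints, n=2):
    """Return the category whose fingerprint is closest to the text's profile."""
    doc = profile(text, n)
    return min(fingerprints, key=lambda label: distance(doc, fingerprints[label]))


# Hypothetical toy training data for a language identification scenario.
samples = [
    ("en", "the quick brown fox jumps over the lazy dog"),
    ("de", "der schnelle braune fuchs springt über den faulen hund"),
]
fingerprints = train(samples)
guess = classify("the dog and the fox", fingerprints)
```

With realistic corpora the fingerprints would be learned from many documents per category, and structural features (for example markup or layout counts) could be added to the same distribution alongside the textual ones.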
BibTeX:
@inproceedings{1904094,
  author   = {Waltinger, Ulli and Mehler, Alexander},
  language = {English},
  series   = {Proceedings of Text Mining Services (TMS), March 23-25, Leipzig, Germany},
  title    = {The Feature Difference Coefficient: Classification by Means of Feature Distributions},
  year     = {2009},
}