Semantics-Preserving Dimensionality Reduction: Rough and Fuzzy-Rough-Based Approaches

Research output: Contribution to journalArticlepeer-review

616 Citations (Scopus)
507 Downloads (Pure)

Abstract

Semantics-preserving dimensionality reduction refers to the problem of selecting those input features that are most predictive of a given outcome; a problem encountered in many areas such as machine learning, pattern recognition, and signal processing. This has found successful application in tasks that involve data sets containing huge numbers of features (in the order of tens of thousands), which would be impossible to process further. Recent examples include text processing and Web content classification. One of the many successful applications of rough set theory has been to this feature selection area. This paper reviews those techniques that preserve the underlying semantics of the data, using crisp and fuzzy rough set-based methodologies. Several approaches to feature selection based on rough set theory are experimentally compared. Additionally, a new area in feature selection, feature grouping, is highlighted and a rough set-based feature grouping technique is detailed.
Original languageEnglish
Pages (from-to)1457-1471
Number of pages15
JournalIEEE Transactions on Knowledge and Data Engineering
Volume16
Issue number12
DOIs
Publication statusPublished - 2004

Fingerprint

Dive into the research topics of 'Semantics-Preserving Dimensionality Reduction: Rough and Fuzzy-Rough-Based Approaches'. Together they form a unique fingerprint.

Cite this