Fuzzy-Rough Attribute Reduction with Application to Web Categorization

Research output: Contribution to journalArticlepeer-review

445 Citations (SciVal)
232 Downloads (Pure)

Abstract

Due to the explosive growth of electronically stored information, automatic methods must be developed to aid users in maintaining and using this abundance of information effectively. In particular, the sheer volume of redundancy present must be dealt with, leaving only the information-rich data to be processed. This paper presents a novel approach, based on an integrated use of fuzzy and rough set theories, to greatly reduce this data redundancy. Formal concepts of fuzzy-rough attribute reduction are introduced and illustrated with a simple example. The work is applied to the problem of web categorization, considerably reducing dimensionality with minimal loss of information. Experimental results show that fuzzy-rough reduction is more powerful than the conventional rough set-based approach. Classifiers that use a lower dimensional set of attributes which are retained by fuzzy-rough reduction outperform those that employ more attributes returned by the existing crisp rough reduction method.
Original languageEnglish
Pages (from-to)469-485
Number of pages17
JournalFuzzy Sets and Systems
Volume141
Issue number3
DOIs
Publication statusPublished - 2004

Keywords

  • Attribute reduction
  • Web categorization
  • Data redundancy

Fingerprint

Dive into the research topics of 'Fuzzy-Rough Attribute Reduction with Application to Web Categorization'. Together they form a unique fingerprint.

Cite this