Crynodeb
Data dimensionality has become a pervasive problem in many areas that require the learning of interpretable models. This has become particularly pronounced in recent years with the seemingly relentless growth in the size of datasets. Indeed, as the number of dimensions increases, the number of data instances required in order to generate accurate models increases exponentially. Feature selection has therefore become not only a useful step in the process of model learning, but rather an increasingly necessary one. Rough set and fuzzy-rough set theory have been used as such dataset pre-processors with much success, however the underlying time/space complexity of the subset evaluation metric is an obstacle to the processing of very large data. This paper proposes a general approach to this problem that employs a novel feature grouping step in order to alleviate the processing overhead for large datasets. The approach is framed within the context of (and applied to) fuzzy-rough sets, although it can be used with other subset evaluation techniques. The experimental evaluation demonstrates that considerable computational effort can be avoided, and as a result efficiency can be improved considerably for larger datasets.
Iaith wreiddiol | Saesneg |
---|---|
Teitl | 2014 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE) |
Man cyhoeddi | NEW YORK |
Cyhoeddwr | IEEE Press |
Tudalennau | 1488-1495 |
Nifer y tudalennau | 8 |
Statws | Cyhoeddwyd - 2014 |
Digwyddiad | Fuzzy Systems - Beijing, Beijing, Tsieina Hyd: 06 Gorff 2014 → 11 Gorff 2014 Rhif y gynhadledd: 23 |
Cyfres gyhoeddiadau
Enw | IEEE International Fuzzy Systems Conference Proceedings |
---|---|
Cyhoeddwr | IEEE |
ISSN (Argraffiad) | 1544-5615 |
Cynhadledd
Cynhadledd | Fuzzy Systems |
---|---|
Teitl cryno | FUZZ-IEEE-2014 |
Gwlad/Tiriogaeth | Tsieina |
Dinas | Beijing |
Cyfnod | 06 Gorff 2014 → 11 Gorff 2014 |