Missing data imputation using fuzzy-rough methods

Mehran Amiri, Richard Jensen

Research output: Contribution to journalArticlepeer-review

128 Citations (Scopus)
637 Downloads (Pure)

Abstract

Missing values exist in many generated datasets in science. Therefore, utilizing missing data imputation methods is a common and important practice. These methods are a kind of treatment for uncertainty and vagueness existing in datasets. On the other hand, methods based on fuzzy-rough sets provide excellent tools for dealing with uncertainty, possessing highly desirable properties such as robustness and noise tolerance. Furthermore, they can find minimal representations of data and do not need potentially erroneous user inputs. As a result, utilizing fuzzy-rough sets for imputation should be an effective approach. In this paper, we propose three missing value imputation methods based on fuzzy-rough sets and its recent extensions; namely, implicator/t-norm based fuzzy-rough sets, vaguely quantified rough sets and also ordered weighted average based rough sets. These methods are compared against 11 state-of-the-art imputation methods implemented in the KEEL data mining software on 27 benchmark datasets. The results show, via non-parametric statistical analysis, that the proposed methods exhibit excellent
performance in general.
Original languageEnglish
Pages (from-to)152-164
JournalNeurocomputing
Volume205
Early online date09 May 2016
DOIs
Publication statusPublished - 12 Sept 2016

Keywords

  • Missing value imputation
  • fuzzy-rough sets
  • OWA
  • VQRS

Fingerprint

Dive into the research topics of 'Missing data imputation using fuzzy-rough methods'. Together they form a unique fingerprint.

Cite this