Noisy Instance Removal Using OWA-Based Fuzzy-Rough Sets

Richard Jensen, Neil Mac Parthalain, Mehran Amiri, Jorg Cassens

Research output: Chapter in Book/Report/Conference proceedingConference Proceeding (Non-Journal item)

Abstract

The reduction of the number of data instances is an important research area, particularly with a view to a reduction in the space requirements for lazy learning algorithms such as kNN. Previously, a fuzzy-rough prototype selection algorithm was proposed for this purpose, called OWAFRDC. This approach uses a criterion based on the upper and lower approximations of fuzzy-rough sets to assess the typicality of dataset instances. OWAFRDC was shown to preserve high quality instances and discard low quality instances. In this paper, a new instance quality criterion/measure is introduced to assess the quality of instances. The new criterion factors in the noisiness of instances in addition to their typicality. A numerical measure is calculated for each instance of a dataset based on the two mentioned criteria. The calculated values are used in the OWAFRDC algorithm to deliver condensed datasets. Non-parametric statistical tests show that the introduced quality measure improves the performance of OWAFRDC in terms of both accuracy and reduction rate.
Original languageEnglish
Title of host publicationADVANCES IN COMPUTATIONAL INTELLIGENCE SYSTEMS, UKCI 2022
EditorsG Panoutsos, M Mahfouf, LS Mihaylova
Place of PublicationGEWERBESTRASSE 11, CHAM, CH-6330, SWITZERLAND
PublisherSpringer Nature
Pages37-48
Number of pages12
Volume1454
ISBN (Print)978-3-031-55567-1; 978-3-031-55568-8
DOIs
Publication statusPublished - 2024

Publication series

NameAdvances in Intelligent Systems and Computing
PublisherSPRINGER INTERNATIONAL PUBLISHING AG

Keywords

  • Fuzzy-rough sets
  • prototype selection
  • nearest neighbour algorithm
  • noisy data
  • instance quality measure

Fingerprint

Dive into the research topics of 'Noisy Instance Removal Using OWA-Based Fuzzy-Rough Sets'. Together they form a unique fingerprint.

Cite this