TY - JOUR
T1 - Nearest-Neighbor Guided Evaluation of Data Reliability and Its Applications
AU - Boongoen, Tossapon
AU - Shen, Qiang
N1 - Funding Information:
Manuscript received July 24, 2009; revised November 19, 2009; accepted January 18, 2010. Date of publication April 26, 2010; date of current version November 17, 2010. This work was supported by the U.K. Engineering and Physical Sciences Research Council Grant EP/D057086. This paper was recommended by Associate Editor J. Basak.
PY - 2010/12
Y1 - 2010/12
N2 - The intuition of data reliability has recently been incorporated into the main stream of research on ordered weighted averaging (OWA) operators. Instead of relying on human-guided variables, the aggregation behavior is determined in accordance with the underlying characteristics of the data being aggregated. Data-oriented operators such as the dependent OWA (DOWA) utilize centralized data structures to generate reliable weights, however. Despite their simplicity, the approach taken by these operators neglects entirely any local data structure that represents a strong agreement or consensus. To address this issue, the cluster-based OWA (Clus-DOWA) operator has been proposed. It employs a cluster-based reliability measure that is effective to differentiate the accountability of different input arguments. Yet, its actual application is constrained by the high computational requirement. This paper presents a more efficient nearest-neighbor-based reliability assessment for which an expensive clustering process is not required. The proposed measure can be perceived as a stress function, from which the OWA weights and associated decision-support explanations can be generated. To illustrate the potential of this measure, it is applied to both the problem of information aggregation for alias detection and the problem of unsupervised feature selection (in which unreliable features are excluded from an actual learning process). Experimental results demonstrate that these techniques usually outperform their conventional state-of-the-art counterparts.
AB - The intuition of data reliability has recently been incorporated into the main stream of research on ordered weighted averaging (OWA) operators. Instead of relying on human-guided variables, the aggregation behavior is determined in accordance with the underlying characteristics of the data being aggregated. Data-oriented operators such as the dependent OWA (DOWA) utilize centralized data structures to generate reliable weights, however. Despite their simplicity, the approach taken by these operators neglects entirely any local data structure that represents a strong agreement or consensus. To address this issue, the cluster-based OWA (Clus-DOWA) operator has been proposed. It employs a cluster-based reliability measure that is effective to differentiate the accountability of different input arguments. Yet, its actual application is constrained by the high computational requirement. This paper presents a more efficient nearest-neighbor-based reliability assessment for which an expensive clustering process is not required. The proposed measure can be perceived as a stress function, from which the OWA weights and associated decision-support explanations can be generated. To illustrate the potential of this measure, it is applied to both the problem of information aggregation for alias detection and the problem of unsupervised feature selection (in which unreliable features are excluded from an actual learning process). Experimental results demonstrate that these techniques usually outperform their conventional state-of-the-art counterparts.
KW - Alias detection
KW - data reliability
KW - nearest neighbor
KW - ordered weighted averaging (OWA) aggregation
KW - unsupervised feature selection
KW - weight determination
UR - http://www.scopus.com/inward/record.url?scp=78649911087&partnerID=8YFLogxK
U2 - 10.1109/TSMCB.2010.2043357
DO - 10.1109/TSMCB.2010.2043357
M3 - Article
C2 - 20423807
SN - 1083-4419
VL - 40
SP - 1622
EP - 1633
JO - IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics)
JF - IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics)
IS - 6
M1 - 5454303
ER -