Graph clustering-based discretization of splitting and merging methods (GraphS and GraphM)

Kittakorn Sriwanna*, Tossapon Boongoen, Natthakan Iam-On

*Awdur cyfatebol y gwaith hwn

Allbwn ymchwil: Cyfraniad at gyfnodolynErthygladolygiad gan gymheiriaid

18 Dyfyniadau(SciVal)
27 Wedi eu Llwytho i Lawr (Pure)


Discretization plays a major role as a data preprocessing technique used in machine learning and data mining. Recent studies have focused on multivariate discretization that considers relations among attributes. The general goal of this method is to obtain the discrete data, which preserves most of the semantics exhibited by original continuous data. However, many techniques generate the final discrete data that may be less useful with natural groups of data not being maintained. This paper presents a novel graph clustering-based discretization algorithm that encodes different similarity measures into a graph representation of the examined data. The intuition allows more refined data-wise relations to be obtained and used with the effective graph clustering technique based on normalized association to discover nature graphs accurately. The goodness of this approach is empirically demonstrated over 30 standard datasets and 20 imbalanced datasets, compared with 11 well-known discretization algorithms using 4 classifiers. The results suggest the new approach is able to preserve the natural groups and usually achieve the efficiency in terms of classifier performance, and the desired number of intervals than the comparative methods.

Iaith wreiddiolSaesneg
Rhif yr erthygl21
Nifer y tudalennau39
CyfnodolynHuman-centric Computing and Information Sciences
Rhif cyhoeddi1
Dyddiad ar-lein cynnar03 Awst 2017
Dynodwyr Gwrthrych Digidol (DOIs)
StatwsCyhoeddwyd - 01 Rhag 2017
Cyhoeddwyd yn allanolIe

Ôl bys

Gweld gwybodaeth am bynciau ymchwil 'Graph clustering-based discretization of splitting and merging methods (GraphS and GraphM)'. Gyda’i gilydd, maen nhw’n ffurfio ôl bys unigryw.

Dyfynnu hyn