A new clustering algorithm with multiple runs of iterative procedures

Qiwen Zhang, Roger D. Boyle*

*Awdur cyfatebol y gwaith hwn

Allbwn ymchwil: Cyfraniad at gyfnodolynErthygladolygiad gan gymheiriaid

20 Dyfyniadau (Scopus)


The K-means clustering algorithm is well known and widely used. Recently, an alternative algorithm based on an earlier idea (Duda and Hart, Pattern Classification and Scene Analysis (1973); Ismail and Kamel, Pattern Recognition 22, 75-89 (1989)) has been proposed; in comprehensive experiments we verify that this alternative is superior to K-means. Both techniques suffer from the complexity of the cost function surface they are descending, and a consequent inclination to fall into poor local minima. A popular solution to this problem is to rerun the chosen iterative procedure from many randomly chosen starting positions, saving the best. We introduce here an alternative strategy to form the starting position for such a multiple run which consists of perturbing an existing result in an attempt to improve its poor aspects. This "controlled configuration change" has comparable complexity with random initialisation but usually produces superior clustering results. Comprehensive experiments confirm this result.

Iaith wreiddiolSaesneg
Tudalennau (o-i)835-848
Nifer y tudalennau14
CyfnodolynPattern Recognition
Rhif cyhoeddi9
Dynodwyr Gwrthrych Digidol (DOIs)
StatwsCyhoeddwyd - 1991
Cyhoeddwyd yn allanolIe

Ôl bys

Gweld gwybodaeth am bynciau ymchwil 'A new clustering algorithm with multiple runs of iterative procedures'. Gyda’i gilydd, maen nhw’n ffurfio ôl bys unigryw.

Dyfynnu hyn