TY - JOUR
T1 - Extending data reliability measure to a filter approach for soft subspace clustering
AU - Boongoen, Tossapon
AU - Shang, Changjing
AU - Iam-On, Natthakan
AU - Shen, Qiang
N1 - Boongoen, T., Shang, C., Iam-On, N., Shen, Q. (2011). Extending Data Reliability Measure to a Filter Approach for Soft Subspace Clustering. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 41 (6), 1705-1714
PY - 2011/12/12
Y1 - 2011/12/12
N2 - The measure of data reliability has recently proven useful for a number of data analysis tasks. This paper extends the underlying metric to a new problem of soft subspace clustering. The concept of subspace clustering has been increasingly recognized as an effective alternative to conventional algorithms (which search for clusters without differentiating the significance of different data attributes). While a large number of crisp subspace approaches have been proposed, only a handful of soft counterparts are developed with the common goal of acquiring the optimal cluster-specific dimension weights. Most soft subspace clustering methods work based on the exploitation of $k$-means and greatly rely on the iteratively disclosed cluster centers for the determination of local weights. Unlike such wrapper techniques, this paper presents a filter approach which is efficient and generally applicable to different types of clustering. Systematical experimental evaluations have been carried out over a collection of published gene expression data sets. The results demonstrate that the reliability-based methods generally enhance their corresponding baseline models and outperform several well-known subspace clustering algorithms.
AB - The measure of data reliability has recently proven useful for a number of data analysis tasks. This paper extends the underlying metric to a new problem of soft subspace clustering. The concept of subspace clustering has been increasingly recognized as an effective alternative to conventional algorithms (which search for clusters without differentiating the significance of different data attributes). While a large number of crisp subspace approaches have been proposed, only a handful of soft counterparts are developed with the common goal of acquiring the optimal cluster-specific dimension weights. Most soft subspace clustering methods work based on the exploitation of $k$-means and greatly rely on the iteratively disclosed cluster centers for the determination of local weights. Unlike such wrapper techniques, this paper presents a filter approach which is efficient and generally applicable to different types of clustering. Systematical experimental evaluations have been carried out over a collection of published gene expression data sets. The results demonstrate that the reliability-based methods generally enhance their corresponding baseline models and outperform several well-known subspace clustering algorithms.
UR - http://www.scopus.com/inward/record.url?scp=81955161160&partnerID=8YFLogxK
U2 - 10.1109/TSMCB.2011.2160341
DO - 10.1109/TSMCB.2011.2160341
M3 - Article
C2 - 21803692
VL - 41
SP - 1705
EP - 1714
JO - IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics)
JF - IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics)
IS - 6
M1 - 5962370
ER -