TY - GEN
T1 - Autonomous data-driven clustering for live data stream
AU - Gu, Xiaowei
AU - Angelov, Plamen Parvanov
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2017/2/9
Y1 - 2017/2/9
N2 - In this paper, a novel autonomous data-driven clustering approach, called AD-clustering, is presented for live data streams processing. This newly proposed algorithm is a fully unsupervised approach and entirely based on the data samples and their ensemble properties, in the sense that there is no need for user-predefined or problem-specific assumptions and parameters, which is a problem most of the current clustering approaches suffer from. Moreover, the proposed approach automatically evolves its structure according to the experimentally observable streaming data and is able to recursively update its self-defined parameters using only the current data sample; meanwhile, it discards all the previously processed data samples. Experimental results based on benchmark datasets exhibit the higher performance of the proposed fully autonomous approach compared with the comparative approaches requiring user- and problem-specific parameters to be predefined. This new clustering algorithm is a promising tool for further applications in the field of real-time streaming data analytics.
AB - In this paper, a novel autonomous data-driven clustering approach, called AD-clustering, is presented for live data streams processing. This newly proposed algorithm is a fully unsupervised approach and entirely based on the data samples and their ensemble properties, in the sense that there is no need for user-predefined or problem-specific assumptions and parameters, which is a problem most of the current clustering approaches suffer from. Moreover, the proposed approach automatically evolves its structure according to the experimentally observable streaming data and is able to recursively update its self-defined parameters using only the current data sample; meanwhile, it discards all the previously processed data samples. Experimental results based on benchmark datasets exhibit the higher performance of the proposed fully autonomous approach compared with the comparative approaches requiring user- and problem-specific parameters to be predefined. This new clustering algorithm is a promising tool for further applications in the field of real-time streaming data analytics.
KW - Ensemble properties
KW - Fully unsupervised clustering
KW - Live data streams
KW - Recursive update
KW - Streaming data analytics
UR - http://www.research.lancs.ac.uk/portal/en/publications/autonomous-datadriven-clustering-for-live-data-stream(af309e0e-e6da-47da-ac34-490c80e74c57).html
UR - http://www.scopus.com/inward/record.url?scp=85015766060&partnerID=8YFLogxK
U2 - 10.1109/SMC.2016.7844394
DO - 10.1109/SMC.2016.7844394
M3 - Conference Proceeding (Non-Journal item)
SN - 9781509018987
T3 - 2016 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2016 - Conference Proceedings
SP - 1128
EP - 1135
BT - 2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC 2016)
PB - IEEE Press
ER -