TY - JOUR
T1 - Self-organised direction aware data partitioning algorithm
AU - Gu, Xiaowei
AU - Angelov, Plamen Parvanov
AU - Kangin, Dmitry
AU - Principe, Jose
N1 - Funding Information:
This work was partially supported by The Royal Society grant IE141329/2014 “ Novel Machine Learning Paradigms to address Big Data Streams ”.
Publisher Copyright:
Copyright 2017 Elsevier B.V., All rights reserved.
PY - 2018/1/1
Y1 - 2018/1/1
N2 - In this paper, a novel fully data-driven algorithm, named Self-Organised Direction Aware (SODA) data partitioning and forming data clouds is proposed. The proposed SODA algorithm employs an extra cosine similarity-based directional component to work together with a traditional distance metric, thus, takes the advantages of both the spatial and angular divergences. Using the nonparametric Empirical Data Analytics (EDA) operators, the proposed algorithm automatically identifies the main modes of the data pattern from the empirically observed data samples and uses them as focal points to form data clouds. A streaming data processing extension of the SODA algorithm is also proposed. This extension of the SODA algorithm is able to self-adjust the data clouds structure and parameters to follow the possibly changing data patterns and processes. Numerical examples provided as a proof of the concept illustrate the proposed algorithm as an autonomous algorithm and demonstrate its high clustering performance and computational efficiency.
AB - In this paper, a novel fully data-driven algorithm, named Self-Organised Direction Aware (SODA) data partitioning and forming data clouds is proposed. The proposed SODA algorithm employs an extra cosine similarity-based directional component to work together with a traditional distance metric, thus, takes the advantages of both the spatial and angular divergences. Using the nonparametric Empirical Data Analytics (EDA) operators, the proposed algorithm automatically identifies the main modes of the data pattern from the empirically observed data samples and uses them as focal points to form data clouds. A streaming data processing extension of the SODA algorithm is also proposed. This extension of the SODA algorithm is able to self-adjust the data clouds structure and parameters to follow the possibly changing data patterns and processes. Numerical examples provided as a proof of the concept illustrate the proposed algorithm as an autonomous algorithm and demonstrate its high clustering performance and computational efficiency.
KW - Autonomous learning
KW - Clustering
KW - Cosine similarity
KW - Empirical Data Analytics (EDA)
KW - Nonparametric
KW - Traditional distance metric
UR - http://www.research.lancs.ac.uk/portal/en/publications/selforganised-direction-aware-data-partitioning-algorithm(24b43ef2-7df1-4f1b-b76e-4e25785db4a1).html
UR - http://www.scopus.com/inward/record.url?scp=85029700275&partnerID=8YFLogxK
U2 - 10.1016/j.ins.2017.09.025
DO - 10.1016/j.ins.2017.09.025
M3 - Article
SN - 0020-0255
VL - 423
SP - 80
EP - 95
JO - Information Sciences
JF - Information Sciences
ER -