TY - JOUR
T1 - A scalable and distributed dendritic cell algorithm for big data classification
AU - Chelly Dagdia, Zaineb
N1 - Funding Information:
This work is part of a project that has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No 702527 . Additional thanks go to the support of the Supercomputing Wales project, which is part-funded by the European Regional Development Fund ( ERDF ) via the Welsh Government.
Publisher Copyright:
© 2018 Elsevier B.V.
PY - 2019/11/1
Y1 - 2019/11/1
N2 - In the era of big data, scaling evolution up to large-scale data sets is a very interesting and challenging task. The application of standard biological systems in such data sets is not straightforward. Therefore, a new class of scalable biological systems that embraces the huge storage and processing capacity of distributed platforms is required. In this work, we focus on the Dendritic Cell Algorithm (DCA), a bio-inspired classifier, and its limitation when coping with very large data sets. To overcome this limitation, we propose a novel distributed DCA version for data classification based on the MapReduce framework to distribute the functioning of this algorithm through a cluster of computing elements. Our experimental results show that our proposed distributed solution is suitable to enhance the performance of the DCA enabling the algorithm to be applied over big data classification problems
AB - In the era of big data, scaling evolution up to large-scale data sets is a very interesting and challenging task. The application of standard biological systems in such data sets is not straightforward. Therefore, a new class of scalable biological systems that embraces the huge storage and processing capacity of distributed platforms is required. In this work, we focus on the Dendritic Cell Algorithm (DCA), a bio-inspired classifier, and its limitation when coping with very large data sets. To overcome this limitation, we propose a novel distributed DCA version for data classification based on the MapReduce framework to distribute the functioning of this algorithm through a cluster of computing elements. Our experimental results show that our proposed distributed solution is suitable to enhance the performance of the DCA enabling the algorithm to be applied over big data classification problems
KW - Big data
KW - Dendritic cell algorithm
KW - Distributed processing
UR - http://www.scopus.com/inward/record.url?scp=85053017701&partnerID=8YFLogxK
U2 - 10.1016/j.swevo.2018.08.009
DO - 10.1016/j.swevo.2018.08.009
M3 - Article
SN - 2210-6502
VL - 50
JO - Swarm and Evolutionary Computation
JF - Swarm and Evolutionary Computation
M1 - 100432
ER -