URL filtering using big data analytics in 5G networks

Nasir Ali Khan, Abid Khan, Mansoor Ahmad, Munam Ali Shah, Gwanggil Jeon*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

4 Citations (Scopus)

Abstract

The future generations networking technologies such as 5G and 6G will provide tremendous performance, network capacity, quality of service and connectivity. Therefore, the convergence of these with technologies with big data analytics in today's smart ecosystem will provide tremendous opportunities. The existing URL filtering techniques do not do real-time filtering, and lack fault-tolerance and scalability. We have addressed these issues and have developed a real-time, fault-tolerant and scalable machine learning based binary classification model, which handles streams of URL traffic and classifies it into obscene or clean material, in real-time. We have only used the URL based features for classification, and have still achieved a good accuracy of 93% on logistic regression classifier and 88%. Our model can filter 2 million URLs in 55 seconds. The proposed model achieved precision, recall and f1-score values of 0.92, 0.95 and 0.93 respectively.

Original languageEnglish
Article number107379
JournalComputers and Electrical Engineering
Volume95
Early online date21 Aug 2021
DOIs
Publication statusPublished - Oct 2021

Keywords

  • Big data analytics
  • Logistic regression
  • Machine learning
  • URL filtering

Fingerprint

Dive into the research topics of 'URL filtering using big data analytics in 5G networks'. Together they form a unique fingerprint.

Cite this