Context-aware embeddings for robust multiclass fraudulent URL detection in online social platforms

Sara Afzal, Muhammad Asim, Mirza Omer Beg, Thar Baker, Ali Ismail Awad , Nouman Shamim

Research output: Contribution to journalArticlepeer-review

Abstract

The current ubiquity of online social networks (OSNs) cannot be overstated, and they have over 4.8 billion users worldwide. These platforms have become integrated into modern life, representing an important means of communication and information sharing. However, this widespread popularity has also drawn the attention of cybercriminals, who seek to exploit OSNs using deceptive Uniform Resource Locators (URLs) as their weapons of choice. Conventional URL-classification methods, which rely on post-access features or static analysis, face significant limitations; they struggle to keep pace with the ever-evolving tactics of cybercriminals, and they often lack the granularity required for precise URL categorization. The methodology proposed herein takes a different path, leveraging the power of an artificial neural network (ANN) in tandem with Bidirectional Encoder Representations from Transformers (BERT) to extract contextual embeddings from URLs. By combining the cutting-edge capabilities of ANNs and BERT, we introduce an efficient approach to safeguarding OSN users from the insidious threats lurking behind deceptive URLs by classifying them into five distinct categories: benign, defacement, phishing, malware, and spam. The proposed approach was found to achieve an impressive accuracy rate of 98.0%, surpassing the previous best of 97.92%. This technique thus has the potential to serve as a crucial defense mechanism for the billions of individuals who rely on OSNs for their social and informational needs.
Original languageEnglish
Article number109494
Number of pages16
JournalComputers and Electrical Engineering
Volume119
Issue numberA
DOIs
Publication statusPublished - 30 Jul 2024

Keywords

  • Artificial neural networks
  • BERT embeddings
  • URL classification
  • Securing online social networks

Fingerprint

Dive into the research topics of 'Context-aware embeddings for robust multiclass fraudulent URL detection in online social platforms'. Together they form a unique fingerprint.

Cite this