Academic Journal

Toward More Generalized Malicious URL Detection Models

Bibliographic Details
Title: Toward More Generalized Malicious URL Detection Models
Authors: Tsai, Yun-Da, Liow, Cayon, Sheng Siang, Yin, Lin, Shou-De
Superior Title: Proceedings of the AAAI Conference on Artificial Intelligence; Vol. 38 No. 19: AAAI-24 Special Track Safe, Robust and Responsible AI Track; 21628-21636 ; 2374-3468 ; 2159-5399
Publisher Information: Association for the Advancement of Artificial Intelligence
Publication Year: 2024
Collection: Association for the Advancement of Artificial Intelligence: AAAI Publications
Subject Terms: General
Description: This paper reveals a data bias issue that can profoundly hinder the performance of machine learning models in malicious URL detection. We describe how such bias can be diagnosed using interpretable machine learning techniques and further argue that such biases naturally exist in the real world security data for training a classification model. To counteract these challenges, we propose a debiased training strategy that can be applied to most deep-learning based models to alleviate the negative effects of the biased features. The solution is based on the technique of adversarial training to train deep neural networks learning invariant embedding from biased data. Through extensive experimentation, we substantiate that our innovative strategy fosters superior generalization capabilities across both CNN-based and RNN-based detection models. The findings presented in this work not only expose a latent issue in the field but also provide an actionable remedy, marking a significant step forward in the pursuit of more reliable and robust malicious URL detection.
Document Type: article in journal/newspaper
File Description: application/pdf
Language: English
Relation: https://ojs.aaai.org/index.php/AAAI/article/view/30161/32059; https://ojs.aaai.org/index.php/AAAI/article/view/30161/32060; https://ojs.aaai.org/index.php/AAAI/article/view/30161
DOI: 10.1609/aaai.v38i19.30161
Availability: https://doi.org/10.1609/aaai.v38i19.30161
https://ojs.aaai.org/index.php/AAAI/article/view/30161
Rights: Copyright (c) 2024 Association for the Advancement of Artificial Intelligence
Accession Number: edsbas.59C6D832
Database: BASE
Description
Description not available.