Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

LLMs are One-Shot URL Classifiers and Explainers

About

Malicious URL classification represents a crucial aspect of cyber security. Although existing work comprises numerous machine learning and deep learning-based URL classification models, most suffer from generalisation and domain-adaptation issues arising from the lack of representative training datasets. Furthermore, these models fail to provide explanations for a given URL classification in natural human language. In this work, we investigate and demonstrate the use of Large Language Models (LLMs) to address this issue. Specifically, we propose an LLM-based one-shot learning framework that uses Chain-of-Thought (CoT) reasoning to predict whether a given URL is benign or phishing. We evaluate our framework using three URL datasets and five state-of-the-art LLMs and show that one-shot LLM prompting indeed provides performances close to supervised models, with GPT 4-Turbo being the best model, followed by Claude 3 Opus. We conduct a quantitative analysis of the LLM explanations and show that most of the explanations provided by LLMs align with the post-hoc explanations of the supervised classifiers, and the explanations have high readability, coherency, and informativeness.

Fariza Rashid, Nishavi Ranaweera, Ben Doyle, Suranga Seneviratne• 2024

Related benchmarks

TaskDatasetResultRank
Phishing URL DetectionHP random balanced 1,000 URLs
F1 Score96.12
9
Phishing URL DetectionISCX random balanced subset of 1,000 URLs
F1 Score92.17
9
Phishing URL DetectionEBBU random balanced subset of 1,000 URLs
F1 Score93.72
9
Showing 3 of 3 rows

Other info

Follow for update