Eliciting Least-to-Most Reasoning for Phishing URL Detection

About

Phishing continues to be one of the most prevalent attack vectors, making accurate classification of phishing URLs essential. Recently, large language models (LLMs) have demonstrated promising results in phishing URL detection. However, their reasoning capabilities that enabled such performance remain underexplored. To this end, in this paper, we propose a Least-to-Most prompting framework for phishing URL detection. In particular, we introduce an "answer sensitivity" mechanism that guides Least-to-Most's iterative approach to enhance reasoning and yield higher prediction accuracy. We evaluate our framework using three URL datasets and four state-of-the-art LLMs, comparing against a one-shot approach and a supervised model. We demonstrate that our framework outperforms the one-shot baseline while achieving performance comparable to that of the supervised model, despite requiring significantly less training data. Furthermore, our in-depth analysis highlights how the iterative reasoning enabled by Least-to-Most, and reinforced by our answer sensitivity mechanism, drives these performance gains. Overall, we show that this simple yet powerful prompting strategy consistently outperforms both one-shot and supervised approaches, despite requiring minimal training or few-shot guidance. Our experimental setup can be found in our Github repository github.sydney.edu.au/htri0928/least-to-most-phishing-detection.

Holly Trikilis, Pasindu Marasinghe, Fariza Rashid, Suranga Seneviratne• 2026

Related benchmarks

Task	Dataset	Result
Phishing URL Detection	HP random balanced 1,000 URLs	F1 Score96.58	9
Phishing URL Detection	EBBU random balanced subset of 1,000 URLs	F1 Score95.64	9
Phishing URL Detection	ISCX random balanced subset of 1,000 URLs	F1 Score96.43	9

Showing 3 of 3 rows

Other info

Follow for update

@wizwand_team Discord