Phishing URL Detection on EBBU random balanced subset of 1,000 URLs
[Chart: F1 Score by date (Jan 28, 2026). Top result: URLTran, F1 Score 99]
Updated 4d ago
Evaluation Results
Method                              Details                     Date      F1 Score
URLTran                             Description=BERT-based...   2026.01   99
Least-to-Most prompting framework   LLM Backbone=Gemini 2....   2026.01   95.64
Least-to-Most prompting framework   LLM Backbone=GPT-4.1        2026.01   94.9
One-shot classifier                 LLM Backbone=GPT-4.1        2026.01   93.72
Least-to-Most prompting framework   LLM Backbone=Gemma 3:12b    2026.01   89.62
One-shot classifier                 LLM Backbone=Gemini 2....   2026.01   89.1
One-shot classifier                 LLM Backbone=Gemma 3:12b    2026.01   87.1
Least-to-Most prompting framework   LLM Backbone=Llama 3.1:8b   2026.01   85.8
One-shot classifier                 LLM Backbone=Llama 3.1:8b   2026.01   81.73