Log Anomaly Detection on Thunderbird
[Chart: F1 Score over time. Best result: DeBERTa-v3, F1 Score 96.1 (Apr 14, 2026)]
Evaluation Results
Method                 Category          Date     F1 Score
DeBERTa-v3             Fine-Tuned        2026.04  96.1
RoBERTa-base           Fine-Tuned        2026.04  95.3
BERT-base              Fine-Tuned        2026.04  94.7
GPT-4 + SLCP (5-shot)  LLM (Prompt),...  2026.04  89.7
Drain + RF             Traditional       2026.04  88.6
Drain + RF (best)      Traditional       2026.04  88.6
GPT-4 (5-shot)         LLM (Prompt),...  2026.04  87.2
GPT-4 + SLCP (zero)    LLM (Prompt),...  2026.04  86.4
Drain + LR             Traditional       2026.04  85.3
Spell + SVM            Traditional       2026.04  84.9
GPT-4 (zero-shot)      LLM (Prompt),...  2026.04  83.1
LLaMA-3 (zero-shot)    LLM (Prompt),...  2026.04  78.6
GPT-3.5 (zero-shot)    LLM (Prompt),...  2026.04  76.3
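
For readers unfamiliar with the metric, the sketch below shows how an F1 score of the kind listed above is typically computed from prediction counts. The page does not state its exact evaluation protocol, so this assumes the standard binary definition with anomalous log entries as the positive class and scores reported in percent. The function name and the counts are hypothetical and chosen only for illustration.

```python
def f1_score(tp: int, fp: int, fn: int) -> float:
    """Return F1 (harmonic mean of precision and recall) in percent.

    tp: anomalies correctly flagged
    fp: normal entries wrongly flagged as anomalies
    fn: anomalies that were missed
    """
    if tp == 0:
        # No true positives: precision and recall are both zero (or undefined).
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 100.0 * 2 * precision * recall / (precision + recall)


# Hypothetical counts, not taken from the benchmark.
print(round(f1_score(tp=90, fp=10, fn=10), 1))
```

Because F1 ignores true negatives, it is not inflated by a large share of normal log lines, which is one common reason it is preferred over plain accuracy for anomaly detection.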