PII Detection on PII-Multi v1 (test)
[Chart: Entity F1 Score by method. Best result: 0.942 (Llama3.1, Feb 25, 2025). Metrics available: Entity F1 Score, Strict F1 Score, ROUGE-L F-Score.]
Updated 4d ago
Evaluation Results
Method        Method details             Date     Entity F1 Score  Strict F1 Score  ROUGE-L F-Score
Llama3.1      Model Category=Open-so...  2025.02  0.942            0.883            0.884
DeepSeekV3    Model Category=API-bas...  2025.02  0.927            0.884            0.886
GPT4o         Model Category=API-bas...  2025.02  0.923            0.891            0.893
Claude3.5     Model Category=API-bas...  2025.02  0.92             0.89             0.892
Qwen2.5       Model Category=Open-so...  2025.02  0.918            0.853            0.855
Llama3.1-SLM  Model Category=Open-so...  2025.02  0.869            0.778            0.781
BiLSTM-CRF    Model Category=Traditi...  2025.02  0.828            -                -
Qwen2.5-SLM   Model Category=Open-so...  2025.02  0.806            0.451            0.453