PII Detection on PII-Hard v1 (test)
[Figure: Entity F1 Score chart. Top result: DeepSeekV3, 0.893 (Feb 25, 2025). Metrics: Entity F1 Score, Strict F1 Score, RougeL F1 Score.]
Evaluation Results
Method        Details                    Links    Entity F1 Score  Strict F1 Score  RougeL F1 Score
DeepSeekV3    Model Category=API-bas...  2025.02  0.893            0.838            0.838
Llama3.1      Model Category=Open-so...  2025.02  0.893            0.84             0.841
Qwen2.5       Model Category=Open-so...  2025.02  0.876            0.804            0.806
GPT4o         Model Category=API-bas...  2025.02  0.869            0.817            0.819
Claude3.5     Model Category=API-bas...  2025.02  0.857            0.813            0.818
Qwen2.5-SLM   Model Category=Open-so...  2025.02  0.81             0.591            0.594
Llama3.1-SLM  Model Category=Open-so...  2025.02  0.798            0.718            0.722
BiLSTM-CRF    Model Category=Traditi...  2025.02  0.684            -                -