PII Detection on PII-Hard v1 (test)
[Chart: Entity F1 Score by method; other available metrics: Strict F1 Score, RougeL F1 Score. Highlighted point: DeepSeekV3, Entity F1 Score 0.893, Feb 25, 2025.]
Updated 1mo ago
Evaluation Results
Method        Model Category  Date     Entity F1 Score  Strict F1 Score  RougeL F1 Score
DeepSeekV3    API-bas...      2025.02  0.893            0.838            0.838
Llama3.1      Open-so...      2025.02  0.893            0.84             0.841
Qwen2.5       Open-so...      2025.02  0.876            0.804            0.806
GPT4o         API-bas...      2025.02  0.869            0.817            0.819
Claude3.5     API-bas...      2025.02  0.857            0.813            0.818
Qwen2.5-SLM   Open-so...      2025.02  0.81             0.591            0.594
Llama3.1-SLM  Open-so...      2025.02  0.798            0.718            0.722
BiLSTM-CRF    Traditi...      2025.02  0.684            -                -