PII Detection on PII-Distract v1 (test)
[Chart: Entity F1 Score over time. Top result: Claude3.5, 94.8 (Feb 25, 2025). Available metrics: Entity F1 Score, Strict F1 Score, RougeL F1 Score.]
Updated 1mo ago
Evaluation Results

| Method | Model Category | Date | Entity F1 Score | Strict F1 Score | RougeL F1 Score |
| --- | --- | --- | --- | --- | --- |
| Claude3.5 | API-bas... | 2025.02 | 94.8 | 0.91 | 0.911 |
| Llama3.1 | Open-so... | 2025.02 | 94.6 | 0.834 | 0.835 |
| DeepSeekV3 | API-bas... | 2025.02 | 94.5 | 0.658 | 0.658 |
| Qwen2.5 | Open-so... | 2025.02 | 94.1 | 0.647 | 0.649 |
| Llama3.1-SLM | Open-so... | 2025.02 | 87.6 | 0.551 | 0.552 |
| GPT4o | API-bas... | 2025.02 | 86.8 | 0.715 | 0.716 |
| Qwen2.5-SLM | Open-so... | 2025.02 | 81.5 | 0.454 | 0.456 |
| BiLSTM-CRF | Traditi... | 2025.02 | 78.7 | - | - |