Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
PII detection on PUPA
Loading...
68.48
F1 Score
Fine-tuning
15.544
29.287
43.03
56.773
Apr 10, 2026
F1 Score
Updated 5d ago
Evaluation Results
Method
Method
Links
F1 Score
Fine-tuning
Train Tokens (Base Mod...
2026.04
68.48
DSPy MIPROv2
Train Tokens (Base Mod...
2026.04
64.91
AIR
Train Tokens (Embed Mo...
2026.04
59.32
DSPy GEPA
Train Tokens (Base Mod...
2026.04
59.29
TextGrad
Train Tokens (Base Mod...
2026.04
55.54
DSPy BootstrapFewShot (#3)
Train Tokens (Base Mod...
2026.04
44.23
KNN
Inference Tokens (Base...
2026.04
35.08
Initial prompt
Inference Tokens (Base...
2026.04
17.58
Feedback
Search any
task
Search any
task