Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Prompt Hygiene Evaluation on IFEval Hard
Loading...
2.2
Length Ratio
PREFPO-Minimal
1.6996
5.0773
8.455
11.8327
Mar 13, 2026
Length Ratio
Repetition Increase
Similarity
Readability (LLM)
Specification Quality (LLM)
Maintainability (LLM)
Total Score (LLM)
Updated 26d ago
Evaluation Results
Method
Method
Links
Length Ratio
Repetition Increase
Similarity
Readability (LLM)
Specification Quality (LLM)
Maintainability (LLM)
Total Score (LLM)
PREFPO-Minimal
2026.03
2.2
1.2
41.8
1.87
1.65
1.74
5.26
PREFPO
2026.03
4.7
4.4
19.8
1.87
1.6
1.69
5.16
PREFPO-Elo
2026.03
6.37
5.8
15
-
-
-
-
TextGrad
2026.03
14.71
11.7
13.3
1.23
0.63
0.9
2.76
Feedback
Search any
task
Search any
task