Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Text Simplification on BigPatent 500 samples (human evaluation)
Loading...
4.27
Coherence
NRLB
3.3652
3.6001
3.835
4.0699
Apr 12, 2026
Coherence
Simplicity
Faithfulness
Preference
Updated 5d ago
Evaluation Results
Method
Method
Links
Coherence
Simplicity
Faithfulness
Preference
NRLB
Human evaluation group...
2026.04
4.27
4.18
4.12
76.7
NRLB
Human evaluation group...
2026.04
3.78
3.33
3.89
56
AgentSimp
Human evaluation group...
2026.04
3.44
3
3.33
-
AgentSimp
Human evaluation group...
2026.04
3.4
2.35
3.32
-
Feedback
Search any
task
Search any
task