Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Understandability on Understandability Experiment Unknown word strategy
Loading...
7
Significance Count
Claude
0.76
2.38
4
5.62
May 7, 2026
Significance Count
Adjusted R2
Spearman Correlation
Majority Fit
Updated 26d ago
Evaluation Results
Method
Method
Links
Significance Count
Adjusted R2
Spearman Correlation
Majority Fit
Claude
Model Identifier=claud...
2026.05
7
-1.03
-
-
GPT-4o-m
Model Identifier=gpt-4...
2026.05
6
-1.94
-
-
GPT-4o
Model Identifier=gpt-4...
2026.05
5
-2.46
-
-
Llama
Model Identifier=llama...
2026.05
5
0.34
-
-
Grok
Model Identifier=grok-...
2026.05
5
-2.39
-
-
Mistral
Model Identifier=Minis...
2026.05
3
0.49
-
-
Qwen
Model Identifier=Qwen3...
2026.05
3
0.47
-
-
Majority Vote (MUM)
Evaluation Protocol=Ma...
2026.05
1
-
-
-
Feedback
Search any
task
Search any
task