Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Generative Performance on AdvancedIF
Loading...
0.91
Pearson r
RUDE
0.8645
0.88725
0.91
0.93275
May 12, 2026
Pearson r
Pearson p-value
Spearman ρ
Spearman p-value
Updated 21d ago
Evaluation Results
Method
Method
Links
Pearson r
Pearson p-value
Spearman ρ
Spearman p-value
RUDE
2026.05
0.91
0.001
0.84
0.001
Feedback
Search any
task
Search any
task