Difficulty Correlation with Human Performance on JEE (n=34)
[Chart: 0.44 Pearson R (LLM compare), Dec 16, 2025. Available metrics: Pearson R, Spearman R, Kendall Tau.]
Updated 4d ago
Evaluation Results
Method       | Configuration        | Date    | Pearson R | Spearman R | Kendall Tau
LLM compare  | Model=Gemini 2.5 Pro | 2025.12 | 0.44      | 0.32       | 0.21
LLM compare  | Model=OpenAI o3      | 2025.12 | 0.24      | 0.16       | 0.1
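
For readers unfamiliar with the three metrics reported above, the following is a minimal sketch of how Pearson R, Spearman R, and Kendall Tau can be computed between a human difficulty signal and a model's difficulty estimate. The page does not state how the benchmark computes these values, so the function names, the tau-a variant of Kendall Tau, and the toy data are all assumptions made for illustration; none of the numbers come from the benchmark.

```python
import math


def pearson(x, y):
    """Pearson R: covariance divided by the product of standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def spearman(x, y):
    """Spearman R: Pearson R applied to the ranks of the data."""
    return pearson(ranks(x), ranks(y))


def kendall_tau(x, y):
    """Kendall tau-a: (concordant - discordant) pairs over all pairs.

    Assumption: tied pairs count as neither concordant nor discordant.
    """
    n = len(x)
    score = 0
    for i in range(n):
        for j in range(i + 1, n):
            product = (x[i] - x[j]) * (y[i] - y[j])
            score += (product > 0) - (product < 0)
    return score / (n * (n - 1) / 2)


# Hypothetical toy data, NOT taken from the benchmark.
human_difficulty = [0.1, 0.4, 0.35, 0.8, 0.6]   # e.g. fraction of students who failed
model_estimate = [1, 3, 2, 5, 4]                # e.g. model-assigned difficulty score

print(pearson(human_difficulty, model_estimate))
print(spearman(human_difficulty, model_estimate))
print(kendall_tau(human_difficulty, model_estimate))
```

Pearson R measures linear agreement on the raw values, while Spearman R and Kendall Tau depend only on the ordering of the items, which is why the three columns in the table can differ noticeably for the same method.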