Science Question Answering on FrontierScience
[Chart: Accuracy by date. Highlighted point: ARYA, 37.5 Accuracy, Mar 22, 2026]
Updated 25d ago
Evaluation Results
Method            Settings                    Links     Accuracy
ARYA              Prompting Strategy=Zer...   2026.03   37.5
GPT-5.2           Prompting Strategy=Opt...   2026.03   25.8
GPT-5.2 (pub)     Context=Best Published...   2026.03   25.8
Claude Opus 4.6   Prompting Strategy=Zer...   2026.03   8.8
GPT-5.2           Prompting Strategy=Zer...   2026.03   7.5