Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
STEM Reasoning on JEE Main 2026
Loading...
97.26
Pass@1
Qwen3-30B-A3B (Thinking)
81.0568
85.2634
89.47
93.6766
Apr 10, 2026
Pass@1
Updated 5d ago
Evaluation Results
Method
Method
Links
Pass@1
Qwen3-30B-A3B (Thinking)
evaluation_protocol=4-...
2026.04
97.26
Gemini 2.5 Flash
evaluation_protocol=4-...
2026.04
96.22
GPT-5 Mini
evaluation_protocol=4-...
2026.04
95.83
GPT-OSS-120B
evaluation_protocol=4-...
2026.04
95.42
Nemotron 3 Nano 30B A3B
evaluation_protocol=4-...
2026.04
94.84
Aryabhata 2
evaluation_protocol=4-...
2026.04
92.99
GPT-OSS-20B
evaluation_protocol=4-...
2026.04
92.46
GPT-5 Nano
evaluation_protocol=4-...
2026.04
81.68
Feedback
Search any
task
Search any
task