Math & Code Reasoning on SciQ

Score: 66.5

DiReCT

Updated 1mo ago

Evaluation Results

Method	Date	Score
DiReCT	2026.05	66.5
InfoBatch	2026.05	65.6
Perplexity-based	2026.05	63.1
GradNorm (IS)	2026.05	62.8
Uniform Sampling	2026.05	61.2
Loss-based	2026.05	60.5

Method	Date	Score
DiReCT	2026.05	48.5
InfoBatch	2026.05	46
GradNorm (IS)	2026.05	45.2
Perplexity-based	2026.05	44.6
Loss-based	2026.05	43.7
Uniform Sampling	2026.05	43.1