Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Logit Divergence Analysis on Physical reasoning domain (calibration set)

0.92Mean KL Divergence (nats)

µCRASP

0.53643.12575.7158.3043May 25, 2026
Updated 8d ago

Evaluation Results

MethodLinks
2026.05
0.921
2026.05
1.531.7
2026.05
3.313.6
2026.05
4.114.5
2026.05
10.5111.4