Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Scientific Reasoning on SCIENTIFIC

82.8Accuracy

DEER

1.47222.58643.764.814Mar 15, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
82.89,005
2026.03
80.88,893
2026.03
80.38,910
2026.03
77.87,393
2026.03
74.87,394
2026.03
74.86,841
2026.03
74.87,167
2026.03
71.26,267
2026.03
64.76,972
2026.03
64.75,940
2026.03
62.15,659
2026.03
60.15,538
2026.03
59.11,735
2026.03
57.61,655
2026.03
56.16,575
2026.03
56.17,386
2026.03
53.54,365
2026.03
526,575
2026.03
527,134
2026.03
512,249
2026.03
50.56,963
2026.03
48.51,686
2026.03
45.51,876
2026.03
44.41,309
2026.03
34.91,748
2026.03
34.39,515
2026.03
32.310,319
2026.03
31.81,175
2026.03
31.3841
2026.03
28.39,433
2026.03
25.39,769
2026.03
24.89,727
2026.03
21.29,604
2026.03
8.112,823
2026.03
6.113,386
2026.03
4.612,685