Inductive Reasoning on DEER (test)

27.55 Accuracy

Complementary Steering

[Chart: Accuracy over time; y-axis ticks 16.2452, 19.1801, 22.115, 25.0499; x-axis date Apr 26, 2026]
Updated 1mo ago

Evaluation Results

Date      Accuracy  Links
2026.04   27.55     -
2026.04   27.13     -
2026.04   27.03     -
2026.04   26.49     -
2026.04   26.36     -
2026.04   25.56     -
2026.04   24.78     -
2026.04   24.52     -
2026.04   24.51     -
2026.04   24.48     -
2026.04   24.25     -
2026.04   23.92     -
2026.04   23.85     -
2026.04   23.16     -
2026.04   22.91     -
2026.04   18.52     -
2026.04   17.24     -
2026.04   16.68     -