Share your thoughts, 1 month free Claude Pro on usSee more

Reasoning on CLF (test)

99Accuracy

TCR-gold

Updated 4mo ago

Evaluation Results

Method	Links
TCR-gold 2026.01		99
TCR 2026.01		97
Qwen3-8B-Instruct 2026.01		96.9
DoLa 2026.01		94.4
TCR-gold 2026.01		71.3
TCR 2026.01		66.6
Qwen2.5-7B-Instruct 2026.01		56.8
DoLa 2026.01		52.3
TCR-gold 2026.01		32.7
TCR 2026.01		28.2
TCR-gold 2026.01		20.3
LLaMA3-8B-Instruct 2026.01		15.2
TCR 2026.01		11.2
Phi-3-Instruct 2026.01		9.2
DoLa 2026.01		8.8
DoLa 2026.01		7.6