Long-context reasoning on LongReason

Score: 86.9

LONGTRACERL-GRPO

Updated 1mo ago

Evaluation Results

Method	Links	Score
LONGTRACERL-GRPO 2026.05		86.9
DocQA 2026.05		86.4
LONGTRACERL 2026.05		85.4
LoongRL 2026.05		84.9
Base 2026.05		84.2
LongRLVR 2026.05		84.2
LONGTRACERL 2026.05		83.8
LongRLVR 2026.05		80.7
DocQA 2026.05		79.9
LoongRL 2026.05		78.7
LONGTRACERL-GRPO 2026.05		78.7
Base 2026.05		78.5
LONGTRACERL-GRPO 2026.05		75.4
LONGTRACERL 2026.05		75.2
Base 2026.05		74.1
DocQA 2026.05		73.7
LoongRL 2026.05		73.3
LongRLVR 2026.05		73.2