RO reformulation on Large Out-of-Distribution

85.4 Accuracy

AutoREM

Updated 2mo ago

Evaluation Results

Method	Links	Accuracy
AutoREM 2026.05		85.4	10,929
Expert Prompt 2026.05		79.2	13,243
Max Thinking 2026.05		77.1	22,695
Base LLM 2026.05		76.0	11,757
ReasoningBank 2026.05		71.9	11,774
ACE 2026.05		63.5	6,706