Share your thoughts, 1 month free Claude Pro on usSee more

Instruction Following on Instruction Following (test)

0.9NMSE

MENTAT (Detailed Prompt)

Updated 2mo ago

Evaluation Results

Method	Links
MENTAT (Detailed Prompt) 2025.08		0.9	0.43
MENTAT (Basic Prompt) 2025.08		0.95	0.42
MENTAT (Basic Prompt)-Avg 2025.08		1.06	0.38
GEPA 2025.08		1.06	0.46
Gradient Descent 2025.08		1.08	0.36
MENTAT (Detailed Prompt)-Avg 2025.08		1.09	0.39
Detailed Prompt 2025.08		1.16	0.33
Basic Prompt 2025.08		1.18	0.32
MENTAT (Detailed Prompt) Prompt 2025.08		1.24	0.36
MENTAT (Basic Prompt) Prompt 2025.08		1.25	0.35
RL Fine-Tuning 2025.08		1.51	0.37