Share your thoughts, 1 month free Claude Pro on usSee more

Command-to-action mapping on SCAN (length)

99.7Accuracy

Least-to-Most

Updated 4mo ago

Evaluation Results

Method	Links
Least-to-Most 2022.05		99.7
vNQ¹ 2023.06		95.7
PModel 2023.06		91.72
Least-to-Most 2022.05		76
Least-to-Most 2022.05		60.7
Standard prompting 2022.05		16.7
Chain-of-Thought 2022.05		16.2
Standard prompting 2022.05		6
Standard prompting 2022.05		0.4
Chain-of-Thought 2022.05		0
Chain-of-Thought 2022.05		0