Share your thoughts, 1 month free Claude Pro on usSee more

Mathematical Reasoning on GSM8K Zero (test) (ACC, Output Tokens)

78.43Accuracy

TALE-PT-SFT

Updated 5mo ago

Evaluation Results

Method	Links
TALE-PT-SFT 2024.12		78.43	77.85
TALE-PT-DPO 2024.12		78.41	113.41
Directly Answering 2024.12		70.32	13.49
Vanilla CoT 2024.12		65.04	251.08