
Model Calibration on MATH, GSM8K, SelfAware, and TruthfulQA combined

0.086 ECE

CARE-GRPO

Updated 4mo ago

Evaluation Results

Method	Links	ECE
CARE-GRPO 2026.01		0.086
RKL-GSPO 2026.01		0.088
Base Model 2026.01		0.089
RKL-DAPO 2026.01		0.09
RKL-GRPO 2026.01		0.095
CARE-DAPO 2026.01		0.099
CARE-GSPO 2026.01		0.101
GRPO (No Constraint) 2026.01		0.145
GSPO (No Constraint) 2026.01		0.149
DAPO (No Constraint) 2026.01		0.151
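
For readers unfamiliar with the metric, the ECE (expected calibration error) values above can be read as a weighted average gap between stated confidence and observed accuracy, where lower is better. The following is a minimal sketch of the standard equal-width binned estimator. The function name, the choice of 10 bins, and the toy inputs are assumptions for illustration; the leaderboard does not state how confidences were elicited or binned.

```python
import numpy as np

def expected_calibration_error(confidences, correct, n_bins=10):
    """Binned ECE: sum over bins of (bin weight) * |accuracy - mean confidence|.

    n_bins=10 is an assumed default, not a setting taken from the leaderboard.
    """
    confidences = np.asarray(confidences, dtype=float)
    correct = np.asarray(correct, dtype=float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    # Assign each prediction to an equal-width bin (lo, hi]; 0.0 falls in the first bin.
    bin_ids = np.digitize(confidences, edges[1:-1], right=True)
    ece = 0.0
    for b in range(n_bins):
        mask = bin_ids == b
        if mask.any():
            gap = abs(correct[mask].mean() - confidences[mask].mean())
            ece += mask.mean() * gap  # mask.mean() is the fraction of samples in this bin
    return float(ece)

# Toy data (hypothetical): the model says 0.9 four times and is right three times.
toy_ece = expected_calibration_error([0.9, 0.9, 0.9, 0.9], [1, 1, 1, 0])
```

On the toy data the single occupied bin has accuracy 0.75 against a mean confidence of 0.9, so the estimate is the 0.15 gap between them.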