Share your thoughts, 1 month free Claude Pro on usSee more

Self-awareness on SelfAware

51.2Accuracy

Base Model

Updated 4mo ago

Evaluation Results

Method	Links
Base Model 2026.01		51.2
CARE-GSPO 2026.01		50.2
RKL-GSPO 2026.01		49.8
CARE-GRPO 2026.01		49.5
RKL-GRPO 2026.01		49.1
CARE-DAPO 2026.01		49.1
RKL-DAPO 2026.01		48.7
GRPO (No Constraint) 2026.01		35.3
GSPO (No Constraint) 2026.01		32.7
DAPO (No Constraint) 2026.01		32.5