Harmful Content Safety on StrongReject (SR)

Evaluation Score (avg@4): 100

STAR-1

Updated 2mo ago

Evaluation Results

Method	Links	Evaluation Score (avg@4)
STAR-1 2026.05		100
Self-ReSET 2026.05		100
STAR-1 2026.05		99.8
RECAP 2026.05		99.7
DAPO 2026.05		99.5
DAPO 2026.05		99.3
RECAP 2026.05		98.9
STAR-1 2026.05		98.1
Self-ReSET 2026.05		98
Self-ReSET 2026.05		97.5
RECAP 2026.05		95.8
DAPO 2026.05		95.6
Base 2026.05		95.2
Safechain 2026.05		84.6
Safechain 2026.05		70.1
Safechain 2026.05		61
Base 2026.05		51.4
Base 2026.05		40.1