Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Safety Evaluation on SAGE-Eval 1.0 (test)
Loading...
34.62
Model-level Safety Score
Gemini 2.0 Flash
30.616
31.6555
32.695
33.7345
May 28, 2026
Model-level Safety Score
AUC (Safety)
Updated 5d ago
Evaluation Results
Method
Method
Links
Model-level Safety Score
AUC (Safety)
Gemini 2.0 Flash
Information Setting=Cr...
2026.05
34.62
71.69
Gemini 2.0 Flash
Information Setting=PD...
2026.05
32.69
71.69
Gemini 2.0 Flash
Information Setting=Re...
2026.05
30.77
68.56
Feedback
Search any
task
Search any
task