Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Value Alignment on Helpfulness 4
Loading...
4.364
Conformity Score
Q+IF+COT
2.36616
2.88483
3.4035
3.92217
Jul 22, 2025
Conformity Score
Updated 7d ago
Evaluation Results
Method
Method
Links
Conformity Score
Q+IF+COT
Target Model=Gemini-1....
2025.07
4.364
MODULAR
Target Model=Gemini-1....
2025.07
4.35
PICACO
Target Model=Gemini-1....
2025.07
4.342
MP+SYSTEM 1
Target Model=Gemini-1....
2025.07
4.335
Q+IF
Target Model=Gemini-1....
2025.07
4.332
OPRO
Target Model=Gemini-1....
2025.07
4.33
PICACO
2025.07
4.287
URIAL
Target Model=Gemini-1....
2025.07
4.268
URIAL+SUM
Target Model=Gemini-1....
2025.07
4.248
Q+IF
2025.07
4.247
Q+IF
Model=GPT-4o-mini
2025.07
4.241
Q+IF
Training=SFT
2025.07
4.21
MP+SYSTEM 2
Target Model=Gemini-1....
2025.07
4.163
Q
Target Model=Gemini-1....
2025.07
3.884
Q+IF
Training=SFT+
2025.07
3.25
URIAL
Training=SFT
2025.07
2.443
Feedback
Search any
task
Search any
task