Truthfulness Evaluation on TruthfulQA (TQA Metric)
Best TQA Score: 54.9 (EvoPref-Best), May 10, 2026
Updated 21d ago
Evaluation Results
Method        Details                    Date     TQA Score
EvoPref-Best  Optimization Paradigm=...  2026.05  54.9
EvoPref       Optimization Paradigm=...  2026.05  53.3
SMS-EMOA      Optimization Paradigm=...  2026.05  52.9
MOEA/D        Optimization Paradigm=...  2026.05  52.5
IPO           Optimization Paradigm=...  2026.05  52
DPO           Optimization Paradigm=...  2026.05  51.8
ORPO          Optimization Paradigm=...  2026.05  51.5
CMA-ES        Optimization Paradigm=...  2026.05  51.1
KTO           Optimization Paradigm=...  2026.05  50.8