Our new X account is live! Follow @wizwand_team for updates
SOTA Helpfulness evaluation benchmarks and papers with code | Wizwand
Benchmarks
| Dataset Name | SOTA Method | Metric | Score | Results | Last Updated |
| --- | --- | --- | --- | --- | --- |
| LLaVA-Bench | LLaVA-RLHF | Conversation Score | 93.1 | 11 | 4d ago |
| MTBench | GPT-4o | Helpfulness | 9.35 | 8 | 4d ago |
| HH-RLHF helpful (test) | DeAL | Helpfulness Fraction | 77 | 7 | 4d ago |
| HHH (test) | DPO + OGPSA | HHH Score | 90.68 | 3 | 4d ago |