Human Evaluation on MS MARCO (test)
Updated 4d ago
Evaluation Results
Method  Aspect       Date     Preference: FiD (%)  Preference: RBG (%)  Tie Rate (%)
RBG     Relevance    2022.03  18                   48                   34
RBG     Fluency      2022.03  12                   26                   62
RBG     Correctness  2022.03  4                    62                   34