Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Long-form Question Answering on GroundBench (test)
Loading...
87.5
Faithfulness (Full)
RHIO-13B
76.372
79.261
82.15
85.039
Jan 23, 2025
Faithfulness (Full)
Faithfulness (Partial)
Faithfulness (No)
Completeness (Rate)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Faithfulness (Full)
Faithfulness (Partial)
Faithfulness (No)
Completeness (Rate)
RHIO-13B
Parameters=13B
2025.01
87.5
7.6
4.9
3.8
GPT-4o
Model Type=Proprietary
2025.01
86.5
8.1
5.4
4.2
Llama-3.1-70B-Instruct
Backbone=Llama-3.1, Pa...
2025.01
82.6
10.2
7.2
3.7
SFT-13B
Parameters=13B, Type=S...
2025.01
76.8
13.8
9.4
3.2
Feedback
Search any
task
Search any
task