Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Factuality Evaluation on QAGS CNN
Loading...
0.735
Pearson Correlation
BARTSCORE
0.32212
0.42931
0.5365
0.64369
Jun 22, 2021
Pearson Correlation
Updated 1mo ago
Evaluation Results
Method
Method
Links
Pearson Correlation
BARTSCORE
Fine-tuning task=CNN
2021.06
0.735
BARTSCORE
Fine-tuning task=CNN,...
2021.06
0.719
BARTSCORE
Fine-tuning task=CNN +...
2021.06
0.68
BARTSCORE
Setup=Vanilla
2021.06
0.661
BERTScore
2021.06
0.576
QAGS
2021.06
0.545
PRISM
2021.06
0.479
ROUGE-2
2021.06
0.459
MoverScore
2021.06
0.414
ROUGE-L
2021.06
0.357
ROUGE-1
2021.06
0.338
Feedback
Search any
task
Search any
task