Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Summarization Evaluation (Human Correlation) on arXiv (test)
Loading...
0.75
Relevance
RISE
0.0948
0.2649
0.435
0.6051
Dec 17, 2022
Relevance
Factuality
Average Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Relevance
Factuality
Average Score
RISE
Training Domain=arXiv
2022.12
0.75
0.33
0.54
RISE
Training Domain=PubMed
2022.12
0.74
0.66
0.7
RISE
Training Domain=Big Pa...
2022.12
0.67
0.5
0.59
RISE
Training Domain=Multi-...
2022.12
0.57
0.71
0.64
SMART
2022.12
0.45
0.38
0.41
BARTScore
2022.12
0.12
0.24
0.17
Feedback
Search any
task
Search any
task