Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Summary Similarity Evaluation on Flash generated summaries 2.5
Loading...
0.872
BERTScore F1
AutoMUP
0.84808
0.85429
0.8605
0.86671
Apr 8, 2026
BERTScore F1
SBERT Similarity
SimCSE Similarity
USE Similarity
ROUGE-L
BLEURT Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
BERTScore F1
SBERT Similarity
SimCSE Similarity
USE Similarity
ROUGE-L
BLEURT Score
AutoMUP
Consensus level=A1
2026.04
0.872
0.72
0.975
0.711
24.6
40.5
AutoMUP
Consensus level=A2
2026.04
0.86
0.634
0.973
0.66
16.6
30.9
AutoMUP
Consensus level=A3
2026.04
0.849
0.614
0.969
0.63
14.4
25.7
Feedback
Search any
task
Search any
task