Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Long-Document Summarization on SQuALITY
Loading...
35.44
BLEU-1
QAFD-RAG
31.9664
32.8682
33.77
34.6718
Apr 21, 2026
BLEU-1
BLEU-2
ROUGE-1 F1
ROUGE-2 F1
METEOR
Updated 14d ago
Evaluation Results
Method
Method
Links
BLEU-1
BLEU-2
ROUGE-1 F1
ROUGE-2 F1
METEOR
QAFD-RAG
LLM=GPT-4o-mini, Embed...
2026.04
35.44
18.63
28.43
4.79
25.59
LightRAG
LLM=GPT-4o-mini, Embed...
2026.04
34.17
17.41
28.59
4.31
23.27
GraphRAG
LLM=GPT-4o-mini, Embed...
2026.04
33.91
16.12
26.38
4.08
24.38
HippoRAG
LLM=GPT-4o-mini, Embed...
2026.04
33.22
16.74
27.29
3.92
23.41
RAPTOR
LLM=GPT-4o-mini, Embed...
2026.04
32.1
16.58
25.13
3.49
22.87
Feedback
Search any
task
Search any
task