Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Open-ended Question Answering on NarrativeQA (Acc, BLEU, F1)
Loading...
73.77
Accuracy
w/t BoT
56.7244
61.1497
65.575
70.0003
May 20, 2026
Accuracy
BLEU Score
F1 Score
Updated 12d ago
Evaluation Results
Method
Method
Links
Accuracy
BLEU Score
F1 Score
w/t BoT
Model=Qwen2.5-1.5b-ins...
2026.05
73.77
54.85
73.82
w/t CD
Model=Qwen2.5-1.5b-ins...
2026.05
72.59
53.72
73.25
w/t KLE
Model=Qwen2.5-1.5b-ins...
2026.05
72.03
52.11
72.39
w/t SE
Model=Qwen2.5-1.5b-ins...
2026.05
71.76
52.2
72.24
CARE
Model=Qwen2.5-1.5b-ins...
2026.05
71.4
51.84
71.65
GRPO
Model=Qwen2.5-1.5b-ins...
2026.05
66.23
48.71
67.02
EMPO
Model=Qwen2.5-1.5b-ins...
2026.05
57.38
41.32
60.96
Feedback
Search any
task
Search any
task