Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Open-ended Question Answering on Qasper
Loading...
13.91
Accuracy
w/t BoT
11.4972
12.1236
12.75
13.3764
May 20, 2026
Accuracy
BLEU Score
F1 Score
Updated 12d ago
Evaluation Results
Method
Method
Links
Accuracy
BLEU Score
F1 Score
w/t BoT
Model=Qwen2.5-1.5b-ins...
2026.05
13.91
10.94
24.37
w/t CD
Model=Qwen2.5-1.5b-ins...
2026.05
13.67
11.28
24.25
w/t KLE
Model=Qwen2.5-1.5b-ins...
2026.05
13.46
11.62
22.26
CARE
Model=Qwen2.5-1.5b-ins...
2026.05
13.32
10.83
22.07
w/t SE
Model=Qwen2.5-1.5b-ins...
2026.05
13.29
10.8
22.23
EMPO
Model=Qwen2.5-1.5b-ins...
2026.05
12
9.58
18.05
GRPO
Model=Qwen2.5-1.5b-ins...
2026.05
11.59
9.4
20.51
Feedback
Search any
task
Search any
task