Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Question Answering on PaQa
Loading...
46.83
ROUGE-L
RAC_SFT
22.6604
28.9352
35.21
41.4848
Jan 16, 2026
ROUGE-L
BLEU
METEOR
BERTScore (F1)
ALScore
Par-R
Updated 4d ago
Evaluation Results
Method
Method
Links
ROUGE-L
BLEU
METEOR
BERTScore (F1)
ALScore
Par-R
RAC_SFT
Training=Supervised Fi...
2026.01
46.83
20.17
47.97
90.85
43.36
27.62
RAC_DPO
Training=Direct Prefer...
2026.01
45.26
18.32
46.4
90.41
45.75
28.54
Q-Cond
2026.01
42.46
16.62
41.58
90.12
-
-
QP-Zeroshot
Mode=Zero-shot
2026.01
33.79
10.42
35.84
88.66
-
-
AT-CoT
Reasoning=Chain-of-Tho...
2026.01
23.59
7.07
22.93
85.97
-
-
Feedback
Search any
task
Search any
task