Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Open-domain question answering on RJUA (test)
Loading...
33
Rouge-L
EAPO
29.152
30.151
31.15
32.149
May 27, 2026
Rouge-L
Reranker Score
RL@8
RR@8
Average Score
Updated 6d ago
Evaluation Results
Method
Method
Links
Rouge-L
Reranker Score
RL@8
RR@8
Average Score
EAPO
Backbone=Qwen3-8B, Eva...
2026.05
33
97.3
36
99.5
66.4
NSR
Backbone=Qwen3-8B, Eva...
2026.05
32.7
97.1
36.1
99
66.2
W-REINFORCE
Backbone=Qwen3-8B, Eva...
2026.05
32.2
95.8
36.6
99.8
66.1
GRPO
Backbone=Qwen3-8B, Eva...
2026.05
31.2
97.6
34.4
98.7
65.5
PSR
Backbone=Qwen3-8B, Eva...
2026.05
29.3
94
32.7
97.9
63.5
Feedback
Search any
task
Search any
task