Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Knowledge-intensive Question Answering on HotpotQA (dev)
Loading...
59.8
Accuracy
PMSR
29.952
37.701
45.45
53.199
Aug 31, 2025
Accuracy
Updated 8d ago
Evaluation Results
Method
Method
Links
Accuracy
PMSR
2025.08
59.8
s3
2025.08
59
Search-R1-7B
Backbone=R1-7B
2025.08
58.6
IRCoT
2025.08
50.9
RAG
2025.08
46.6
CoT
2025.08
31.1
Feedback
Search any
task
Search any
task