Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Question Answering on QA OOD StrQA SciQA

98.3StrQA Accuracy

Qwen3-8B pass@N (Upper Bound)

40.99655.87370.7585.627Nov 9, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
98.399.3--
2025.11
91.995.6--
2025.11
88.697.1--
2025.11
88.696.9--
2025.11
88.696.9--
2025.11
88.196.9--
2025.11
88.195.8--
2025.11
87.894.1--
2025.11
87.897.1--
2025.11
87.694.1--
2025.11
87.395.8--
2025.11
87.196.9--
86.892.7--
2025.11
86.692.5--
2025.11
86.696.3--
2025.11
84.694.7--
2025.11
7489.281.683.4
2025.11
54.647.250.962
2025.11
51.845.948.960.6
2025.11
51.249.850.559
2025.11
50.959.755.360.6
2025.11
50.959.755.360.6
2025.11
5045.647.858.6
2025.11
49.46054.760.5
2025.11
48.557.45363
2025.11
45.447.246.357.5
2025.11
44.851.548.259.4
2025.11
43.248.345.852.8