Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

General Question Answering on NQ, TriviaQA, and PopQA

58.2NQ Accuracy

Qwen2.5-32B-Instruct + Proposed Method

2.66417.08231.545.918Dec 3, 2025Dec 28, 2025Jan 23, 2026Feb 18, 2026Mar 16, 2026Apr 11, 2026May 7, 2026
Updated 25d ago

Evaluation Results

MethodLinks
2026.05
58.281.858.2--
2026.05
56.478.255.6--
2026.05
56.277.655--
2026.05
55.88056.8--
2026.05
55.27856--
2026.05
53.276.255--
2025.12
51.867.648.65648.3
2025.12
51.267.24855.548
2025.12
49.3644853.8-
2025.12
48.664.346.753.2-
2025.12
48.163.345.252.243.9
2026.02
4863.845.752.543.1
2025.12
4863.845.752.543.1
2026.02
47.263.950.753.941.6
2026.02
46.764.250.553.842
2025.12
43.665.248.852.539.1
2026.02
42.962.342.749.339.6
2025.12
42.962.342.749.339.6
2025.12
42.466.460.456.440.9
2026.02
39.55638.844.835
2025.12
39.55638.844.835
2026.02
39.36139.746.738.5
2025.12
39.36139.746.738.5
2025.12
36.456.5394438.2
2026.02
3659.23844.434.8
2025.12
3659.23844.434.8
2026.02
34.958.539.244.230.4
2025.12
34.958.539.244.230.4
2026.02
31.835.412.126.420.7
2026.02
29.753.919.934.526.2
2025.12
29.753.919.934.526.2
2026.02
2753.719.933.527.1
2025.12
2753.719.933.527.1
2025.12
23.346.12230.5-
2026.02
22.447.830.133.423.9
2026.02
15.144.313.124.220.6
2025.12
15.144.313.124.220.6
2026.02
13.440.81422.718.1
2025.12
13.440.81422.718.1
2026.02
4.818.55.49.610.6