Multi-Hop Question Answering on HotpotQA, 2Wiki, Musique, and Bamboogle (test)

HotpotQA Score: 57.02

JADE

[Chart: score over time, Dec 3, 2025 to May 26, 2026]
Updated 6d ago

Evaluation Results

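| Date | HotpotQA | 2Wiki | Musique | Bamboogle | Avg | | | |
|---|---|---|---|---|---|---|---|---|
| 2026.01 | 57.02 | 53.87 | 29.26 | 58.26 | 49.6 | 53.86 | - | - |
| 2026.01 | 49.21 | 41.66 | 17.26 | 37.2 | 36.33 | 45.57 | - | - |
| 2026.05 | 48.69 | 47.59 | 22.23 | 48.8 | 45.24 | - | 2.94 | 15.39 |
| 2026.05 | 48.02 | 47.92 | 21.6 | 48 | 45.13 | - | 3.34 | 13.51 |
| 2026.05 | 47.38 | 46.81 | 20.95 | 46.4 | 44.22 | - | 2.68 | 16.5 |
| 2026.05 | 47.21 | - | 20.63 | 44 | 43.02 | - | 3.91 | 11 |
| 2026.01 | 46.87 | 39.03 | 17.97 | 38.69 | 35.64 | 43.91 | - | - |
| 2026.01 | 46.85 | 37.79 | 17.55 | 37.38 | 34.89 | 44.18 | - | - |
| 2026.01 | 46.65 | 43.96 | 22.38 | 49.84 | 40.71 | 41.22 | - | - |
| 2026.05 | 46.64 | 45.6 | 19.97 | 45.9 | 43.19 | - | 3.77 | 11.45 |
| 2026.05 | 46.53 | 51.89 | 21.17 | 56.8 | 46.82 | - | 2.6 | 18.01 |
| 2026.01 | 46.21 | 41.52 | 18.27 | 36.59 | 35.65 | 44.72 | - | - |
| 2026.05 | 45.83 | 51.49 | 21.06 | 56 | 46.39 | - | 3.01 | 15.41 |
| 2026.05 | 45.62 | 49.83 | 19.51 | 50.4 | 45.2 | - | 3.28 | 13.78 |
| 2026.05 | 45.21 | 51.12 | 19.84 | 54.4 | 45.84 | - | 2.45 | 18.71 |
| 2026.05 | 44.8 | 50.66 | 18.94 | 52 | 45.4 | - | 3.16 | 14.33 |
| 2026.01 | 42.38 | 39.62 | 25.48 | 34.85 | 35.58 | 37.11 | - | - |
| 2026.05 | 42.16 | 43.72 | 20.48 | 44.8 | 40.72 | - | 1.56 | 26.1 |
| 2026.05 | 38.74 | 47.71 | 15.16 | 48 | 41.27 | - | 2.06 | 20.03 |
| 2026.05 | 35.18 | 38.45 | 14.21 | 48 | 34.83 | - | 3.31 | 10.52 |
| 2026.05 | 30.42 | 32.92 | 12.83 | 44.8 | 30.01 | - | 3.43 | 8.75 |
| 2026.01 | 28.33 | 25.91 | 25.2 | 25.28 | 26.18 | 34.89 | - | - |
| 2026.05 | 18.52 | 16.73 | 5.42 | 16 | 16.1 | - | 3.92 | 4.11 |
| 2026.01 | 17.76 | 22.58 | 8.58 | 17.14 | 16.52 | 17.44 | - | - |
| 2026.05 | 2.85 | 1.94 | 0.58 | 4 | 2.1 | - | 4.04 | 0.52 |
| 2025.12 | 0.466 | 0.456 | 0.215 | 0.403 | 0.385 | 0.451 | - | - |
| 2025.12 | 0.452 | 0.443 | 0.197 | 0.395 | 0.372 | 0.44 | - | - |
| 2025.12 | 0.443 | 0.447 | 0.197 | 0.444 | 0.383 | 0.441 | - | - |
| 2025.12 | 0.424 | 0.437 | 0.178 | 0.432 | 0.368 | 0.424 | - | - |
| 2025.12 | 0.414 | 0.409 | 0.18 | 0.368 | 0.343 | - | - | - |
| 2026.02 | 0.399 | 0.389 | 0.155 | 0.352 | 0.324 | - | - | - |
| 2026.02 | 0.392 | 0.376 | 0.143 | 0.352 | 0.316 | - | - | - |
| 2025.12 | 0.345 | 0.323 | 0.082 | 0.21 | 0.24 | 0.36 | - | - |
| 2025.12 | 0.338 | 0.346 | 0.13 | 0.139 | 0.238 | 0.345 | - | - |
| 2025.12 | 0.332 | 0.329 | 0.098 | 0.25 | 0.252 | 0.331 | - | - |
| 2026.02 | 0.331 | 0.31 | 0.124 | 0.232 | 0.249 | - | - | - |
| 2025.12 | 0.331 | 0.31 | 0.124 | 0.232 | 0.249 | 0.336 | - | - |
| 2026.02 | 0.324 | 0.319 | 0.103 | 0.264 | 0.253 | - | - | - |
| 2025.12 | 0.324 | 0.319 | 0.103 | 0.264 | 0.253 | 0.325 | - | - |
| 2025.12 | 0.297 | 0.274 | 0.066 | 0.128 | 0.191 | 0.312 | - | - |
| 2025.12 | 0.294 | 0.36 | 0.14 | 0.258 | 0.263 | 0.311 | - | - |
| 2026.02 | 0.284 | 0.273 | 0.049 | 0.088 | 0.174 | - | - | - |
| 2025.12 | 0.284 | 0.273 | 0.049 | 0.088 | 0.174 | 0.303 | - | - |
| 2025.12 | 0.274 | 0.3 | 0.098 | 0.111 | 0.196 | 0.317 | - | - |
| 2026.02 | 0.265 | 0.244 | 0.061 | 0.113 | 0.171 | - | - | - |
| 2026.02 | 0.255 | 0.226 | 0.047 | 0.08 | 0.152 | - | - | - |
| 2025.12 | 0.255 | 0.226 | 0.047 | 0.08 | 0.152 | 0.27 | - | - |
| 2026.02 | 0.24 | 0.233 | 0.059 | 0.21 | 0.186 | - | - | - |
| 2025.12 | 0.24 | 0.233 | 0.059 | 0.21 | 0.186 | 0.265 | - | - |
| 2026.02 | 0.221 | 0.218 | 0.054 | 0.32 | 0.203 | - | - | - |
| 2025.12 | 0.221 | 0.218 | 0.054 | 0.32 | 0.203 | 0.255 | - | - |
| 2026.02 | 0.208 | 0.275 | 0.06 | 0.192 | 0.184 | - | - | - |
| 2025.12 | 0.208 | 0.275 | 0.06 | 0.192 | 0.184 | 0.224 | - | - |
| 2026.02 | 0.201 | 0.268 | 0.055 | 0.224 | 0.187 | - | - | - |
| 2025.12 | 0.201 | 0.268 | 0.055 | 0.224 | 0.187 | 0.229 | - | - |
| 2026.02 | 0.186 | 0.248 | 0.044 | 0.112 | 0.147 | - | - | - |
| 2026.02 | 0.164 | 0.171 | 0.067 | 0.24 | 0.161 | - | - | - |
| 2026.02 | 0.149 | 0.244 | 0.02 | 0.024 | 0.109 | - | - | - |
| 2025.12 | 0.149 | 0.244 | 0.02 | 0.024 | 0.109 | 0.134 | - | - |
| 2026.02 | 0.021 | 0.021 | 0.002 | 0 | 0.011 | - | - | - |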