
Extractive Question Answering on Five Extractive QA datasets aggregated

Calibration Score (C): 0.91

Mistral-8x22B

[Chart omitted; date shown: Dec 30, 2025]

Evaluation Results

Method    Date       Scores                        Links
          2025.12    0.91    0.78    0.73    0.81    -
          2025.12    0.87    0.76    0.63    0.75    -
          2025.12    0.84    0.74    0.7     0.76    -
          2025.12    0.81    0.7     0.64    0.72    -
          2025.12    0.81    0.69    0.63    0.71    -
          2025.12    0.71    0.68    0.71    0.7     -
          2025.12    0.68    0.66    0.67    0.67    -
          2025.12    0.52    0.65    0.58    0.63    -
          2025.12    0.16    0.54    0.44    0.57    -
          2025.12    0       0.51    0.41    0.52    -