Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multiple Choice Question Answering on CTIBench MCQA

0.819Score

GPT-5

0.479960.567980.6560.74402Jan 28, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
0.819
2026.01
0.76
2026.01
0.753
2026.01
0.716
2026.01
0.714
2026.01
0.705
2026.01
0.692
2026.01
0.691
2026.01
0.688
2026.01
0.664
2026.01
0.658
2026.01
0.655
2026.01
0.65
2026.01
0.649
2026.01
0.607
2026.01
0.604
2026.01
0.493