Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Narrative Reasoning on MMIU (test)

0.306BLEURT Score

LogicAgent

0.117760.166630.21550.26437Feb 7, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
0.306
2026.02
0.288
2026.02
0.287
2026.02
0.285
2026.02
0.28
2026.02
0.278
2026.02
0.264
2026.02
0.264
2026.02
0.258
2026.02
0.255
2026.02
0.248
2026.02
0.239
2026.02
0.215
2026.02
0.125