Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Narrative Reasoning on VIST (test)

0.456BLEURT

LogicAgent

0.376960.397480.4180.43852Feb 7, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
0.456
2026.02
0.448
2026.02
0.446
2026.02
0.444
2026.02
0.442
2026.02
0.439
2026.02
0.433
2026.02
0.433
2026.02
0.428
2026.02
0.426
2026.02
0.412
2026.02
0.409
2026.02
0.385
2026.02
0.38