
Narrative Reasoning on VIST (test)

0.456 BLEURT

LogicAgent

Feb 7, 2026
Updated 1mo ago

Evaluation Results

Date      BLEURT
2026.02   0.456
2026.02   0.448
2026.02   0.446
2026.02   0.444
2026.02   0.442
2026.02   0.439
2026.02   0.433
2026.02   0.433
2026.02   0.428
2026.02   0.426
2026.02   0.412
2026.02   0.409
2026.02   0.385
2026.02   0.38