Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Narrative Reasoning on WebQA (test)
Loading...
0.623
BLEURT
LogicAgent
0.5606
0.5768
0.593
0.6092
Feb 7, 2026
BLEURT
Updated 4d ago
Evaluation Results
Method
Method
Links
BLEURT
LogicAgent
Number of Parameters=1...
2026.02
0.623
CEN
Model Type=Video Reaso...
2026.02
0.613
NSVS-TL
Model Type=Video Reaso...
2026.02
0.612
NS-DR
Model Type=Video Reaso...
2026.02
0.608
Qwen3-VL-8B
Model Type=Open-Source...
2026.02
0.605
GIT
Model Type=Video Reaso...
2026.02
0.605
GPT-4o
Model Type=Proprietary...
2026.02
0.603
Gemini 1.5 Pro
Model Type=Proprietary...
2026.02
0.599
Qwen2.5-VL-7B
Model Type=Open-Source...
2026.02
0.594
Vid2Seq
Model Type=Video Reaso...
2026.02
0.587
ShareGPT4Video
Model Type=Open-Source...
2026.02
0.585
SEM-POS
Model Type=Video Reaso...
2026.02
0.58
AKGNN
Model Type=Video Reaso...
2026.02
0.58
VideoLLaVA
Model Type=Open-Source...
2026.02
0.563
Feedback
Search any
task
Search any
task