Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Visual Reasoning on NLVR2 (dev)
Loading...
82.5
Accuracy
MADTP
56.1568
62.9959
69.835
76.6741
Mar 5, 2024
Jul 12, 2024
Nov 18, 2024
Mar 28, 2025
Aug 4, 2025
Dec 11, 2025
Apr 20, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
MADTP
Reduce Ratio=0.3, GFLO...
2024.03
82.5
Uncompressed
Reduce Ratio=/, GFLOPS...
2024.03
82.48
MADTP
Reduce Ratio=0.5, GFLO...
2024.03
81.97
MADTP
Reduce Ratio=0.6, GFLO...
2024.03
81.92
MADTP
Reduce Ratio=0.7, GFLO...
2024.03
80.67
UPop
Reduce Ratio=0.3, GFLO...
2024.03
80.33
STP
Reduce Ratio=0.3, GFLO...
2024.03
79.5
CoMP
GFLOPs=26.33±0.38
2026.04
79.13
MADTP
Reduce Ratio=0.8, GFLO...
2024.03
78.28
STP
Reduce Ratio=0.5, GFLO...
2024.03
78.08
MADTP
GFLOPs=26.77±0.23
2026.04
77.16
MH-MoE
Experts Number=8
2024.04
77
UPop
Reduce Ratio=0.5, GFLO...
2024.03
76.89
X-MoE
Experts Number=8
2024.04
75.5
Dense
2024.04
73.8
MiniVLM
Params=52M, Training=1...
2026.04
73.7
UPop
Reduce Ratio=0.6, GFLO...
2024.03
72.85
PixelBERT
Params=110M, Training=...
2026.04
71.7
Efficient
Params=39.2M, Training...
2026.04
71.1
UPop
Reduce Ratio=0.7, GFLO...
2024.03
68.71
UPop
Reduce Ratio=0.8, GFLO...
2024.03
57.17
Feedback
Search any
task
Search any
task