Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multi-modal Visual Question Answering on VQA-T
Loading...
64.84
Accuracy
16-bit Baseline
49.4584
53.4517
57.445
61.4383
Nov 15, 2024
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
16-bit Baseline
Data Format=16-bit, Ba...
2024.11
64.84
AMXFP4
Data Format=AMXFP4, Ba...
2024.11
59.13
MXFP4
Data Format=MXFP4, Bac...
2024.11
57.88
MXFP4-PoT
Data Format=MXFP4-PoT,...
2024.11
50.05
Feedback
Search any
task
Search any
task