Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Veracity Prediction on VERITE
Loading...
64
Macro F1
Llama-8BQwenVL
39.04
45.52
52
58.48
May 21, 2025
Macro F1
Updated 4d ago
Evaluation Results
Method
Method
Links
Macro F1
Llama-8BQwenVL
Backbone=Llama-8B, Jus...
2025.05
64
Mistral-7BQwenVL
Backbone=Mistral-7B, J...
2025.05
64
Mistral-7BIdefics3
Backbone=Mistral-7B, J...
2025.05
61
Llama-8BIdefics3
Backbone=Llama-8B, Jus...
2025.05
59
RoBERTa-LQwenVL
Backbone=RoBERTa-Large...
2025.05
54
RoBERTa-LIdefics3
Backbone=RoBERTa-Large...
2025.05
50
Llama-8BPaligemma
Backbone=Llama-8B, Jus...
2025.05
50
Mistral-7BPaligemma
Backbone=Mistral-7B, J...
2025.05
50
XLNet-LQwenVL
Backbone=XLNet-Large,...
2025.05
48
XLNet-LIdefics3
Backbone=XLNet-Large,...
2025.05
48
Papadopoulos et al., 2025
Protocol=Baseline
2025.05
47
RoBERTa-LPaligemma
Backbone=RoBERTa-Large...
2025.05
40
XLNet-LPaligemma
Backbone=XLNet-Large,...
2025.05
40
Feedback
Search any
task
Search any
task