Veracity Prediction on Factify-2
[Chart: Macro F1. Best result: Mistral-7B QwenVL, Macro F1 78, May 21, 2025.]
Evaluation Results
Method                Details                     Date      Macro F1
Mistral-7B QwenVL     Backbone=Mistral-7B, J...   2025.05   78
RoBERTa-L QwenVL      Backbone=RoBERTa-Large...   2025.05   76
XLNet-L QwenVL        Backbone=XLNet-Large,...    2025.05   76
RoBERTa-L Idefics3    Backbone=RoBERTa-Large...   2025.05   75
Llama-8B QwenVL       Backbone=Llama-8B, Jus...   2025.05   75
Llama-8B Idefics3     Backbone=Llama-8B, Jus...   2025.05   75
Mistral-7B Idefics3   Backbone=Mistral-7B, J...   2025.05   75
XLNet-L Idefics3      Backbone=XLNet-Large,...    2025.05   74
Llama-8B Paligemma    Backbone=Llama-8B, Jus...   2025.05   73
XLNet-L Paligemma     Backbone=XLNet-Large,...    2025.05   72
Mistral-7B Paligemma  Backbone=Mistral-7B, J...   2025.05   72
RoBERTa-L Paligemma   Backbone=RoBERTa-Large...   2025.05   71
Du et al., 2023       Protocol=Baseline           2025.05   57