Veracity Prediction on MOCHEG
[Chart: Macro F1 Score over time. Best result: 57 (Llama-8B + QwenVL), May 21, 2025.]
Evaluation Results
Method                  Details                    Date     Macro F1 Score
Llama-8B + QwenVL       Backbone=Llama-8B, Jus...  2025.05  57
Mistral-7B + Paligemma  Backbone=Mistral-7B, J...  2025.05  57
Llama-8B + Paligemma    Backbone=Llama-8B, Jus...  2025.05  56
Mistral-7B + QwenVL     Backbone=Mistral-7B, J...  2025.05  56
Mistral-7B + Idefics3   Backbone=Mistral-7B, J...  2025.05  56
Llama-8B + Idefics3     Backbone=Llama-8B, Jus...  2025.05  55
RoBERTa-L + Idefics3    Backbone=RoBERTa-Large...  2025.05  51
XLNet-L + Idefics3      Backbone=XLNet-Large,...   2025.05  50
RoBERTa-L + Paligemma   Backbone=RoBERTa-Large...  2025.05  48
XLNet-L + QwenVL        Backbone=XLNet-Large,...   2025.05  48
XLNet-L + Paligemma     Backbone=XLNet-Large,...   2025.05  48
RoBERTa-L + QwenVL      Backbone=RoBERTa-Large...  2025.05  47
Cekinel et al., 2025    Protocol=Baseline          2025.05  37
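
For readers unfamiliar with the metric, here is a minimal sketch of how a macro F1 score is commonly computed: F1 is calculated per class and then averaged without class weighting. The function name `macro_f1` and the three label strings are illustrative assumptions, not taken from the benchmark's evaluation code, and the scores above appear to be reported on a 0 to 100 scale, whereas this sketch returns a value in [0, 1].

```python
from collections import Counter


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over all labels seen in y_true or y_pred."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and of equal length")

    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gains a false positive
            fn[truth] += 1  # true class gains a false negative

    scores = []
    for label in labels:
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the denominator is 0.
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical labels, used for illustration only.
y_true = ["supported", "refuted", "nei", "supported"]
y_pred = ["supported", "nei", "nei", "refuted"]
print(round(100 * macro_f1(y_true, y_pred), 1))
```

Because every class contributes equally to the average, a model that ignores a rare class is penalized more heavily than it would be under accuracy or micro-averaged F1.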