Explanation Generation on Mocheg 1.0 (test)
[Chart: ROUGE-1 by method (selectable metrics: ROUGE-1, ROUGE-2, BLEU-4). Top entry: GPT-4o, 30.3 ROUGE-1, Feb 10, 2026.]
Updated 4d ago
Evaluation Results
Method            Setting             Links    ROUGE-1  ROUGE-2  BLEU-4
GPT-4o            retrieved evid...   2026.02  30.3     9.9      4.5
MEVER             retrieved evid...   2026.02  28.5     14.3     10.1
MEVER w/o images  retrieved evid...   2026.02  28       13.7     9.1
DePlot+FlanT5     retrieved evid...   2026.02  27.4     11.8     9.9
MochegModel       retrieved evid...   2026.02  26       11       10.7
JustiLM           retrieved evid...   2026.02  25.1     11.3     8.2
ECENet            retrieved evid...   2026.02  24.9     11.1     8.1