Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Image-to-interleaved generation on MakeAnything (test)
Loading...
3.15
Temporal Coherence (GPT-4o)
Loom
1.174
1.687
2.2
2.713
Dec 20, 2025
Temporal Coherence (GPT-4o)
Temporal Coherence (Human)
Reference Faithfulness (GPT-4o)
Reference Faithfulness (Human)
Semantic Alignment (GPT-4o)
Semantic Alignment (Human)
Updated 4d ago
Evaluation Results
Method
Method
Links
Temporal Coherence (GPT-4o)
Temporal Coherence (Human)
Reference Faithfulness (GPT-4o)
Reference Faithfulness (Human)
Semantic Alignment (GPT-4o)
Semantic Alignment (Human)
Loom
2025.12
3.15
3.65
3.85
4.15
3.15
2.95
Doubao
2025.12
2.05
2.15
2.65
2.65
3.55
3.85
Bagel
framework=multi-turn d...
2025.12
1.25
1
1.15
1.55
-
-
Feedback
Search any
task
Search any
task