Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Audio-Visual Perception on WorldSense
Loading...
47.4
Score
EchoingPixels
37
39.7
42.4
45.1
Dec 11, 2025
Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Score
EchoingPixels
Model Scale=7B, Token...
2025.12
47.4
Full Model (Qwen2.5-Omni-7B)
Model Scale=7B, Token...
2025.12
46.1
Full Model (Qwen2.5-Omni-3B)
Model Scale=3B, Token...
2025.12
45.4
EchoingPixels
Model Scale=3B, Token...
2025.12
45
EchoingPixels
Model Scale=3B, Token...
2025.12
43.5
EchoingPixels
Model Scale=3B, Token...
2025.12
40.9
IntraModal
Model Scale=7B, Token...
2025.12
40.6
IntraModal
Model Scale=3B, Token...
2025.12
37.4
Feedback
Search any
task
Search any
task