
Text-Audio-Vision Benchmark

Benchmarks

Task Name             Dataset Name                          SOTA Result         Trend
Multimodal Reasoning  Text-Audio-Vision Benchmark Full Set  Pass@1 Accuracy 69  3
Multimodal Reasoning  Text-Audio-Vision Benchmark Level 5   Pass@1 Acc 46       3
Multimodal Reasoning  Text-Audio-Vision Benchmark Level 4   Pass@1 Accuracy 61  3
Multimodal Reasoning  Text-Audio-Vision Benchmark Level 3   Pass@1 Accuracy 65  3
Multimodal Reasoning  Text-Audio-Vision Benchmark Level 2   Pass@1 Accuracy 86  3
Multimodal Reasoning  Text-Audio-Vision Benchmark Level 1   Pass@1 Accuracy 92  3