Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Image-to-Audio Generation on YTSV T11 (test)
Loading...
82.28
Onset F1 (50ms)
DAC Reconstruction
24.8408
39.7529
54.665
69.5771
May 19, 2025
Onset F1 (50ms)
Onset F1 (100ms)
Onset F1 (200ms)
Fréchet Audio Distance (FAD)
Updated 11d ago
Evaluation Results
Method
Method
Links
Onset F1 (50ms)
Onset F1 (100ms)
Onset F1 (200ms)
Fréchet Audio Distance (FAD)
DAC Reconstruction
Type=Upper-bound
2025.05
82.28
86.43
88.76
0.035
Direct I2A
Training Tasks/Data=OM...
2025.05
52.66
68.45
76.24
0.055
Direct I2A
Training Tasks/Data=OM...
2025.05
51.6
67.92
75.98
0.056
Direct I2A
Training Tasks/Data=YT...
2025.05
27.05
43.32
53.02
0.317
Feedback
Search any
task
Search any
task