Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Audio Instruction Following on VoiceBench

4.78AlpacaEval Score

GPT-4o-Audio

1.89922.64713.3954.1429Jul 3, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
4.784.494.5875.580.2589.2384.176.0298.6586.75
2025.07
4.534.044.1670.4362.4372.5369.769.5398.0877.48
4.413.993.860.0460.8774.066766.4298.2774.52
2025.07
4.363.333.4256.0666.4361.9866.252.6398.2769.31
2025.07
4.213.663.4838.8852.1571.6555.338.1497.6964.53
3.973.423.1836.9839.7553.4152.825.9288.0856.48
2025.07
3.813.823.5639.7842.1965.9361.845.3510064.32
3.743.433.0135.7135.7249.4554.726.3396.7355.8
2025.07
3.673.543.7457.0525.7625.4951.839.1598.2757.39
2025.07
2.011.61.315.6424.0425.9347.410.1244.2329.51