Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Speech Recognition on Long-form video (test)
Loading...
2.08
Avg WER
Seed-ASR
1.9244
2.9747
4.025
5.0753
Jul 5, 2024
Avg WER
WER (video_1)
WER (video_2)
WER (video_3)
WER (video_4)
WER (video_5)
Updated 3d ago
Evaluation Results
Method
Method
Links
Avg WER
WER (video_1)
WER (video_2)
WER (video_3)
WER (video_4)
WER (video_5)
Seed-ASR
SFT Protocol=long-form...
2024.07
2.08
1.44
1.96
1.95
2.56
2.31
Seed-ASR
SFT Protocol=short-for...
2024.07
2.28
1.48
1.99
2.31
2.64
2.73
Transducer-based E2E Model
2024.07
3.92
2.83
3.8
3.8
4.22
4.66
Paraformer-large
2024.07
5.97
5.78
5.36
5.8
6.87
5.96
Feedback
Search any
task
Search any
task