AVSpeech

Benchmarks

Task Name	Dataset Name	SOTA Result
Speech-to-Portrait	AVSpeech (test)	L1 Error31.26	6
Visual Acoustic Matching	AVSpeech-Rooms unseen environments (test)	RTE (s)0.071	5
Audio-Visual Speech Recognition	AVSpeech (1,000 manually filtered samples)	WER25	4
Audio-Visual Speech Extraction	AVSpeech	SI-SDR (dB)10.2	1

Showing 4 of 4 rows