Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Voice Conversion on TIMIT OOD
Loading...
0.484
F0 Correlation
Voc-only
0.35088
0.38544
0.42
0.45456
Jan 27, 2026
F0 Correlation
Speaker Similarity (SpkSim)
UTMOS Score
Word Error Rate (WER)
Updated 4d ago
Evaluation Results
Method
Method
Links
F0 Correlation
Speaker Similarity (SpkSim)
UTMOS Score
Word Error Rate (WER)
Voc-only
alpha=1
2026.01
0.484
0.695
3.7
16.4
Phonological Tokenizer
alpha=0.1
2026.01
0.456
0.762
3.88
9.8
ASR-only
alpha=0
2026.01
0.385
0.756
3.7
10.6
SpeechTokenizer
Type=hybrid
2026.01
0.383
0.726
3.53
18.6
Discrete WavLM
Type=phonetic
2026.01
0.371
0.757
3.63
10.3
WavTokenizer
Type=acoustic
2026.01
0.356
0.256
2.02
34
Feedback
Search any
task
Search any
task