Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Accent Vector: Controllable Accent Manipulation for Multilingual TTS Without Accented Data

About

Accent is an integral part of society, reflecting multiculturalism and shaping how individuals express identity. The majority of English speakers are non-native (L2) speakers, yet current Text-To-Speech (TTS) systems primarily model American-accented English due limited accented data. We propose \textit{Accent Vector}, a controllable representation that enables accent manipulation in multilingual TTS without requiring accented training data. \textit{Accent Vector} is derived by fine-tuning a TTS system on native speech of a different language (i.e. non-English) and computing task vectors capturing accent characteristics (i.e. in English). By scaling and interpolating the vector, we achieve fine-grained control over accent strength and generate mixed-accent speech. In addition, it generalizes beyond English, enabling accent control across multiple languages. Objective and human evaluations confirm the effectiveness of Accent Vector for fine-grained and compositional accent control.

Thanathai Lertpetchpun, Thanapat Trachu, Jihwan Lee, Tiantian Feng, Dani Byrd, Shrikanth Narayanan• 2026

Related benchmarks

TaskDatasetResultRank
Accent ShiftingBritish England Accent (test)
Target Accent Probability56.7
2
Accent ShiftingSpanish (evaluation)
Target Accent Probability39.7
2
Accent ShiftingHindi (evaluation set)
Target Accent Probability24.2
2
Accent ShiftingFrench Evaluation Set
Target Accent Probability23.2
2
Accent ShiftingGerman (evaluation set)
Target Accent Probability27.4
2
Accent ShiftingMandarin (evaluation)
Target Accent Probability33.8
2
English-accented speech generationSpanish Speech (test)
English Accent Probability44.69
2
English-accented speech generationGerman Speech (test)
English Accent Probability8.57
2
English-accented speech generationMandarin Speech (test)
Accent Probability (%)3.03
2
Subjective EvaluationSubjective Evaluation US Accent
Perceived Accuracy80
1
Showing 10 of 16 rows

Other info

Follow for update