Cantonese & Chinese Dialects

Cantonese Text to Speech with CosyVoice

Turn text into natural Cantonese — and 18 other Chinese dialects including Sichuan, Shanghai, Dongbei and Tianjin. CosyVoice captures authentic dialect pronunciation and tone.

Enter your text

0/120

Limit 120 characters per generation. 3 trial samples · 20s wait · Pro skips wait

Select a voice

Awnie · Kids Storyteller
US
Awnie · Kids Storyteller

Warm, maternal and soothing delivery for children’s stories and bedtime reading.

Luna · Conversational
US
Luna · Conversational

Unleash warm, natural, and expressive storytelling voice.

Athena · Audiobook
UK
Athena · Audiobook

Clear, formal, and perfectly cadenced British professional voice.

Angus · Warm Narrator
US
Angus · Warm Narrator

Warm, rich, and highly conversational male voice, perfect for narrating stories and books.

Seán · Podcast Host
IE
Seán · Podcast Host

Charismatic and effortless male voice with a friendly Irish lilt, ideal for hosting podcasts and discussions.

Orpheus · Explainer
US
Orpheus · Explainer

Clear, confident voiceover for explainer videos, product demos, and YouTube tutorials.

Arcas · Commercial
US
Arcas · Commercial

Persuasive, polished reads for ads, promos, and brand commercials.

Authentic dialect synthesis

Native Cantonese

Generate fluent Cantonese with correct tones and natural rhythm.

18 Chinese dialects

Beyond Cantonese: Sichuanese, Shanghainese, Dongbei, Tianjin, Chongqing, Xi’an and more.

Dialect voice cloning

Clone a speaker and have them speak a chosen dialect with the same identity.

Pronunciation hotfix

Correct ambiguous characters and homophones for accurate dialect output.

Dialect TTS use cases

Localized media

Dub videos and ads for regional Chinese audiences.

Culture & education

Preserve and teach dialects with spoken examples.

Regional assistants

Build voice assistants that speak a user’s local dialect.

Entertainment

Create dialect characters and comedic content.

Cantonese & dialect TTS FAQ

Does CosyVoice support Cantonese text to speech?

Yes. CosyVoice generates natural Cantonese speech with correct tones, and you can try it free in the playground above.

Which Chinese dialects are supported?

CosyVoice supports 18 Chinese dialects, including Cantonese, Sichuanese, Shanghainese, Dongbei, Tianjin, Chongqing and Xi’an.

Can I clone a voice that speaks a dialect?

Yes. CosyVoice can combine zero-shot voice cloning with dialect synthesis, so a cloned voice can speak a chosen dialect.

Is the dialect TTS free?

CosyVoice is open source under Apache-2.0. You can self-host it for free or try dialect synthesis online here.

Explore more CosyVoice tools