CosyVoice

Multilingual voice generation for natural text-to-speech

CosyVoice brings high-fidelity multilingual speech synthesis, zero-shot voice cloning, and low-latency streaming generation to research and production applications.