The Voice AI Index / Text-to-Speech / #172

yl4579/StyleTTS2

by yl4579 · Text-to-Speech · updated 1y ago

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

momentum

6,288

stars

693

forks

#172

rank

adversarial-trainingdeep-learningdiffusion-modelsganlatent-diffusionlatent-diffusion-modelspytorchspeaker-adaptationspeech-synthesistext-to-speechttswavlm

View on GitHub →

yl4579/StyleTTS2

More in Text-to-Speech