The Voice AI Index / Text-to-Speech / #172
yl4579/StyleTTS2
by yl4579 · Text-to-Speech · updated 1y ago
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
42
momentum
6,288
stars
693
forks
#172
rank
adversarial-trainingdeep-learningdiffusion-modelsganlatent-diffusionlatent-diffusion-modelspytorchspeaker-adaptationspeech-synthesistext-to-speechttswavlm
View on GitHub →