The Voice AI Index / Text-to-Speech / #172
yl4579

yl4579/StyleTTS2

by yl4579 · Text-to-Speech · updated 1y ago

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

42
momentum
6,288
stars
693
forks
#172
rank
adversarial-trainingdeep-learningdiffusion-modelsganlatent-diffusionlatent-diffusion-modelspytorchspeaker-adaptationspeech-synthesistext-to-speechttswavlm
View on GitHub →