OpenMOSS/MOSS-TTSD
by OpenMOSS · Voice Cloning & Conversion · updated 2mo ago
MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flexible speaker control, and multilingual support, while enabling zero-shot voice cloning from short audio references.
Momentum: 52 · Stars: 1,350 · Forks: 132 · Rank: #136
Tags: finetune · large-language-models · sglang · speech-dialogue-generation · streaming · text-to-speech