The Voice AI Index / Enhancement & Analysis / #134
stepfun-ai/Step-Audio-EditX
by stepfun-ai · Enhancement & Analysis · updated 2mo ago
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech
53
momentum
929
stars
69
forks
#134
rank
audio-editingcross-lingualemotion-controlparalinguisticsreinforcement-learningspeaking-stylestyle-controltext-to-speechttsvoice-cloningzero-shot-tts
View on GitHub →