stepfun-ai/Step-Audio-EditX

by stepfun-ai · Enhancement & Analysis · updated 2mo ago

A powerful 3B-parameter, LLM-based reinforcement learning audio editing model that excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech.

Momentum: 53 · Stars: 929 · Forks: 69 · Rank: #134
Tags: audio-editing, cross-lingual, emotion-control, paralinguistics, reinforcement-learning, speaking-style, style-control, text-to-speech, tts, voice-cloning, zero-shot-tts