The Voice AI Index / Enhancement & Analysis / #37
FunAudioLLM

FunAudioLLM/SenseVoice

by FunAudioLLM · Enhancement & Analysis · updated 3d ago

Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoregressive.

74
momentum
8,554
stars
779
forks
#37
rank
asraudio-analysisaudio-event-detectioncross-lingualemotion-detectionllmmultilingualpythonpytorchspeech-emotion-recognitionspeech-recognitionspeech-to-text
View on GitHub →