FunAudioLLM/Fun-ASR
by FunAudioLLM · Voice Agents & Realtime · updated 1d ago
End-to-end large speech recognition model covering 31 languages, with support for dialects, accents, lyrics, hotwords, timestamps, and speaker diarization. Trained on tens of millions of hours.
66 momentum · 1,267 stars · 125 forks · rank #83
31-languages · asr · audio-language-model · chinese-dialects · fun-asr · llm-asr · multilingual-asr · pytorch · real-time-asr · speaker-diarization · speech-recognition · speech-to-text