FunAudioLLM/Fun-ASR

by FunAudioLLM · Voice Agents & Realtime · updated 1d ago

End-to-end large speech recognition model covering 31 languages, with support for dialects, accents, lyrics, hotwords, timestamps, and speaker diarization. Trained on tens of millions of hours of data.

momentum

1,267 stars · 125 forks · rank #83

31-languages · asr · audio-language-model · chinese-dialects · fun-asr · llm-asr · multilingual-asr · pytorch · real-time-asr · speaker-diarization · speech-recognition · speech-to-text
