FunAudioLLM/Fun-ASR

by FunAudioLLM · Voice Agents & Realtime · updated 1d ago

End-to-end large speech recognition model covering 31 languages, with support for dialects, accents, lyrics, hotwords, timestamps, and speaker diarization. Trained on tens of millions of hours of speech data.

66 momentum · 1,267 stars · 125 forks · rank #83
Tags: 31-languages · asr · audio-language-model · chinese-dialects · fun-asr · llm-asr · multilingual-asr · pytorch · real-time-asr · speaker-diarization · speech-recognition · speech-to-text