The Voice AI Index / Speech-to-Text / #195
ictnlp/LLaMA-Omni
by ictnlp · Speech-to-Text · updated 1y ago
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
38
momentum
3,141
stars
223
forks
#195
rank
large-language-modelsmultimodal-large-language-modelsspeech-interactionspeech-language-modelspeech-to-speechspeech-to-text
View on GitHub →