AI Resources

AfriHuBERT

AI Resources
Speech (ASR/TTS)
Docs live

AfriHuBERT is a compact self-supervised speech representation model based on mHuBERT-147, continually pretrained via multilingual adaptive finetuning on over 10,000 hours of speech spanning more than 1,200 African languages and varieties. It improves spoken language identification and ASR over its base model and acts as an encoder for downstream African speech tasks. Its training data was aggregated from sources including BibleTTS, Kallaama, NaijaVoices and NCHLT.

Category
AI Resources
Pricing
Open weights
Country
馃實 Pan-African
Last verified
5 Jul 2026

Tags

speech
african-languages
self-supervised
hubert
speech-encoder

Compare AfriHuBERT

Side-by-side, verified specs against its closest speech (asr/tts) alternatives.

Related in AI Resources