SERENGETI
Verified
A massively multilingual masked language model covering 517 African languages and varieties across five scripts, achieving state-of-the-art results on the AfroNLU benchmark. It was developed by the UBC Deep Learning and NLP Lab as an Afrocentric resource.
- Category: AI Resources
- Pricing: Free for research; commercial use requires contacting the authors
- Country: 🌍 Pan-African
- Last verified: 24 Jun 2026
Related in AI Resources
N-ATLaS
Nigeria's first government-backed multilingual LLM (Sep 2025): a Llama-3 8B model fine-tuned on 400M+ tokens across four Nigerian languages. Produced by NCAIR/NITDA and Awarri.
InkubaLM
InkubaLM-0.4B is a 400M-parameter open-weights small language model built from scratch by Lelapa AI for five low-resource African languages (isiZulu, Yoruba, Swahili, isiXhosa, and Hausa), plus English and French. It uses a LLaMA-style architecture trained on 2.4B tokens.
UlizaLlama (Jacaranda Health)
UlizaLlama is a 7B-parameter Swahili-and-English LLM built by Jacaranda Health in Kenya. It is fine-tuned from Meta's Llama 2, with continual pretraining on ~321M Swahili tokens, and powers Swahili maternal-health SMS support for low-income expectant mothers in East Africa.
