AI Resources

Lugha-Llama vs SERENGETI

A verified, side-by-side comparison. Both records are status-checked by Findra, so you are comparing what each actually offers today, not a stale listing.

Category
AI Resources
AI Resources
Type
LLM
LLM
Country
🌍 Pan-African
🌍 Pan-African
Docs status
Docs live
Docs live
Licensing
Institutional only
Pricing
Open weights (Llama 3.1 community license)
Free for research; commercial use requires contacting authors
Verified
Unverified
Verified
Last verified
5 Jul 2026
5 Jul 2026
Tags
african-languages, swahili, low-resource, wura-corpus, llama-3.1
nlp, masked-language-model, 517-languages, afrocentric, ubc-nlp
Summary
Lugha-Llama is a Llama-3.1-8B model continually pretrained on the WURA African-language corpus to lift performance on low-resource African languages. It ships in three variants (wura, wura_edu, wura_math) and reaches leading results among similarly sized models on the IrokoBench and AfriQA African-language benchmarks. It was built by researchers at Princeton University.
A massively multilingual masked language model covering 517 African languages and varieties across five scripts, achieving state-of-the-art results on the AfroNLU benchmark. Developed by the UBC Deep Learning and NLP Lab as an Afrocentric resource.