Lugha-Llama
Lugha-Llama is a Llama-3.1-8B model continually pretrained on the WURA African-language corpus to lift performance on low-resource African languages. It ships in three variants (wura, wura_edu, wura_math) and reaches leading results among similarly sized models on the IrokoBench and AfriQA African-language benchmarks. It was built by researchers at Princeton University.
- Category: AI Resources
- Pricing: Open weights (Llama 3.1 community license)
- Country: 🌍 Pan-African
- Last verified: 5 Jul 2026
Compare Lugha-Llama
Side-by-side, verified specs against its closest LLM alternatives.
Related in AI Resources
AfroLM
A multilingual masked language model pretrained from scratch on 23 African languages using a self-active learning framework, outperforming AfriBERTa, mBERT and XLMR-base on NER and sentiment tasks. Created by Bonaventure Dossou and collaborators, published at SustaiNLP/EMNLP 2022.
SERENGETI
A massively multilingual masked language model covering 517 African languages and varieties across five scripts, achieving state-of-the-art results on the AfroNLU benchmark. Developed by the UBC Deep Learning and NLP Lab as an Afrocentric resource.
N-ATLaS
Nigeria's first government-backed multilingual LLM (Sep 2025): a Llama-3 8B fine-tuned on 400M+ tokens across 4 Nigerian languages. Produced by NCAIR/NITDA and Awarri.
