AI Resources
AfroLM vs Lugha-Llama
A verified, side-by-side comparison. Both records are status-checked by Findra, so you are comparing what each actually offers today, not a stale listing.
Category
AI Resources
AI Resources
Type
LLM
LLM
Country
🌍 Pan-African
🌍 Pan-African
Docs status
Docs live
Docs live
Licensing
Pricing
Free / open weights
Open weights (Llama 3.1 community license)
Verified
Verified
Unverified
Last verified
5 Jul 2026
5 Jul 2026
Tags
nlp, african-languages, masked-language-model, active-learning, data-efficient
african-languages, swahili, low-resource, wura-corpus, llama-3.1
Summary
A multilingual masked language model pretrained from scratch on 23 African languages using a self-active learning framework, outperforming AfriBERTa, mBERT and XLMR-base on NER and sentiment tasks. Created by Bonaventure Dossou and collaborators, published at SustaiNLP/EMNLP 2022.
Lugha-Llama is a Llama-3.1-8B model continually pretrained on the WURA African-language corpus to lift performance on low-resource African languages. It ships in three variants (wura, wura_edu, wura_math) and reaches leading results among similarly sized models on the IrokoBench and AfriQA African-language benchmarks. It was built by researchers at Princeton University.