AfriBERTa vs Cheetah

A verified, side-by-side comparison. Both records are status-checked by Findra, so you are comparing what each actually offers today, not a stale listing.

Category
AfriBERTa: AI Resources
Cheetah: AI Resources
Type
AfriBERTa: NLP Model
Cheetah: NLP Model
Country
AfriBERTa: 🌍 Pan-African
Cheetah: 🌍 Pan-African
Docs status
AfriBERTa: Docs live
Cheetah: Docs live
Licensing
Institutional only
Pricing
AfriBERTa: Not listed
Cheetah: Free for research use
Verified
AfriBERTa: Verified
Cheetah: Verified
Last verified
AfriBERTa: 24 Jun 2026
Cheetah: 24 Jun 2026
Tags
AfriBERTa: nlp, multilingual, african-languages, low-resource, masked-language-model
Cheetah: 517-languages, ubc-nlp, nlg, text-generation, afronlg
Summary
AfriBERTa is a multilingual masked language model (XLM-RoBERTa architecture, about 126M parameters) pretrained from scratch on 11 African languages, including Amharic, Hausa, Igbo, Swahili, and Yoruba. It was built by the Castorini lab (University of Waterloo) for text classification and named entity recognition on low-resource African languages.
Cheetah is a massively multilingual natural language generation model that supports 517 African languages and outperforms baselines on five of seven AfroNLG tasks, such as summarization and translation. It was developed by the UBC Deep Learning and NLP Lab and published at ACL 2024.