AI Resources

EthioLLM vs SERENGETI

A verified, side-by-side comparison. Both records are status-checked by Findra, so you are comparing what each actually offers today, not a stale listing.

Category
AI Resources
AI Resources
Type
LLM
LLM
Country
🇪🇹 Ethiopia
🌍 Pan-African
Docs status
Docs live
Docs live
Licensing
Institutional only
Pricing
Open weights
Free for research; commercial use requires contacting authors
Verified
Unverified
Verified
Last verified
5 Jul 2026
5 Jul 2026
Tags
amharic, low-resource, masked-language-model, xlm-roberta, ethiopian-languages
nlp, masked-language-model, 517-languages, afrocentric, ubc-nlp
Summary
EthioLLM is a family of multilingual language models (XLM-RoBERTa and mT5 based) for five Ethiopian languages: Amharic, Ge'ez, Afaan Oromoo, Somali and Tigrinya, plus English. The large variant EthioLLM-l-70K is a fine-tuned XLM-RoBERTa-Large used for masked language modeling and downstream tasks like classification, NER and sentiment. It was released by the EthioNLP collective alongside the Ethiobenchmark evaluation suite.
A massively multilingual masked language model covering 517 African languages and varieties across five scripts, achieving state-of-the-art results on the AfroNLU benchmark. Developed by the UBC Deep Learning and NLP Lab as an Afrocentric resource.