EthioLLM
EthioLLM is a family of multilingual language models (XLM-RoBERTa and mT5 based) for five Ethiopian languages: Amharic, Ge'ez, Afaan Oromoo, Somali and Tigrinya, plus English. The large variant EthioLLM-l-70K is a fine-tuned XLM-RoBERTa-Large used for masked language modeling and downstream tasks like classification, NER and sentiment. It was released by the EthioNLP collective alongside the Ethiobenchmark evaluation suite.
- Category
- AI Resources
- Pricing
- Open weights
- Country
- 馃嚜馃嚬 Ethiopia
- Last verified
- 5 Jul 2026
Tags
Compare EthioLLM
Side-by-side, verified specs against its closest llm alternatives.
Related in AI Resources
AfroLM
A multilingual masked language model pretrained from scratch on 23 African languages using a self-active learning framework, outperforming AfriBERTa, mBERT and XLMR-base on NER and sentiment tasks. Created by Bonaventure Dossou and collaborators, published at SustaiNLP/EMNLP 2022.
SERENGETI
A massively multilingual masked language model covering 517 African languages and varieties across five scripts, achieving state-of-the-art results on the AfroNLU benchmark. Developed by the UBC Deep Learning and NLP Lab as an Afrocentric resource.
N-ATLaS
Nigeria's first government-backed multilingual LLM (Sep 2025): a Llama-3 8B fine-tuned on 400M+ tokens across 4 Nigerian languages. Produced by NCAIR/NITDA and Awarri.
