AI Resources

Zabantu-XLM-Roberta

NLP Model
Docs live

Zabantu is a family of XLM-RoBERTa masked language models (roughly 80M to 250M parameters) trained from scratch on South African Bantu languages, including Tshivenda, Zulu, Xhosa, Swati, Northern Sotho, Southern Sotho, Setswana, and Xitsonga. Built by the Data Science for Social Impact group at the University of Pretoria, it serves as a benchmark for low-resource Bantu language NLP.
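Because these are masked language models, the most direct way to try one is a fill-mask query. The sketch below is illustrative only: it assumes the checkpoints load through the Hugging Face `transformers` fill-mask pipeline and keep XLM-RoBERTa's default `<mask>` token. `MODEL_ID` is a hypothetical placeholder, not a real checkpoint name, and `mask_word` and `top_predictions` are helper names introduced here.

```python
"""Sketch: querying a Zabantu checkpoint as a masked language model."""

# ASSUMPTION: the checkpoint keeps XLM-RoBERTa's default mask token.
MASK_TOKEN = "<mask>"

# ASSUMPTION: hypothetical placeholder. Replace it with the checkpoint ID
# given in the model's own documentation.
MODEL_ID = "your-org/zabantu-checkpoint"


def mask_word(sentence: str, word: str) -> str:
    """Replace the first occurrence of `word` in `sentence` with the mask token."""
    if word not in sentence:
        raise ValueError(f"{word!r} does not occur in the sentence")
    return sentence.replace(word, MASK_TOKEN, 1)


def top_predictions(masked_sentence: str, model_id: str = MODEL_ID, k: int = 5):
    """Return the k most likely fillers as (token, score) pairs.

    Requires the `transformers` package and access to the weights of `model_id`.
    """
    # Imported lazily so that mask_word stays usable without transformers.
    from transformers import pipeline

    fill_mask = pipeline("fill-mask", model=model_id)
    results = fill_mask(masked_sentence, top_k=k)
    return [(r["token_str"], r["score"]) for r in results]


if __name__ == "__main__":
    # Illustrative isiZulu sentence: "Ngiyabonga kakhulu" (thank you very much).
    prompt = mask_word("Ngiyabonga kakhulu", "kakhulu")
    print(prompt)
    # Once MODEL_ID points at a real checkpoint:
    # for token, score in top_predictions(prompt):
    #     print(f"{token}\t{score:.3f}")
```

The prediction call is left commented out because the placeholder ID will not resolve. Masking a single word keeps the pipeline output a flat list of candidates, which is the simplest shape to inspect.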

Category
AI Resources
Pricing
Open weights
Country
🇿🇦 South Africa
Last verified
5 Jul 2026

Tags

south-africa
xlm-roberta
bantu-languages
tshivenda
zulu
