Developer Tools
PuoBERTa vs stopwords-sw
A verified, side-by-side comparison. Both records are status-checked by Findra, so you are comparing what each actually offers today, not a stale listing.
Category
Developer Tools
Developer Tools
Type
NLP Library
NLP Library
Country
🇿🇦 South Africa
🌍 Pan-African
Docs status
Docs live
Docs live
Licensing
Pricing
Free / open-source
Free / open-source
Verified
Verified
Verified
Last verified
24 Jun 2026
24 Jun 2026
Tags
south-africa, python, setswana, roberta, language-model
npm, nlp, json, swahili, stopwords
Summary
PuoBERTa is a RoBERTa-based masked language model purpose-built for Setswana, trained on the PuoData corpus by the Data Science for Social Impact group. It ships with example scripts for fill-mask, news classification, NER and POS tagging via HuggingFace Transformers.
A comprehensive Swahili (sw) stopwords collection distributed in JSON and text formats (npm/bower stopwords-sw) for text preprocessing in NLP pipelines. Maintained by the stopwords-iso project.