Developer Tools
SOMALI_NLP vs uroman
A verified, side-by-side comparison. Both records are status-checked by Findra, so you are comparing what each actually offers today, not a stale listing.
Category
Developer Tools
Developer Tools
Type
NLP Library
NLP Library
Country
🇸🇴 Somalia
🌍 Pan-African
Docs status
Docs live
Docs live
Licensing
Pricing
Free / open-source
Free / open-source
Verified
Verified
Verified
Last verified
24 Jun 2026
24 Jun 2026
Tags
nlp, python, somali, stemmer, tokenizer
python, amharic, romanization, transliteration, geez
Summary
SOMALI_NLP is a Python NLP toolkit for the Somali language providing stop-word lists, stemmers for morphological analysis, tokenizers, collocation analysis and string-distance and spelling models. It draws on a companion Somali Wikipedia corpus.
uroman is a universal romanizer that converts text in virtually any script to the Latin alphabet, with dedicated handling for Amharic and the Ge'ez/Ethiopic script. It also adds initial support for Coptic and processes script-native numerals.