amseg vs SOMALI_NLP
A verified, side-by-side comparison. Both records are status-checked by Findra, so you are comparing what each actually offers today, not a stale listing.
Field | amseg | SOMALI_NLP
Category | Developer Tools | Developer Tools
Type | NLP Library | NLP Library
Country | 🇪🇹 Ethiopia | 🇸🇴 Somalia
Docs status | Docs live | Docs live
Licensing
Pricing | Free / open-source | Free / open-source
Verified | Verified | Verified
Last verified | 24 Jun 2026 | 24 Jun 2026
Tags | nlp, python, amharic, ethiopic, tokenization | nlp, python, somali, stemmer, tokenizer
Summary
amseg is an Amharic document segmentation and normalization tool that splits Ethiopic text into sentences and tokens, normalizes character variants, and transliterates between Latin and Fidel. It is maintained under the University of Hamburg Semantic Models for Amharic project.
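To make the segmentation and normalization steps concrete, here is a minimal standard-library sketch of the general idea. It does not use or reproduce the amseg API: the function names (`normalize`, `split_sentences`, `tokenize`), the set of homophone series that are merged, and the punctuation handling are all assumptions chosen for illustration. Transliteration is not covered.

```python
import re

# Homophone series merged here, as (variant base, canonical base) code points.
# This pairing is an assumed convention for illustration, not amseg's own table.
_HOMOPHONE_BASES = [
    (0x1210, 0x1200),  # ሐ series -> ሀ series
    (0x1280, 0x1200),  # ኀ series -> ሀ series
    (0x1220, 0x1230),  # ሠ series -> ሰ series
    (0x12D0, 0x12A0),  # ዐ series -> አ series
    (0x1340, 0x1338),  # ፀ series -> ጸ series
]

# Each series has seven basic vowel orders at consecutive code points.
_NORMALIZE_TABLE = {
    src + i: dst + i
    for src, dst in _HOMOPHONE_BASES
    for i in range(7)
}

# Sentence enders: Ethiopic full stop (U+1362), Ethiopic question mark (U+1367),
# plus the ASCII "?" and "!".
_SENTENCE_END = "\u1362\u1367?!"
_SENTENCE_RE = re.compile(f"[^{_SENTENCE_END}]+[{_SENTENCE_END}]?")

# A token is a run of non-space, non-punctuation characters, or one punctuation
# mark. The Ethiopic wordspace (U+1361) acts as a separator and is dropped.
_TOKEN_RE = re.compile(r"[^\s\u1361-\u1368?!.,]+|[\u1362-\u1368?!.,]")


def normalize(text: str) -> str:
    """Map variant homophone characters to one canonical series."""
    return text.translate(_NORMALIZE_TABLE)


def split_sentences(text: str) -> list[str]:
    """Split text after each sentence-ending mark, dropping empty pieces."""
    pieces = (piece.strip() for piece in _SENTENCE_RE.findall(text))
    return [piece for piece in pieces if piece]


def tokenize(sentence: str) -> list[str]:
    """Split a sentence into word tokens and punctuation tokens."""
    return _TOKEN_RE.findall(sentence)


if __name__ == "__main__":
    sample = "ሠላም። እንዴት ነህ?"
    for sentence in split_sentences(normalize(sample)):
        print(tokenize(sentence))
```

A real segmenter also has to handle abbreviations, numbers, and mixed-script text, which this sketch ignores.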
SOMALI_NLP is a Python NLP toolkit for the Somali language that provides stop-word lists, stemmers for morphological analysis, tokenizers, collocation analysis, and string-distance and spelling models. It draws on a companion Somali Wikipedia corpus.
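As a rough illustration of two of the feature areas named above, stop-word filtering and string-distance-based spelling suggestions, the following standard-library sketch shows the underlying techniques. It does not call SOMALI_NLP, and nothing here reflects that library's actual API: the stop-word sample, the function names, and the choice of Levenshtein distance as the string-distance measure are assumptions for illustration only.

```python
# Tiny illustrative stop-word sample; not the list shipped with SOMALI_NLP.
SAMPLE_STOP_WORDS = {"iyo", "oo", "waa", "ka", "ku"}


def remove_stop_words(tokens, stop_words=SAMPLE_STOP_WORDS):
    """Keep only the tokens that are not in the stop-word set."""
    return [token for token in tokens if token.lower() not in stop_words]


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance computed row by row with dynamic programming."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(
                min(
                    previous[j] + 1,         # deletion
                    current[j - 1] + 1,      # insertion
                    previous[j - 1] + cost,  # substitution or match
                )
            )
        previous = current
    return previous[-1]


def suggest(word: str, vocabulary):
    """Return the vocabulary entry with the smallest edit distance to word."""
    return min(vocabulary, key=lambda candidate: edit_distance(word, candidate))


if __name__ == "__main__":
    tokens = ["waa", "magaalo", "iyo", "tuulo"]
    print(remove_stop_words(tokens))
    print(suggest("magalo", ["magaalo", "tuulo"]))
```

A production spelling model would typically weight candidates by corpus frequency as well as distance; this sketch uses distance alone.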