Datasets

MAFAND-MT (masakhane/mafand)

Datasets
Language / NLP
Docs live

Largest news-domain machine translation benchmark for African languages, covering 21 languages with English or French as source. It contains 142,909 parallel sentences in parquet with train, dev and test splits, hosted on HuggingFace. Licensed CC BY-NC 4.0.

Category
Datasets
Pricing
Free / open (CC BY-NC 4.0)
Country
馃實 Pan-African
Last verified
5 Jul 2026

Tags

nlp
african-languages
news
machine-translation
parquet

Compare MAFAND-MT (masakhane/mafand)

Side-by-side, verified specs against its closest language / nlp alternatives.

Related in Datasets