AfriMMLU (IrokoBench) vs MasakhaNER 2.0

A verified, side-by-side comparison. Both records are status-checked by Findra, so you are comparing what each actually offers today, not a stale listing.

AfriMMLU (IrokoBench)MasakhaNER 2.0

Tags

nlp, african-languages, question-answering, evaluation, benchmark

nlp, ner, named-entity-recognition, african-languages, token-classification

Links

Website

Website Docs GitHub

Summary

Human-translated multiple-choice question-answering evaluation benchmark covering 16 to 17 African languages plus English and French, derived from a subset of MMLU across subjects like maths, geography and law. Distributed as CSV and parquet on HuggingFace and forms part of the IrokoBench suite (MMLU, MGSM, XNLI). Licensed Apache 2.0.

Largest high-quality named-entity-recognition corpus for 20 African languages (incl. Nigerian Pidgin, Hausa, Igbo, Yoruba) with PER/ORG/LOC/DATE tags over news-domain text, totaling ~152,786 rows. Built by the Masakhane community.

Full details: AfriMMLU (IrokoBench)Full details: MasakhaNER 2.0