Research

IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models

Research
NLP benchmark
Docs live

IrokoBench is a human-translated evaluation benchmark covering 17 typologically diverse low-resource African languages across three tasks: natural language inference (AfriXNLI), mathematical reasoning (AfriMGSM) and knowledge-based multiple-choice QA (AfriMMLU). The paper evaluates open and proprietary LLMs and documents a large gap between high-resource languages and African languages, with the best open model reaching about 63 percent of GPT-4o performance. It was published at NAACL 2025.

Category
Research
Pricing
Free / open
Country
馃實 Pan-African
Last verified
5 Jul 2026

Tags

african-languages
masakhane
nlp-benchmark
llm-evaluation

Compare IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models

Side-by-side, verified specs against its closest nlp benchmark alternatives.

Related in Research