IrokoBench
AI Resources
Benchmark / Eval
Docs live
IrokoBench is a human-translated evaluation benchmark for 16 typologically diverse African languages covering three tasks: natural language inference (AfriXNLI), knowledge QA (AfriMMLU) and mathematical reasoning (AfriMGSM). It is widely used to measure the performance gap between English and African languages in large language models. It was released by the Masakhane community and published at NAACL 2025.
- Category
- AI Resources
- Pricing
- Open
- Country
- 馃實 Pan-African
- Last verified
- 5 Jul 2026
Tags
llm
african-languages
masakhane
evaluation
benchmark