AI Resources

IrokoBench

AI Resources
Benchmark / Eval
Docs live

IrokoBench is a human-translated evaluation benchmark for 16 typologically diverse African languages covering three tasks: natural language inference (AfriXNLI), knowledge QA (AfriMMLU) and mathematical reasoning (AfriMGSM). It is widely used to measure the performance gap between English and African languages in large language models. It was released by the Masakhane community and published at NAACL 2025.

Category
AI Resources
Pricing
Open
Country
馃實 Pan-African
Last verified
5 Jul 2026

Tags

llm
african-languages
masakhane
evaluation
benchmark