IrokoBench

Benchmark / Eval

Docs live

IrokoBench is a human-translated evaluation benchmark for 16 typologically diverse African languages. It covers three tasks: natural language inference (AfriXNLI), multiple-choice knowledge question answering (AfriMMLU), and mathematical reasoning (AfriMGSM). It is widely used to measure how far large language models lag on African languages relative to English. The benchmark was released by the Masakhane community and published at NAACL 2025.

Category: AI Resources
Pricing: Open
Country: 🌍 Pan-African
Last verified: 5 Jul 2026

Tags