Datasets

AfriQA

Verified
Datasets
Language / NLP
Docs live

Cross-lingual open-retrieval question-answering dataset with human-translated QA pairs for 10 African languages (incl. Hausa, Igbo, Yoruba), totaling 12,159 examples across train/validation/test splits. From the Masakhane initiative.

Category
Datasets
Pricing
Free / CC-BY-SA 4.0
Country
🌍 Pan-African
Last verified
24 Jun 2026

Tags

nlp
african-languages
question-answering
cross-lingual
open-retrieval

Compare AfriQA

Side-by-side, verified specs against its closest language / nlp alternatives.

Related in Datasets