Datasets registry
Datasets
Geographic, economic, health, energy, telecom and AI datasets, with format and freshness notes.
2 results in AI / NLP
NaijaSenti / AfriSenti
Largest Nigerian sentiment corpus (~30k tweets per language) covering Hausa, Igbo, Yoruba and Pidgin.
Docs live
AI / NLP
Verified Jun 2026FreeNaijaVoices
Largest African multilingual speech dataset (1,867 hours, May 2025) covering Igbo, Hausa and Yoruba. Non-commercial license.
Docs live
AI / NLP
Verified Jun 2026Free (CC BY-NC-SA)