All countries DatasetsDatasets
Kenya
2 verified resources in Datasets for building in Kenya.
Nigeria178 Pan-African68 Kenya39 South Africa20 Ethiopia10 Ghana7 Egypt6 Rwanda5 Senegal5 BJ4 Tanzania4 Zambia4 CI3 CM2 Kenya2 ML2 Sub-Saharan Africa2 Uganda2 Africa & Near East1 Algeria1 Morocco1 North Africa1 Somalia1 Tunisia1
Kencorpus Kenyan Language Corpus
Text and speech corpus for three Kenyan languages, Swahili, Dholuo and Luhya, containing 4,442 texts (5.6 million words) and 1,152 speech files (177 hours). It also ships derived NLP sets: POS-tagged Dholuo/Luhya, 7,537 Swahili question-answer pairs and 13,400 translated sentences. Downloadable from Harvard Dataverse; released 2022.
Docs live
Language / NLP
Verified Jul 2026Free / openKenya Subnational Administrative Boundaries (OCHA COD)
Official Common Operational Dataset administrative boundaries for Kenya, from national down through counties, sub-counties and wards. Provided on HDX as Shapefile and other GIS formats, maintained by OCHA together with the government. Free download and openly licensed.
Docs live
Geographic
Verified Jul 2026Free / open