Datasets registry
Datasets
Geographic, economic, health, energy, telecom and AI datasets, with format and freshness notes.
35 results
GDP/CPI time-series (World Bank API)
Free, no-auth JSON API for Nigerian macro time-series: GDP, CPI inflation, FDI, remittances, employment and more. The best developer-accessible source for Nigerian economic data.
LGA Boundaries (GRID3)
Authoritative, OSGOF-backed LGA boundary polygons for all 36 states plus FCT (Dec 2020) in GeoJSON/SHP, no auth required.
ACLED conflict data
Event-level conflict and crime data with extensive Nigeria coverage (banditry, kidnapping, farmer-herder conflict, oil theft) via REST API, CSV and R/Python packages.
Recommended over the GTD, which is frozen at 2020.
AfriQA
Cross-lingual open-retrieval question-answering dataset with human-translated QA pairs for 10 African languages (incl. Hausa, Igbo, Yoruba), totaling 12,159 examples across train/validation/test splits. From the Masakhane initiative.
AfriSpeech-200
Pan-African accented English speech corpus of ~200 hours covering 120 African accents from 13 countries and 2,463 speakers across clinical and general domains, with per-accent configs. Released by Intron Health.
AfriSpeech-Dialog
Conversational African-accented speech corpus (~6 hours) of 50 two-speaker dialogues across 11 accents (Hausa, Yoruba, Igbo, Swahili, Sesotho and others) from Nigeria, Kenya and South Africa, for ASR and speaker diarization. By Intron Health.
BuyLetLive Price Index 2024
PDF report of Nigerian property price trends by city, the best public residential price data in a category with almost no open data.
Central Bank of Nigeria Statistics Database
Official online store of Central Bank of Nigeria time-series data across monetary, external, fiscal and real sectors (exchange rates, money-market rates, government securities, reserves, BoP), queryable via a Data Browser with monthly Naira exchange-rate tables exportable to Excel.
Education stats (UNESCO UIS API)
Free JSON API of national education indicators (literacy, net enrollment, completion, out-of-school children) covering Nigeria annually.
Energy time-series (World Bank + EIA)
Best developer-accessible Nigerian energy time-series: World Bank electricity-access % (no auth) and EIA energy production/consumption (free API key required).
FEWS NET Nigeria Staple Food Prices
Weekly staple-food market price data for Nigeria collected by FEWS NET enumerators since 2021, downloadable directly as XLSX, CSV and JSON with no registration, part of the FEWS NET markets-and-trade series and Data Explorer/API.
GRID3 Health Facilities v2.0
Geo-located health facility data published Nov 2024 on the GRID3 Data Hub in SHP/GeoJSON, no auth.
GRID3 Nigeria Settlement Extents v4.0
Geospatial dataset mapping Nigeria's settlements as settlement blocks with block-level building counts/areas, Google Open Buildings 2.5D heights, and Sentinel-2 NDVI/EVI metrics. Released March 2026 by CIESIN/Columbia in OGC GeoPackage format under CC BY-SA 4.0 via the GRID3 Data Hub.
Hausa Visual Genome (HausaVG)
Multimodal Hausa-English dataset of 32,923 images with paired English/Hausa region descriptions (train/dev/test/challenge splits), post-edited by HausaNLP and Bayero University Kano translators for English-to-Hausa machine translation and image description.
Health Facility Registry (HUMDATA)
CSV of 39,081 geo-located Nigerian hospitals and clinics with name, type, LGA, state, coordinates and registration status; no login.
Use this HUMDATA mirror, the official HFR portal (hfr.health.gov.ng) has broken TLS as of June 2026. Last updated 2022.
MasakhaNER 2.0
Largest high-quality named-entity-recognition corpus for 20 African languages (incl. Nigerian Pidgin, Hausa, Igbo, Yoruba) with PER/ORG/LOC/DATE tags over news-domain text, totaling ~152,786 rows. Built by the Masakhane community.
MasakhaNEWS
News-topic-classification dataset for 16 widely spoken African languages (incl. Hausa, Igbo, Yoruba, Nigerian Pidgin), ~31,088 rows in CSV/Parquet with train/val/test splits across seven topic categories. Built by the Masakhane community.
MasakhaPOS
Part-of-speech tagging dataset for 20 African languages (incl. Nigerian Pidgin, Hausa, Igbo, Yoruba) using Universal Dependencies tags, with per-language train/validation/test splits. Built by the Masakhane community.
NBS Nigeria Consumer Price Index & Inflation
Monthly Consumer Price Index and inflation dataset from Nigeria's National Bureau of Statistics, published as downloadable Excel tables, ZIP bundles and PDF reports under rebased COICOP-2018 methodology, with releases through 2026.
NCC telecom stats
Rich Nigerian telecom industry stats (188M+ subscriptions, ~87% teledensity) published only as HTML web tables and annual PDF reports.
No CSV/API export, scraping is the only bulk-access path.
NERC quarterly reports
Rich quarterly regulatory data on DisCo performance, ATC&C losses, GenCo invoices and consumer connections.
Published only as PDFs, requires Camelot or Tabula to extract structured data.
NaijaSenti / AfriSenti
Largest Nigerian sentiment corpus (~30k tweets per language) covering Hausa, Igbo, Yoruba and Pidgin.
NaijaVoices
Largest African multilingual speech dataset (1,867 hours, May 2025) covering Igbo, Hausa and Yoruba. Non-commercial license.
Nigeria General Household Survey (GHS-Panel) Wave 5
Nationally representative ~5,000-household panel survey (511 enumeration areas) by Nigeria's NBS under the World Bank LSMS-ISA program, covering household economic activity, consumption and agriculture for 2023-2024, with full questionnaires and DDI/JSON metadata.
