Datasets

AfriSpeech-200 vs Nigerian Pidgin ASR (nigerian-pidgin-1.0)

A verified, side-by-side comparison. Both records are status-checked by Findra, so you are comparing what each actually offers today, not a stale listing.

Category
Datasets
Datasets
Type
Speech
Speech
Country
🌍 Pan-African
πŸ‡³πŸ‡¬ Nigeria
Docs status
Docs live
Docs live
Licensing
Pricing
Free / CC-BY-NC-SA 4.0
Free / CC-BY 4.0
Verified
Verified
Verified
Last verified
24 Jun 2026
24 Jun 2026
Tags
speech, asr, african-accents, clinical, audio
speech, asr, audio, nigerian-pidgin, speech-to-text
Summary
Pan-African accented English speech corpus of ~200 hours covering 120 African accents from 13 countries and 2,463 speakers across clinical and general domains, with per-accent configs. Released by Intron Health.
Speech-to-text corpus for Nigerian Pidgin English: 4,277 quality-filtered 16kHz WAV recordings with sentence-level transcriptions from 10 native speakers, split train 2,710 / val 677 / test 892, ~956 MB.