Nigerian Pidgin ASR (nigerian-pidgin-1.0) vs Yoruba Speech-Text Parallel Corpus

A verified, side-by-side comparison. Both records are status-checked by Findra, so you are comparing what each actually offers today, not a stale listing.

Nigerian Pidgin ASR (nigerian-pidgin-1.0) | Yoruba Speech-Text Parallel Corpus

Tags

speech, asr, audio, nigerian-pidgin, speech-to-text

speech, tts, asr, yoruba, parallel-corpus

Summary

Speech-to-text corpus for Nigerian Pidgin English: 4,277 quality-filtered 16 kHz WAV recordings with sentence-level transcriptions from 10 native speakers, split into train 2,710 / val 677 / test 892 (~956 MB).

Large Yoruba parallel speech-text corpus for ASR and TTS: 1,647,022 audio-text pairs (~21.5 GB, WAV) aligned with the MMS-300M Forced Aligner, with clips of 0.04 to 12 seconds.
