AfriHate Hate Speech Datasets vs Hausa Visual Genome (HausaVG)

A verified, side-by-side comparison. Both records are status-checked by Findra, so you are comparing what each actually offers today, not a stale listing.

| Field | AfriHate Hate Speech Datasets | Hausa Visual Genome (HausaVG) |
| --- | --- | --- |
| Category | Datasets | Datasets |
| Type | Language / NLP | Language / NLP |
| Country | 🌍 Pan-African | 🇳🇬 Nigeria |
| Docs status | Docs live | Docs live |
| Licensing / pricing | Free / open | Free / CC-BY-NC-SA 4.0 |
| Verified | Unverified | Verified |
| Last verified | 5 Jul 2026 | 5 Jul 2026 |
| Tags | nlp, african-languages, hate-speech, abusive-language, twitter | nlp, hausa, machine-translation, multimodal, image-captioning |
| Summary | Multilingual collection of hate speech and abusive language datasets covering 15 African languages, built from tweets annotated by native speakers. Each instance carries labels from 3 to 4 annotators, identified by anonymous annotator IDs. The data is downloadable from Hugging Face. Published at NAACL 2025. | Multimodal Hausa-English dataset of 32,923 images with paired English/Hausa region descriptions, split into train, dev, test, and challenge sets. The descriptions were post-edited by translators from HausaNLP and Bayero University Kano. Intended for English-to-Hausa machine translation and image description. |