Research

AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages vs IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models

A verified, side-by-side comparison. Both records are status-checked by Findra, so you are comparing what each actually offers today, not a stale listing.

Category
Research
Research
Type
NLP benchmark
NLP benchmark
Country
🌍 Pan-African
🌍 Pan-African
Docs status
Docs live
Docs live
Licensing
Pricing
Free / open
Free / open
Verified
Unverified
Unverified
Last verified
5 Jul 2026
5 Jul 2026
Tags
african-languages, nlp-benchmark, sentiment-analysis, semeval
african-languages, masakhane, nlp-benchmark, llm-evaluation
Summary
AfriSenti is a sentiment analysis benchmark of more than 110,000 tweets in 14 African languages spanning four language families, annotated by native speakers. It underpinned SemEval-2023 Task 12, a shared task that attracted more than 200 participants, and documents data collection, annotation and baseline methods for low-resource languages.
IrokoBench is a human-translated evaluation benchmark covering 17 typologically diverse low-resource African languages across three tasks: natural language inference (AfriXNLI), mathematical reasoning (AfriMGSM) and knowledge-based multiple-choice QA (AfriMMLU). The paper evaluates open and proprietary LLMs and documents a large gap between high-resource languages and African languages, with the best open model reaching about 63 percent of GPT-4o performance. It was published at NAACL 2025.