AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages vs IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models

A verified, side-by-side comparison. Both records are status-checked by Findra, so you are comparing what each actually offers today, not a stale listing.

AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages

IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models

Tags

african-languages, nlp-benchmark, sentiment-analysis, semeval

african-languages, masakhane, nlp-benchmark, llm-evaluation

Summary

AfriSenti is a sentiment analysis benchmark of more than 110,000 tweets in 14 African languages spanning four language families, annotated by native speakers. It underpinned SemEval-2023 Task 12, a shared task that attracted more than 200 participants. The accompanying paper documents the data collection, annotation and baseline methods used for these low-resource languages.

IrokoBench is a human-translated evaluation benchmark covering 17 typologically diverse low-resource African languages across three tasks: natural language inference (AfriXNLI), mathematical reasoning (AfriMGSM) and knowledge-based multiple-choice QA (AfriMMLU). The paper evaluates open and proprietary LLMs and documents a large gap between high-resource languages and African languages, with the best open model reaching about 63 percent of GPT-4o performance. It was published at NAACL 2025.
