AfriSUD: A Dependency Treebank Collection for Evaluating Models on African Languages

arXiv cs.CL·Happy Buzaaba, Cheikh Mouhamadou Bamba Dione, David Ifeoluwa Adelani, Sylvain Kahane, Kim Gerdes, Bruno Guillaume, Kevin Guan, Aremu Anuoluwapo, Naome A. Etori, Shamsuddeen Hassan Muhammad, Utitofon Inyang, Peter Nabende, David Sabiiti Bamutura, Andiswa Bukula, Chinedu Uchechukwu, Rooweither Mabuya, Idris Akinade, Christiane Fellbaum

6/12/2026

Quick Take

AfriSUD introduces the first large-scale collection of syntactically annotated treebanks for nine African languages, revealing significant syntax gaps in existing NLP models. Evaluations of part-of-speech tagging and dependency parsing using non-transformer baselines, multilingual pretrained encoders, and LLMs show limitations in capturing the structural diversity of African languages.

Key Points

AfriSUD includes treebanks for nine diverse African languages from major language families.
Data is verified by native speakers, capturing key typological features like agglutination and tone.
Evaluation reveals significant limitations in models across all nine languages.
Models tested include non-transformer baselines, multilingual pretrained encoders, and LLMs.
Existing architectures may not adequately represent African-language syntax diversity.

Article Excerpt

From source RSS / original summary

arXiv:2606.12708v1. Abstract: Despite their linguistic diversity and global significance, African languages remain underrepresented in research and resources to support NLP. We aim to bridge this gap by introducing AfriSUD, the first large-scale collection of syntactically annotated treebanks for nine diverse African languages spanning major language families and regions across Sub-Saharan Africa.

Using the Surface-Syntactic Universal Dependencies (SUD) framework, our community-led effort provides high-quality, native-speaker verified data that capture typological key features such as agglutination and tone. We evaluate a range of models on AfriSUD for part-of-speech tagging and dependency parsing including non-transformer baselines, multilingual pretrained encoders, and LLMs.

Our results reveal a significant syntax gap, where models still show clear limitations across the nine languages, suggesting that existing architectures may not fully capture the structural diversity of African-language syntax.
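To make the evaluation setup concrete, here is a minimal sketch of how part-of-speech tagging and dependency parsing are commonly scored. It assumes the treebanks are read in the ten-column CoNLL-U format and that the metrics are tagging accuracy plus unlabeled and labeled attachment scores (UAS and LAS); this summary does not state which metrics the paper reports. The function names and the four placeholder tokens (w1 to w4) are hypothetical and are not AfriSUD data.

```python
from typing import Dict, List, Tuple

# One token is reduced to (UPOS tag, head index, dependency label).
Token = Tuple[str, int, str]


def conllu_line(idx: int, form: str, upos: str, head: int, deprel: str) -> str:
    """Build one ten-column CoNLL-U token line; unused columns are '_'."""
    return "\t".join(
        [str(idx), form, "_", upos, "_", "_", str(head), deprel, "_", "_"]
    )


def read_tokens(text: str) -> List[Token]:
    """Extract (UPOS, HEAD, DEPREL) from CoNLL-U text.

    Comment lines, blank lines, multiword ranges (e.g. 1-2) and empty
    nodes (e.g. 1.1) are skipped. Assumes HEAD is always an integer.
    """
    tokens: List[Token] = []
    for line in text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        tokens.append((cols[3], int(cols[6]), cols[7]))
    return tokens


def evaluate(gold: List[Token], pred: List[Token]) -> Dict[str, float]:
    """Return tagging accuracy, UAS and LAS over aligned token lists."""
    if not gold or len(gold) != len(pred):
        raise ValueError("gold and pred must be non-empty and the same length")
    n = len(gold)
    upos = sum(g[0] == p[0] for g, p in zip(gold, pred)) / n
    uas = sum(g[1] == p[1] for g, p in zip(gold, pred)) / n
    las = sum(g[1] == p[1] and g[2] == p[2] for g, p in zip(gold, pred)) / n
    return {"upos": upos, "uas": uas, "las": las}


# Toy sentence with placeholder forms and SUD-style labels.
GOLD_TEXT = "\n".join([
    "# sent_id = toy-1",
    conllu_line(1, "w1", "DET", 2, "det"),
    conllu_line(2, "w2", "NOUN", 3, "subj"),
    conllu_line(3, "w3", "VERB", 0, "root"),
    conllu_line(4, "w4", "NOUN", 3, "comp:obj"),
])

# Hypothetical system output: the last token has the right head
# but the wrong tag and the wrong label.
PRED_TEXT = "\n".join([
    "# sent_id = toy-1",
    conllu_line(1, "w1", "DET", 2, "det"),
    conllu_line(2, "w2", "NOUN", 3, "subj"),
    conllu_line(3, "w3", "VERB", 0, "root"),
    conllu_line(4, "w4", "VERB", 3, "mod"),
])

scores = evaluate(read_tokens(GOLD_TEXT), read_tokens(PRED_TEXT))
print(scores)  # {'upos': 0.75, 'uas': 1.0, 'las': 0.75}
```

In the toy case the parser attaches every token to the correct head, so UAS is 1.0, while the single wrong label and wrong tag each cost one token out of four. Reporting the three numbers separately shows whether a model's errors come from tagging, attachment, or labeling.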
