

Bridging the Data Skills Gap for African Languages
What does it take to ensure African languages are properly represented in the AI systems shaping our future?
In this interactive workshop, computational linguists and data curators from Tonative will guide you through the foundations of African language data curation and documentation for AI.
As AI systems continue to evolve, many African languages remain underrepresented due to limited high-quality datasets. This session is designed to bridge that gap by equipping language speakers, enthusiasts, and aspiring curators with practical skills to contribute directly to AI model training and evaluation.
During the workshop, participants will learn:
✅ Why the African language data gap exists and its impact on AI systems
✅ The basics of linguistics, morphology, and language documentation
✅ How translation and data validation workflows operate in real-world AI projects
✅ The tools and platforms used for dataset curation
✅ How to validate real datasets through a guided hands-on session
No technical background is required. If you are passionate about African languages, language preservation, or inclusive AI systems, this workshop is for you.
This session is open to native speakers, students, researchers, creatives, and anyone interested in contributing to the future of African language technology.
📅 Tuesday, May 26, 2026
⏰ 5:00 PM (WAT)
📍 Register to attend
We look forward to having you! 🚀