WAXAL: Google Releases 2,400 Hours of Speech Data for 27 African Languages

Google Research open-sources WAXAL, a dataset of 1,846 hours of natural speech and 565 hours of studio recordings covering 27 Sub-Saharan African languages spoken by over 100 million people.
artificial-intelligence
Author

Kabui, Charles

Published

2026-03-19

Keywords

african-languages, speech-recognition, text-to-speech, open-dataset, google-research