# Dataset ## Text * [Tatoeba](https://tatoeba.org/cmn) **Tatoeba is a collection of sentences and translations.** It's collaborative, open, free and even addictive. An open data initiative aimed at translation and speech recognition. ## Speech * [Tatoeba](https://tatoeba.org/cmn) **Tatoeba is a collection of sentences and translations.** It's collaborative, open, free and even addictive. An open data initiative aimed at translation and speech recognition. ### ASR Noise * [asr-noises](https://github.com/speechio/asr-noises)