Microsoft Download Center Archive
Microsoft Speech Corpus (Indian languages) |
This dataset contains conversational and phrasal speech training and test data for Telugu, Tamil and Gujarati languages.
Microsoft Speech Corpus (Indian languages) release contains conversational and phrasal speech training and test data for Telugu, Tamil and Gujarati languages. The data package includes audio and corresponding transcripts. Data provided in this dataset shall not be used for commercial purposes. You may use the data solely for research purposes. If you publish your findings, you must provide the following attribution: “Data provided by Microsoft and SpeechOcean.com”.
Files
Status: LiveThis download is still available on microsoft.com. The downloads below will come directly from the Microsoft Download Center. |
| Files |
|---|
File sizes and hashes are retrieved from the Wayback Machine’s indexes. They may not match the latest versions of files hosted on Microsoft servers.
System Requirements
Operating Systems: Windows 10, Windows 7, Windows 8, Windows 11
- Windows 7, Windows 8, Windows 10, or Windows 11
Installation Instructions
- Click Download and follow the instructions.