Microsoft Download Center Archive

Microsoft Speech Corpus (Indian languages)

  • Published:
  • Version: 1.0
  • Product: Other
  • Language: English

This dataset contains conversational and phrasal speech training and test data for the Telugu, Tamil, and Gujarati languages.

The Microsoft Speech Corpus (Indian languages) data package includes audio and corresponding transcripts. Data provided in this dataset shall not be used for commercial purposes. You may use the data solely for research purposes. If you publish your findings, you must provide the following attribution: “Data provided by Microsoft and SpeechOcean.com”.
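
Because the package is described only as audio plus transcripts, it can help to survey the archive before extracting it. The following is a minimal Python sketch that counts archive members by file extension without unpacking anything. The internal layout and file types of the archive are not documented on this page, so the function name `count_extensions` and any extensions it reports are illustrative assumptions, not a description of the actual contents.

```python
import zipfile
from collections import Counter
from pathlib import PurePosixPath


def count_extensions(zip_path):
    """Count archive members by lowercase file extension without extracting.

    Hypothetical helper: the real archive layout is not documented here.
    """
    counts = Counter()
    with zipfile.ZipFile(zip_path) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue  # skip directory entries, count files only
            counts[PurePosixPath(info.filename).suffix.lower()] += 1
    return counts


# Example call, assuming the download sits in the current directory:
# print(count_extensions("microsoftspeechcorpusindianlanguages.zip"))
```

Reading only the central directory this way is quick even for a multi-gigabyte archive, since no member data is decompressed.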

Files

Status: Live

This download is still available on microsoft.com. The downloads below will come directly from the Microsoft Download Center.

microsoftspeechcorpusindianlanguages.zip
  • Size: 12.27 GB

File sizes and hashes are retrieved from the Wayback Machine’s indexes. They may not match the latest versions of files hosted on Microsoft servers.
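
Since the listed size may not match the file currently hosted, a downloaded copy can be checked locally. The sketch below, in Python, computes the size and a SHA-256 digest of a file in chunks. This page lists no hash value, so the digest is only useful for comparison against a value obtained elsewhere. The helper names `file_size_and_sha256` and `size_in_gb` are hypothetical, and treating "GB" as 2**30 bytes is an assumption about how the listed size was computed.

```python
import hashlib
import os


def file_size_and_sha256(path, chunk_size=1024 * 1024):
    """Return (size_in_bytes, sha256_hex) for the file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so a multi-gigabyte file never sits in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return os.path.getsize(path), digest.hexdigest()


def size_in_gb(size_bytes):
    """Convert bytes to GB, assuming 1 GB = 2**30 bytes (an assumption)."""
    return round(size_bytes / 2**30, 2)


# Example call, assuming the download sits in the current directory:
# size, sha256_hex = file_size_and_sha256("microsoftspeechcorpusindianlanguages.zip")
# print(size_in_gb(size), sha256_hex)
```

A size that differs from the listed 12.27 GB does not by itself indicate a corrupted download, because the hosted file may have changed since the snapshot.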

System Requirements

Operating Systems: Windows 7, Windows 8, Windows 10, or Windows 11

Installation Instructions

  • Click Download and follow the instructions.

This page was generated from a snapshot of the Microsoft Download Center.