Microsoft Download Center Archive

Microsoft Research IME Corpus

  • Published:
  • Version: 1.0
  • Category: other
  • Language: English

This download consists of data only: it provides a test data set for the task of Japanese character conversion for text input. Last published: December 21, 2005.

This download consists of data only: it provides a test data set for the task of Japanese character conversion for text input. The data set consists of: (1) reference files, which consist of Japanese sentences that are randomly extracted from news articles (no more than one sentence has been extracted per news article); (2) reading files, which consist of corresponding kana readings for the sentences in the reference files; (3) n-best files, which contain 100-best conversion candidates for each sentence in the reading files. More detailed information about the corpus is found in the technical report, Microsoft Research IME Corpus, MSR-TR-2005-168.

Files

Status: Live

This download is still available on microsoft.com. The downloads below will come directly from the Microsoft Download Center.

System Requirements

Operating Systems: Windows 10, Windows 7, Windows 8

  • Windows 7, Windows 8, or Windows 10

Installation Instructions

  • Click Download and follow the instructions.
This page was generated from a snapshot of the Microsoft Download Center made on .
Report a problem