Microsoft Download Center Archive

Microsoft Research Paraphrase Corpus

  • Published:
  • Version: 1.0
  • Category: Tool
  • Language: English

This download consists of data only: a text file containing 5800 pairs of sentences which have been extracted from news sources on the web, along with human annotations indicating whether each pair captures a paraphrase/semantic equivalence relationship. Last published: March 3, 2005.

  • This download consists of data only: a text file containing 5800 pairs of sentences which have been extracted from news sources on the web, along with human annotations indicating whether each pair captures a paraphrase/semantic equivalence relationship. No more than 1 sentence has been extracted from any given news article. We have made a concerted effort to correctly associate with each sentence information about its provenance and any associated information about its author. If any attribution information is incorrect or missing, please send email to [email protected] and we will update the file.

Files

Status: Live

This download is still available on microsoft.com. The downloads below will come directly from the Microsoft Download Center.

FileSize
MSRParaphraseCorpus.msi
SHA1: fac1cf160c283943239b013dd660139255be487d
1.22 MB

File sizes and hashes are retrieved from the Wayback Machine’s indexes. They may not match the latest versions of files hosted on Microsoft servers.

System Requirements

Operating Systems: Windows 10, Windows 7, Windows 8

    • Windows 7, Windows 8, or Windows 10

Installation Instructions

    • Click Download and follow the instructions.