Microsoft Download Center Archive

MSR Abstractive Text Compression Dataset

  • Published:
  • Version: 1.0
  • Product: Other
  • Language: English

This dataset contains sentences and short paragraphs with corresponding shorter (compressed) versions. There are up to five compressions for each input text, together with quality judgements of their meaning preservation and grammaticality.

This dataset contains sentences and short paragraphs with corresponding shorter (compressed) versions. There are up to five compressions for each input text, together with quality judgements of their meaning preservation and grammaticality. The dataset is derived using source texts from the Open American National Corpus (ww.anc.org) and crowd-sourcing. More details can be found in the included README and the paper: “A dataset and evaluation metrics for abstractive compression of sentences and short paragraphs” [Toutanova, Brockett, Tran, and Amershi, EMNLP 2016].

Files

Status: Live

This download is still available on microsoft.com. The downloads below will come directly from the Microsoft Download Center.

Files
Release.zip

    File sizes and hashes are retrieved from the Wayback Machine’s indexes. They may not match the latest versions of files hosted on Microsoft servers.

    System Requirements

    Operating Systems: Windows 10, Apple Mac OS X, Windows 8, Linux, Android

    • Windows 8, Windows 10, Android, Apple Mac OS X, Linux

    Installation Instructions

    • Click Download and follow the instructions.
    This page was generated from a snapshot of the Microsoft Download Center made on .