This is the Trace Id: 8554cf9f6e9c3bd992e27e5726a9f0b7
Skip to main content Microsoft 365 Office Azure Copilot Windows Support Windows Apps OneDrive Outlook Moving from Skype to Teams OneNote Microsoft Teams Accessories Xbox games Microsoft AI Microsoft Security Azure Dynamics 365 Microsoft 365 for business Microsoft Power Platform Windows 365 Digital Sovereignty Microsoft Developer Microsoft Learn Support for AI marketplace apps Microsoft Tech Community Microsoft Marketplace Visual Studio Marketplace Rewards Free downloads & security Education Gift cards View Sitemap

MSR Abstractive Text Compression Dataset

This dataset contains sentences and short paragraphs with corresponding shorter (compressed) versions. There are up to five compressions for each input text, together with quality judgements of their meaning preservation and grammaticality.

Important! Selecting a language below will dynamically change the complete page content to that language.

Download
  • Version:

    1.0

    Date Published:

    15/07/2024

    File Name:

    Release.zip

    File Size:

    17.5 MB

    This dataset contains sentences and short paragraphs with corresponding shorter (compressed) versions. There are up to five compressions for each input text, together with quality judgements of their meaning preservation and grammaticality. The dataset is derived using source texts from the Open American National Corpus (ww.anc.org) and crowd-sourcing. More details can be found in the included README and the paper: “A dataset and evaluation metrics for abstractive compression of sentences and short paragraphs” [Toutanova, Brockett, Tran, and Amershi, EMNLP 2016].
  • Supported Operating Systems

    Android, Apple Mac OS X, Linux, Windows 10, Windows 8

    • Windows 8, Windows 10, Android, Apple Mac OS X, Linux
    • Click Download and follow the instructions.