A Repository of Machine Summaries for Generic News Summarization on DUC 2004
This repository includes summaries produced by several state-of-the-art summarization systems and popular baseline systems on DUC 2004 task 2.
We provide summaries from the following systems:
- Baseline systems
FreqSum (probability), TsSum (topic signatures), Centroid, Cont. LexRank, GreedyKL
- State-of-the-art systems
CLASSY 04 (Peer 65), CLASSY 11, DPP, ICSISumm, OCCAMS_V, RegSum, Submodular
The summaries are available here: [link]
A layout of the corpus can be found in the README file.
More details about implementation of the systems, choices of ROUGE settings, pairwise comparison between systems and summary overlap at different levels are in our paper:
Kai Hong, John M. Conroy, Benoit Favre, Alex Kulesza, Hui Lin, and Ani Nenkova
A Repository of State of the Art and Competitive Baseline Summaries for Generic News Summarization
In Proceedings of LREC, 2014 [pdf]