ROUGE 2.0 is a Java package for evaluating summarization tasks. It builds on the Perl implementation of ROUGE and adds some updated and improved measures.
This is an abstractive summarization demo program. It was mainly used to summarize opinions, but since it does not rely on any domain information, it can be used to summarize any highly redundant text.
This paper presents a flexible framework for generating very short abstractive summaries. The key idea is to use a word graph data structure, referred to as the Opinosis-Graph, to represent the text to be summarized. We then repeatedly find paths through this graph to produce concise summaries. We consider Opinosis a "shallow" abstractive summarizer because it uses the original text itself to generate summaries. This is unlike a true abstractive summarizer, which would need a deeper level of natural language understanding.
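To make the word graph idea concrete, here is a minimal sketch in Python. It is not the Opinosis implementation: the function names `build_word_graph` and `greedy_path` are hypothetical, tokenization is plain whitespace splitting, and the greedy path search is a simple stand-in for whatever path scoring the actual system uses. It only illustrates how redundant sentences collapse into shared nodes and heavily used edges.

```python
from collections import defaultdict


def build_word_graph(sentences):
    """Build a simple directed word graph (illustrative, not Opinosis itself).

    positions[word] -> list of (sentence_id, position_id) where the word occurs
    edges[word]     -> dict mapping each following word to how often it follows
    """
    positions = defaultdict(list)
    edges = defaultdict(lambda: defaultdict(int))
    for sid, sentence in enumerate(sentences):
        words = sentence.lower().split()
        for pid, word in enumerate(words):
            positions[word].append((sid, pid))
            if pid + 1 < len(words):
                edges[word][words[pid + 1]] += 1
    return positions, edges


def greedy_path(edges, start, max_len=10):
    """Follow the most frequent outgoing edge from `start`, never revisiting a word.

    Assumption: a greedy walk replaces the real path scoring for brevity.
    """
    path = [start]
    seen = {start}
    while len(path) < max_len:
        candidates = {
            w: c for w, c in edges.get(path[-1], {}).items() if w not in seen
        }
        if not candidates:
            break
        # Highest count first; ties broken alphabetically for determinism.
        nxt = min(candidates, key=lambda w: (-candidates[w], w))
        path.append(nxt)
        seen.add(nxt)
    return path


reviews = [
    "the battery life is very good",
    "the battery life is great",
    "battery life is very good",
]
positions, edges = build_word_graph(reviews)
print(" ".join(greedy_path(edges, "battery")))  # battery life is very good
```

Because the three input sentences share most of their words, the edges along "battery life is very good" accumulate the highest counts, so the walk recovers a phrase that every review supports.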
While the evaluation is on an opinion dataset, the approach itself is general in that it can be applied to any corpus containing a high amount of redundancy, for example, Twitter comments or user comments on blog/news articles. A very similar work to ours (published at the same time and at the same conference) is the following:
Katja Filippova. Multi-sentence compression: Finding shortest paths in word graphs.
Proceedings of the 23rd International Conference on Computational Linguistics (COLING 10). Beijing, China, August 23-27, 2010.
Katja's work was evaluated on a news dataset (Google News) for both English and Spanish, while ours was evaluated on user reviews from various sources (English only). She studies the informativeness and grammaticality of sentences. In a similar way, we evaluate these aspects by studying how close the Opinosis summaries are to human-composed summaries in terms of information overlap and readability (the latter judged by a human assessor).
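Information overlap between a generated summary and a human-composed one is commonly approximated with n-gram recall in the style of ROUGE-N. The sketch below is an illustration under that assumption only: `ngram_recall` is a hypothetical helper, not the API of the ROUGE 2.0 package, and it is not claimed to reproduce the exact evaluation procedure used for Opinosis.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def ngram_recall(candidate, reference, n=1):
    """Fraction of reference n-grams that also appear in the candidate.

    Counts are clipped, so a repeated n-gram is only credited as often
    as it occurs in the candidate.
    """
    cand = ngrams(candidate.lower().split(), n)
    ref = ngrams(reference.lower().split(), n)
    total = sum(ref.values())
    if total == 0:
        return 0.0
    overlap = sum(min(count, cand[gram]) for gram, count in ref.items())
    return overlap / total


system_summary = "battery life is very good"
human_summary = "the battery life is good"
print(ngram_recall(system_summary, human_summary, n=1))  # 0.8
print(ngram_recall(system_summary, human_summary, n=2))  # 0.5
```

Unigram recall rewards shared vocabulary, while bigram recall is stricter about word order. Readability is not captured by either score, which is why it is assessed separately by a human.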
A. Nenkova, and R. Passonneau. Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics: HLT-NAACL 2004, page 145--152. Boston, Massachusetts, USA, Association for Computational Linguistics, (2004)
H. Zhou, C. Otto, and R. Ewerth. Digital Libraries for Open Knowledge - 23rd International Conference on Theory and Practice of Digital Libraries, TPDL 2019, Oslo, Norway, September 9-12, 2019, Proceedings, volume 11799 of Lecture Notes in Computer Science, page 327-335. Springer, (2019)
T. Takeda, and A. Takasu. JCDL '07: Proceedings of the 7th ACM/IEEE joint conference on Digital libraries, page 438--439. New York, NY, USA, ACM, (2007)
S. Xu, H. Jiang, and F. Lau. IUI '09: Proceedings of the 13th international conference on Intelligent user interfaces, page 7--16. New York, NY, USA, ACM, (2008)
S. Tucker, and S. Whittaker. Proceedings of the 13th international conference on Intelligent user interfaces, page 37--46. New York, NY, USA, ACM, (2009)
R. Yu, U. Gadiraju, X. Zhu, B. Fetahu, and S. Dietze. The Semantic Web - ESWC 2016 Satellite Events, Heraklion, Crete, Greece, May 29 - June 2, 2016, Revised Selected Papers, page 69--73. (2016)
T. Tran, C. Niederée, N. Kanhabua, U. Gadiraju, and A. Anand. Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, CIKM 2015, Melbourne, VIC, Australia, October 19 - 23, 2015, page 1201--1210. (2015)
D. Nguyen, and J. Leveling. Natural Language Processing and Information Systems - 18th International Conference on Applications of Natural Language to Information Systems, NLDB 2013, Salford, UK, June 2013, Proceedings, volume 7934 of Lecture Notes in Computer Science (LNCS), page 90--101. Springer, (2013)
R. Ribaldo, A. Akabane, L. Rino, and T. Pardo. Computational Processing of the Portuguese Language, volume 7243 of Lecture Notes in Computer Science, Springer Berlin Heidelberg, (2012)
S. Mohammad, B. Dorr, M. Egan, A. Hassan, P. Muthukrishan, V. Qazvinian, D. Radev, and D. Zajic. Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, page 584--592. Stroudsburg, PA, USA, Association for Computational Linguistics, (2009)
S. Miranda-Jiménez, A. Gelbukh, and G. Sidorov. Proceedings of the 21th International Conference on Conceptual Structures (ICCS 2013), volume 7735 of Lecture Notes in Computer Science, page 245-253. Springer, (2013)
M. Hu, and B. Liu. Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, page 168--177. New York, NY, USA, ACM, (2004)
J. Delort, and E. Alfonseca. Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL'12), Avignon, France, (2012)
Y. Wang, Z. Huang, Y. Zeng, N. Zhong, and F. van Harmelen. Proceedings of the 2nd International Workshop on Cyber-Physical Society (IWCPS2011), co-located with the SKG2011, Beijing, China, (October 2011)
H. Zha. Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, page 113-120. Tampere, Finland, (2002)
T. Sakai, and K. Spärck Jones. Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, (2001)
L. Rino, T. Pardo, C. Silla Jr., C. Kaestner, and M. Pombo. Proceedings of the 17th Brazilian Symposium on Artificial Intelligence (SBIA), page 235-244. São Luis-MA, Brazil, (September 2004)
L. Rino, and T. Pardo. Anais do XXIII Congresso da Sociedade Brasileira de Computação - Volume VIII: III Jornada de Minicursos de Inteligência Artificial, page 203-245. (2003)
L. Rino, and M. Nunes. Notas Didáticas do ICMC, 67. Instituto de Ciências Matemáticas e de Computação - Universidade de São Paulo, São Carlos-SP, (Outubro 2005)
T. Pardo, L. Rino, and M. Nunes. Proceedings of the 1st International Information Technology Symposium - I2TS, page 1-6. Florianópolis-SC, Brazil, (October 2002)
T. Pardo, L. Rino, and M. Nunes. Anais do IV Encontro Nacional de Inteligência Artificial - ENIA, page 1-10. Campinas-SP, Brasil, (2 a 8 de Agosto 2003)
T. Pardo, L. Rino, and M. Nunes. Proceedings of the 6th Workshop on Computational Processing of Written and Spoken Portuguese (PROPOR), volume 2721 of LNAI, page 210-218. Springer-Verlag, (June 2003)
T. Pardo, and L. Rino. Série de Relatórios do NILC, NILC-TR-03-09. Núcleo Interinstitucional de Lingüística Computacional (NILC), São Carlos-SP, (Outubro 2003)
T. Pardo, L. Antiqueira, M. Nunes, O. Oliveira Jr., and L. Costa. Proceedings of the 7th Workshop on Computational Processing of Written and Spoken Portuguese (PROPOR), volume 3960 of LNAI, page 1-10. Springer-Verlag, (May 2006)
K. McKeown, R. Passonneau, D. Elson, A. Nenkova, and J. Hirschberg. Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, page 210-217. Salvador, Brazil, ACM Press, (2005)
I. Mani, E. Bloedorn, and B. Gates. Proceedings of the Spring Symposium on Intelligent Text Summarization (AAAI 98), page 69-76. Stanford, CA, AAAI Press, (March 1998)
D. Leite, and L. Rino. Proceedings of the International Joint Conference IBERAMIA-SBIA 2006, volume 4140 of LNAI, page 462-471. Springer-Verlag, (2006)