QUERY-BASED SUMMARIZATION METHODS FOR CONVERSATIONAL AGENTS: AN OVERVIEW

Ketakee Nimavat; Hetal Joshiara

doi:10.26483/ijarcs.v8i8.4788

PDF

Published: Oct 20, 2017

DOI: https://doi.org/10.26483/ijarcs.v8i8.4788

Keywords:

query based summarization, extractive summarization, abstractive summarization, Conversational agents, raw unstructured text data, text cubes,

Ketakee Nimavat

UG student , Computer Engineering, L.D. College Of Engineering

Hetal Joshiara

Professor, Computer Engineering Department, L.D. College Of Engineering

Abstract

Summarization is a topic that will be of a great important in the coming age since intelligent assistants especially the ones in the form of conversational agents will have to sift through the abundance of raw unstructured text data to provide relevant information. The data will be in the form of Social media posts, content websites and other user generated text content from which the user shall require tailored information from and about the data. The paper hence explores various methods for summarization and focuses particularly on extracting the gist from the perspective of a given keyword i.e. query based summarization from raw unstructured text data sources available at scale. Along with that, the need for a proper framework to mine relevant knowledge from the said data is acknowledged and the challenges that a conversational agent would hence face are identified. Various approaches that contribute to building a framework and solve the identified challenges are explored as well. It is hoped that the approaches discussed in the paper will be of use to researchers building algorithms in areas of knowledge mining and understanding, such as summarization, that deal with the challenges that are expected to arise.

Downloads

Download data is not yet available.

Issue

Vol. 8 No. 8 (2017): September-October

Section

Articles

COPYRIGHT

Submission of a manuscript implies: that the work described has not been published before, that it is not under consideration for publication elsewhere; that if and when the manuscript is accepted for publication, the authors agree to automatic transfer of the copyright to the publisher.

Authors who publish with this journal agree to the following terms:

Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work
The journal allows the author(s) to retain publishing rights without restrictions.
The journal allows the author(s) to hold the copyright without restrictions.

References

M. Gambhir and V. Gupta, â€œRecent automatic text summarization techniques: a survey,â€ Artif. Intell. Rev., vol. 47, no. 1, 2017.

A. Nenkova, â€œAutomatic Summarization,â€ Found. TrendsÂ® Inf. Retr., vol. 5, no. 2, pp. 103â€“233, 2011.

S. Fisher and B. Roark, â€œQuery-Focused Summarization By Supervised Sentence Ranking and Skewed Word Distributions,â€ Proc. 6th Doc. Underst. Conf. . DUC, 2006.

K.-F. Wong, M. Wu, and W. Li, â€œExtractive Summarization Using Supervised and Semi-supervised Learning,â€ Proc. 22nd Int. Conf. Comput. Linguist. 1. Assoc. Comput. Linguist. 2008., no. August, pp. 985â€“992, 2008.

L. Logeswaran, H. Lee, and D. Radev, â€œSentence Ordering using Recurrent Neural Networks,â€ pp. 1â€“15, 2016.

S. Park and B. R. Cha, â€œQuery-based multi-document summarization using non-negative semantic feature and NMF clustering,â€ Proc. - 4th Int. Conf. Networked Comput. Adv. Inf. Manag. NCM 2008, vol. 2, pp. 609â€“614, 2008.

V. Gupta, â€œHybrid Algorithm for Multilingual Summarization,â€ pp. 717â€“727, 2013.

P. Nema, M. Khapra, A. Laha, and B. Ravindran, â€œDiversity driven Attention Model for Query-based Abstractive Summarization,â€ 2017.

V. Gupta and G. S. Lehal, â€œA Survey of Text Summarization Extractive techniques,â€ J. Emerg. Technol. Web Intell., vol. 2, no. 3, pp. 258â€“268, 2010.

M. Allahyari et al., â€œText Summarization Techniques: A Brief Survey,â€ no. 1, 2017.

J.-G. Yao, X. Wan, and J. Xiao, â€œPhrase-based Compressive Cross-Language Summarization,â€ Conf. Empir. Methods Nat. Lang. Process., no. September, pp. 118â€“127, 2015.

D. Wang, S. Zhu, and T. Li, â€œSumView: A Web-based engine for summarizing product reviews and customer opinions,â€ Expert Syst. Appl., vol. 40, no. 1, pp. 27â€“33, 2013.

D. Radev, W. Fan, H. Qi, H. Wu, and A. Grewal, â€œProbabilistic question answering on the Web,â€ J. Am. Soc. Inf. Sci. Technol., vol. 56, no. 6, pp. 571â€“583, 2005.

G. Carenini, R. T. Ng, and X. Zhou, â€œSummarizing email conversations with clue words,â€ Proc. 16th Int. Conf. World Wide Web - WWW â€™07, p. 91, 2007.

A. Nenkova and A. Bagga, â€œFacilitating Email Thread Access by Extractive Summary Generation,â€ Recent Adv. Nat. Lang. Process. III, Sel. Pap. from RANLPâ€™03, vol. 260, pp. 287â€“296, 2003.

O. Rambow, L. Shrestha, J. Chen, and C. Lauridsen, â€œSummarizing Email Threads,â€ Proc. HLT-NAACL 2004 Short Pap. XX - HLT-NAACL â€™04, pp. 105â€“108, 2004.

M. A. H. Khan, D. Bollegala, G. Liu, and K. Sezaki, â€œMulti-tweet summarization of real-time events,â€ Proc. - Soc. 2013, no. September, pp. 128â€“133, 2013.

L. Shou, Z. Wang, K. Chen, and G. Chen, â€œSumblr: continuous summarization of evolving tweet streams,â€ Proc. 36th Int. ACM SIGIR Conf. Res. Dev. Inf. Retr. - SIGIR â€™13, p. 533, 2013.

X. Liu, Y. Li, F. Wei, and M. Zhou, â€œGraph-Based Multi-Tweet Summarization using Social Signals.,â€ Coling, vol. 2, no. December 2012, pp. 1699â€“1714, 2012.

T. Mondal, P. Pramanik, I. Bhattacharya, A. Saha, and N. Boral, â€œTowards development of FOPL based tweet summarization technique in a post disaster scenario: From survey to solution,â€ 2017 51st Annu. Conf. Inf. Sci. Syst. CISS 2017, 2017.

J. B. S. Ong, Z. Wang, R. S. M. Goh, X. F. Yin, X. Xin, and X. Fu, â€œUnderstanding Natural Disasters as Risks in Supply Chain Management through Web Data Analysis,â€ Int. J. Comput. Commun. Eng., vol. 4, no. 2, pp. 126â€“133, 2015.

Z. Lu and K. Grauman, â€œStory-driven summarization for egocentric video,â€ Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 2714â€“2721, 2013.

A. Fuji and T. Ishikawa, â€œA System for Summarizing and Visualizing Arguments in Subjective Documents: Toward Supporting Decision Making,â€ in Proceedings of the Workshop on Sentiment and Subjectivity in Text, 2006, vol. 69â€“72, no. July, pp. 15â€“22.

D. R. Radev, H. Jing, and M. Budzikowska, â€œCentroid-based summarization of multiple documents: sentence extraction, utility-based evaluation, and user studies,â€ Inf. Process. Manag. 40.6 919-938., vol. 40, no. 6, p. 10, 2000.

M. Damova and I. Koychev, â€œQuery-Based Summarization : A survey.â€

J. M. Conroy and J. G. Stewart, â€œCLASSY Query-Based Multi-Document Summarization,â€ Proc. DUC2005, 2005.

Q. M. Summarization, S. D. Silva, N. Joshi, S. Rao, S. Venkatraman, and S. Shrawne, â€œImproved Algorithms for Document Classification &,â€ vol. 3, no. 4, 2011.

D. J. Brenes, D. Gayo-Avello, and K. PÃ©rez-GonzÃ¡lez, â€œSurvey and evaluation of query intent detection methods,â€ Proc. 2009 Work. Web Search Click Data - WSCD â€™09, pp. 1â€“7, 2009.

L. Wang, H. Raghavan, V. Castelli, R. Florian, and C. Cardie, â€œA Sentence Compression Based Framework to Query-Focused Multi-Document Summarization,â€ 2016.

S. Gupta, A. Nenkova, and D. Jurafsky, â€œMeasuring importance and query relevance in topic-focused multi-document summarization,â€ Proc. 45th Annu. Meet. ACL Interact. Poster Demonstr. Sess. - ACL â€™07, no. June, p. 193, 2007.

C. Y. Lin, â€œRouge: A package for automatic evaluation of summaries,â€ Proc. Work. text Summ. branches out (WAS 2004), no. 1, pp. 25â€“26, 2004.

A. Abdi, N. Idris, R. M. Alguliyev, and R. M. Aliguliyev, â€œQuery-based multi-documents summarization using linguistic knowledge and content word expansion,â€ Soft Comput., vol. 21, no. 7, pp. 1785â€“1801, 2017.

F. Tao et al., â€œMulti-Dimensional, Phrase-Based Summarization in

Text Cubes,â€ Data Eng., p. 74, 2016.

Article Sidebar

Main Article Content

Abstract

Downloads

Article Details

References