IR @ Goa University

Extractive based email summarization: An unsupervised hybrid approach using graph based sentence ranking and K-means clustering algorithm

dc.contributor.author Shivolkar, M.
dc.contributor.author Pawar, J.D.
dc.contributor.author Baskar, S.
dc.date.accessioned 2016-08-25T04:46:27Z
dc.date.available 2016-08-25T04:46:27Z
dc.date.issued 2016
dc.identifier.citation Proc. 3. Workshop on Indian Language Data: Resources and Evaluation (WILDRE3), Portoroz, Slovenia. 24 May 2016. 2016; 49-54. en_US
dc.identifier.uri http://irgu.unigoa.ac.in/drs/handle/unigoa/4463
dc.description.abstract Automatic text summarization has been widely studied over the years. Here, an attempt is made to generate an automatic summary of a given text document using an unsupervised hybrid model. The model combines an extractive method, graph-based text ranking, with a clustering algorithm, K-means. Ranked sentences are obtained using the graph-theoretic ranking model, in which word frequency, word position, and string-pattern-based rankings are calculated. The K-means algorithm generates coherent topic clusters. Using the output of the graph-based method and the K-means clusters, a Sentence Importance Score (SIS) is calculated for each sentence, where the top 70 percent of ranked sentences and the centralised topics of each cluster (excluding those topics which fall in the outlier zone) are used. The unsupervised hybrid approach attempts to mimic the human practice of reading a text and then summarizing it briefly, while preserving the original insight of that text by virtue of its important sentences and keywords. The system is tested on the dataset for Summarization and Keyword Extraction from Emails and, on evaluation with the ROUGE 2.0 tool, achieves an average score of 0.57. en_US
dc.publisher JNU, New Delhi; Microsoft Res Lab, Bangalore and Anna Univ, Chennai en_US
dc.subject Computer Science and Technology en_US
dc.title Extractive based email summarization: An unsupervised hybrid approach using graph based sentence ranking and K-means clustering algorithm en_US
dc.type Conference article en_US
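
The pipeline outlined in the abstract (graph-based sentence ranking, K-means topic clustering, and a combined Sentence Importance Score) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the SIS formula, so the product of graph rank and centroid similarity used here is an assumption, as are all function names, the sentence splitter, the deterministic K-means seeding, and the parameters `k`, `keep_ratio` and `summary_size`. Word-position and string-pattern ranking and the outlier-zone filter are omitted.

```python
import math
import re
from collections import Counter

def split_sentences(text):
    # Naive splitter on ., ! or ? followed by whitespace (an assumption).
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

def bag_of_words(sentence):
    # Word-frequency vector for one sentence.
    return Counter(re.findall(r"[a-z0-9]+", sentence.lower()))

def cosine(a, b):
    dot = sum(a[w] * b[w] for w in a if w in b)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def graph_rank(bags, damping=0.85, iterations=50):
    # PageRank-style power iteration over a sentence similarity graph.
    n = len(bags)
    sim = [[cosine(bags[i], bags[j]) if i != j else 0.0 for j in range(n)]
           for i in range(n)]
    out = [sum(row) for row in sim]
    scores = [1.0 / n] * n
    for _ in range(iterations):
        scores = [
            (1 - damping) / n
            + damping * sum(sim[j][i] / out[j] * scores[j]
                            for j in range(n) if out[j] > 0)
            for i in range(n)
        ]
    return scores

def kmeans(bags, k, iterations=10):
    # Plain K-means with cosine similarity; seeds are evenly spaced
    # sentences so the result is deterministic (an assumption).
    step = len(bags) / k
    centroids = [dict(bags[int(i * step)]) for i in range(k)]
    labels = [0] * len(bags)
    for _ in range(iterations):
        labels = [max(range(k), key=lambda c: cosine(bag, centroids[c]))
                  for bag in bags]
        for c in range(k):
            members = [bags[i] for i, lab in enumerate(labels) if lab == c]
            if members:
                total = Counter()
                for m in members:
                    total.update(m)
                centroids[c] = {w: v / len(members) for w, v in total.items()}
    return labels, centroids

def summarize(text, k=2, keep_ratio=0.7, summary_size=2):
    sentences = split_sentences(text)
    if not sentences:
        return []
    bags = [bag_of_words(s) for s in sentences]
    rank = graph_rank(bags)
    labels, centroids = kmeans(bags, min(k, len(sentences)))
    # Keep only the top-ranked share of sentences (70 percent by default).
    keep = max(1, math.ceil(keep_ratio * len(sentences)))
    shortlisted = sorted(range(len(sentences)),
                         key=lambda i: rank[i], reverse=True)[:keep]
    # Hypothetical SIS: graph rank weighted by closeness to the sentence's
    # own cluster centroid. The paper's actual formula is not in the abstract.
    sis = {i: rank[i] * cosine(bags[i], centroids[labels[i]])
           for i in shortlisted}
    chosen = sorted(sorted(sis, key=sis.get, reverse=True)[:summary_size])
    return [sentences[i] for i in chosen]
```

Calling `summarize(email_text)` returns the selected sentences in their original order; changing `keep_ratio` or `k` shows how the shortlist and the clustering each affect the extract.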