Full text

Turn on search term navigation

Copyright © 2020 Atif Khan et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. http://creativecommons.org/licenses/by/4.0/

Abstract

With the growing information on web, online movie review is becoming a significant information resource for Internet users. However, online users post thousands of movie reviews on daily basis and it is hard for them to manually summarize the reviews. Movie review mining and summarization is one of the challenging tasks in natural language processing. Therefore, an automatic approach is desirable to summarize the lengthy movie reviews, and it will allow users to quickly recognize the positive and negative aspects of a movie. This study employs a feature extraction technique called bag of words (BoW) to extract features from movie reviews and represent the reviews as a vector space model or feature vector. The next phase uses Naïve Bayes machine learning algorithm to classify the movie reviews (represented as feature vector) into positive and negative. Next, an undirected weighted graph is constructed from the pairwise semantic similarities between classified review sentences in such a way that the graph nodes represent review sentences, while the edges of graph indicate semantic similarity weight. The weighted graph-based ranking algorithm (WGRA) is applied to compute the rank score for each review sentence in the graph. Finally, the top ranked sentences (graph nodes) are chosen based on highest rank scores to produce the extractive summary. Experimental results reveal that the proposed approach is superior to other state-of-the-art approaches.

Details

Title
Movie Review Summarization Using Supervised Learning and Graph-Based Ranking Algorithm
Author
Khan, Atif 1   VIAFID ORCID Logo  ; Gul, Muhammad Adnan 1 ; Zareei, Mahdi 2   VIAFID ORCID Logo  ; Biswal, R R 2 ; Zeb, Asim 3   VIAFID ORCID Logo  ; Naeem, Muhammad 3 ; Yousaf Saeed 4 ; Salim, Naomie 5 

 Department of Computer Science, Islamia College University Peshawar, Peshawar 25000, KP, Pakistan 
 Tecnologico de Monterrey, Escuela de Ingenieria y Ciencias, Zapopan, Jalisco 45138, Mexico 
 Department of Computer Science, Abbottabad University of Science and Technology, Abbottabad 25000, Pakistan 
 Department of Information Technology, University of Haripur, Haripur, KP, Pakistan 
 School of Computing, Faculty of Engineering, Universiti Teknologi Malaysia, Johor Bahru, Malaysia 
Editor
Luis Javier Herrera
Publication year
2020
Publication date
2020
Publisher
John Wiley & Sons, Inc.
ISSN
16875265
e-ISSN
16875273
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2412819328
Copyright
Copyright © 2020 Atif Khan et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. http://creativecommons.org/licenses/by/4.0/