Full text

Turn on search term navigation

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Text clustering is one of the efficient unsupervised learning techniques used to partition a huge number of text documents into a subset of clusters. In which, each cluster contains similar documents and the clusters contain dissimilar text documents. Nature-inspired optimization algorithms have been successfully used to solve various optimization problems, including text document clustering problems. In this paper, a comprehensive review is presented to show the most related nature-inspired algorithms that have been used in solving the text clustering problem. Moreover, comprehensive experiments are conducted and analyzed to show the performance of the common well-know nature-inspired optimization algorithms in solving the text document clustering problems including Harmony Search (HS) Algorithm, Genetic Algorithm (GA), Particle Swarm Optimization (PSO) Algorithm, Ant Colony Optimization (ACO), Krill Herd Algorithm (KHA), Cuckoo Search (CS) Algorithm, Gray Wolf Optimizer (GWO), and Bat-inspired Algorithm (BA). Seven text benchmark datasets are used to validate the performance of the tested algorithms. The results showed that the performance of the well-known nurture-inspired optimization algorithms almost the same with slight differences. For improvement purposes, new modified versions of the tested algorithms can be proposed and tested to tackle the text clustering problems.

Details

Title
Nature-Inspired Optimization Algorithms for Text Document Clustering—A Comprehensive Analysis
Author
Abualigah, Laith 1   VIAFID ORCID Logo  ; Gandomi, Amir H 2   VIAFID ORCID Logo  ; Mohamed Abd Elaziz 3   VIAFID ORCID Logo  ; Hussien, Abdelazim G 4   VIAFID ORCID Logo  ; Khasawneh, Ahmad M 1   VIAFID ORCID Logo  ; Alshinwan, Mohammad 1   VIAFID ORCID Logo  ; Houssein, Essam H 5   VIAFID ORCID Logo 

 Faculty of Computer Sciences and Informatics, Amman Arab University, Amman 11953, Jordan; [email protected] (L.A.); [email protected] (A.M.K.); [email protected] (M.A.) 
 Faculty of Engineering and Information Technology, University of Technology Sydney, Ultimo, NSW 2007, Australia 
 Department of Mathematics, Faculty of Science, Zagazig University, Zagazig 44519, Egypt; [email protected] 
 Faculty of Science, Fayoum University, Faiyum 63514, Egypt; [email protected] 
 Faculty of Computers and Information, Minia University, Minia 61519, Egypt; [email protected] 
First page
345
Publication year
2020
Publication date
2020
Publisher
MDPI AG
e-ISSN
19994893
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2524376448
Copyright
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.