Abstract

Although Clustering Algorithm Based on Sparse Feature Vector (CABOSFV) and its related algorithms are efficient for high dimensional sparse data clustering, there exist several imperfections. Such imperfections as subjective parameter designation and order sensibility of clustering process would eventually aggravate the time complexity and quality of the algorithm. This paper proposes a parameter adjustment method of Bidirectional CABOSFV for optimization purpose. By optimizing Parameter Vector (PV) and Parameter Selection Vector (PSV) with the objective function of clustering validity, an improved Bidirectional CABOSFV algorithm using simulated annealing is proposed, which circumvents the requirement of initial parameter determination. The experiments on UCI data sets show that the proposed algorithm, which can perform multi-adjustment clustering, has a higher accurateness than single adjustment clustering, along with a decreased time complexity through iterations.

Details

Title
Improved Bidirectional CABOSFV Based on Multi-Adjustment Clustering and Simulated Annealing
Author
Yang, Minghan 1 ; Gao, Xuedong 1 ; Li, Ling 2 

 Donlinks School of Economics and Management, University of Science and Technology Beijing (USTB), Beijing, China 
 School of Business, Anhui University, Hefei, China 
Pages
27-42
Publication year
2016
Publication date
2016
Publisher
De Gruyter Poland
ISSN
13119702
e-ISSN
13144081
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3154903808
Copyright
© 2016. This work is published under http://creativecommons.org/licenses/by-nc-nd/3.0 (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.