Full Text

Turn on search term navigation

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Technologies have driven big data collection across many fields, such as genomics and business intelligence. This results in a significant increase in variables and data points (observations) collected and stored. Although this presents opportunities to better model the relationship between predictors and the response variables, this also causes serious problems during data analysis, one of which is the multicollinearity problem. The two main approaches used to mitigate multicollinearity are variable selection methods and modified estimator methods. However, variable selection methods may negate efforts to collect more data as new data may eventually be dropped from modeling, while recent studies suggest that optimization approaches via machine learning handle data with multicollinearity better than statistical estimators. Therefore, this study details the chronological developments to mitigate the effects of multicollinearity and up-to-date recommendations to better mitigate multicollinearity.

Details

Title
Mitigating the Multicollinearity Problem and Its Machine Learning Approach: A Review
Author
Jireh Yi-Le Chan 1   VIAFID ORCID Logo  ; Hong Leow, Steven Mun 1 ; Khean Thye Bea 1 ; Cheng, Wai Khuen 2   VIAFID ORCID Logo  ; Phoong, Seuk Wai 3   VIAFID ORCID Logo  ; Zeng-Wei, Hong 4   VIAFID ORCID Logo  ; Yen-Lin, Chen 5   VIAFID ORCID Logo 

 Faculty of Business and Finance, Universiti Tunku Abdul Rahman, Kampar 31900, Malaysia; [email protected] (S.M.H.L.); [email protected] (K.T.B.) 
 Faculty of Information and Communication Technology, Universiti Tunku Abdul Rahman, Kampar 31900, Malaysia; [email protected] 
 Department of Management, Faculty of Business and Economics, Universiti Malaya, Kuala Lumpur 50603, Malaysia; [email protected] 
 Department of Information Engineering and Computer Science, Feng Chia University, Taichung 407102, Taiwan; [email protected] 
 Department of Computer Science and Information Engineering, National Taipei University of Technology, Taipei 106344, Taiwan 
First page
1283
Publication year
2022
Publication date
2022
Publisher
MDPI AG
e-ISSN
22277390
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2652997058
Copyright
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.