Content area
Ensemble techniques are crucial for preprocessing near-infrared (NIR) data, yet effectively integrating information from multiple preprocessing methods remains challenging. While multi-block approaches have been introduced to optimize preprocessing selection, they face issues such as block order dependency, slow optimization, and limited interpretability. This study proposes PFCOVSC—a fast, order-independent, and interpretable ensemble preprocessing strategy integrating multi-block fusion and variable selection. The method combines diverse preprocessed data into a unified matrix and employs the efficient fCovsel technique to select informative variables and construct an ensemble model. Evaluated against SPORT and PROSAC on three public datasets, PFCOVSC substantially reduced prediction root mean squared error (RMSE) on wheat and meat datasets by 17%, 13% and 49%, 20%, respectively, while performing comparably on tablet data. The method also demonstrated advantages in computational speed and model interpretability, offering a promising new direction for preprocessing ensemble strategies.
Details
; Shujat, Ali 2
; Huang Guangzao 2 ; Yuan Leiming 2
; Shi, Wen 2 ; Wang, Xin 3
; Zhang Lechao 3 1 Department of Power Supply and Consumption Technology, Beijing Railway Electrification College, Beijing 102202, China
2 College of Electrical and Electronic Engineering, Wenzhou University, Wenzhou 325000, China
3 School of Robot Engineering, Wenzhou University of Technology, Wenzhou 325000, China