Content area

Abstract

Ensemble techniques are crucial for preprocessing near-infrared (NIR) data, yet effectively integrating information from multiple preprocessing methods remains challenging. While multi-block approaches have been introduced to optimize preprocessing selection, they face issues such as block order dependency, slow optimization, and limited interpretability. This study proposes PFCOVSC—a fast, order-independent, and interpretable ensemble preprocessing strategy integrating multi-block fusion and variable selection. The method combines diverse preprocessed data into a unified matrix and employs the efficient fCovsel technique to select informative variables and construct an ensemble model. Evaluated against SPORT and PROSAC on three public datasets, PFCOVSC substantially reduced prediction root mean squared error (RMSE) on wheat and meat datasets by 17%, 13% and 49%, 20%, respectively, while performing comparably on tablet data. The method also demonstrated advantages in computational speed and model interpretability, offering a promising new direction for preprocessing ensemble strategies.

Details

1009240
Title
Pre-Processing Ensemble Modeling Based on Faster Covariate Selection Calibration for Near-Infrared Spectroscopy
Author
Wu, Yonghong 1 ; Zhou Yukun 2 ; Chen, Xiaojing 2 ; Xie Zhonghao 2   VIAFID ORCID Logo  ; Shujat, Ali 2   VIAFID ORCID Logo  ; Huang Guangzao 2 ; Yuan Leiming 2   VIAFID ORCID Logo  ; Shi, Wen 2 ; Wang, Xin 3   VIAFID ORCID Logo  ; Zhang Lechao 3 

 Department of Power Supply and Consumption Technology, Beijing Railway Electrification College, Beijing 102202, China 
 College of Electrical and Electronic Engineering, Wenzhou University, Wenzhou 325000, China 
 School of Robot Engineering, Wenzhou University of Technology, Wenzhou 325000, China 
Publication title
Volume
15
Issue
21
First page
11325
Number of pages
16
Publication year
2025
Publication date
2025
Publisher
MDPI AG
Place of publication
Basel
Country of publication
Switzerland
Publication subject
e-ISSN
20763417
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-10-22
Milestone dates
2025-07-10 (Received); 2025-09-24 (Accepted)
Publication history
 
 
   First posting date
22 Oct 2025
ProQuest document ID
3271549874
Document URL
https://www.proquest.com/scholarly-journals/pre-processing-ensemble-modeling-based-on-faster/docview/3271549874/se-2?accountid=208611
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-11-13
Database
ProQuest One Academic