Content area

Abstract

Near‐infrared (NIR) spectroscopy data encounter challenges in data processing such as peak overlapping, information redundancy, and background or noise, which complicate the evaluation of weak differences among similar samples. Therefore, accurately identifying these differences and assessing similarities are essential in practical applications for sample classification and further replacement of raw materials in the product formulation. In this work, 32 data preprocessing strategies of NIR data were systematically combined for comprehensive comparison, and 11 methods for similarity analysis were evaluated to attain optimal performance. Using the rationality of similarity evaluation as the assessment criterion, the combination of NIR data pretreatment methods of “standard normal variate (SNV) + first‐order derivative by Savitzky–Golay (1D/SG) + maximum–minimum scaling (MMS) + spectral similarity by combinatorial strategy (SS/CS)” is ultimately preferred as the most effective combination for similarity evaluation. It uses SNV transformation, 1D/SG, MMS, and scattering correction to eliminate the scattering effect, enhance the signal‐to‐noise ratio (SNR) of the distinction of overlapping peaks, and improve data comparability. After this, the widely used methods for similarity evaluation were employed for comprehensive analysis and comparison of the rationality, such as Euclidean distance, correlation coefficient, and divergence information. The evaluation strategy proposed in this work can effectively distinguish the difference among the tobacco samples existing in 10 different categories. The similarity among typical samples in the same class is above 0.9, while the values in different classes are below 0.7. In real applications for method validation, recognition precision of tobacco samples with blending of interfering mixtures reaches 5%, which is conducted using complex tobacco materials for formulation replacement and optimization. The satisfactory results introduce robust and CS that outperforms traditional single‐method approaches to resolve weak spectral differences through real‐world tobacco formulation replacement applications. It can be widely used in the areas related to NIR for similarity evaluation, such as pharmaceuticals, food quality control, and environmental monitoring.

Details

1009240
Business indexing term
Title
Comprehensive Comparison of Similarity Evaluation and Discovery of Weak Spectral Variations of Near‐Infrared Spectroscopy for Tobacco Formulation Replacement
Author
Zhang, Yipeng 1   VIAFID ORCID Logo  ; Jiang, Hui 2   VIAFID ORCID Logo  ; Ling, Jun 1   VIAFID ORCID Logo  ; Wen, Liliang 2   VIAFID ORCID Logo  ; Yan, Keliang 1   VIAFID ORCID Logo  ; Chen, Aiming 2   VIAFID ORCID Logo  ; Zeng, Zhongda 3   VIAFID ORCID Logo  ; Wang, Miaomiao 4   VIAFID ORCID Logo  ; Yang, Qianxu 1   VIAFID ORCID Logo 

 R & D Center, , China Tobacco Yunnan Industrial Co. Ltd., , Kunming, , , Yunnan, China, ynzy-tobacco.com 
 Department of R & D, , Dalian Chem Data Solution Information Technology Co. Ltd., , Dalian, , , Liaoning, China 
 College of Environmental and Chemical Engineering, , Dalian University, , Dalian, , , Liaoning, China, dlu.edu.cn 
 Xinjiang Science & Technology Resource Sharing Service Center, , Xinjiang Key Laboratory of Featured Functional Food Nutrition and Safety Testing, , Kexue North Road 374, Urumqi, , , Xinjiang Uygur, China 
Volume
2025
Issue
1
Number of pages
14
Publication year
2025
Publication date
2025
Publisher
John Wiley & Sons, Inc.
Place of publication
New York
Country of publication
United States
ISSN
16878760
e-ISSN
16878779
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-11-30
Milestone dates
2025-09-04 (manuscriptRevised); 2025-11-30 (publishedOnlineFinalForm); 2025-02-14 (manuscriptReceived); 2025-10-21 (manuscriptAccepted)
Publication history
 
 
   First posting date
30 Nov 2025
ProQuest document ID
3276750152
Document URL
https://www.proquest.com/scholarly-journals/comprehensive-comparison-similarity-evaluation/docview/3276750152/se-2?accountid=208611
Copyright
© 2025. This work is published under http://creativecommons.org/licenses/by/4.0/ (the "License"). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-12-01
Database
ProQuest One Academic