Full text

Turn on search term navigation

Copyright © 2019 Xue-Zhen Hong et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. http://creativecommons.org/licenses/by/4.0/

Abstract

This work presents a reliable approach to trace teas’ geographical origins despite changes in teas caused by different harvest years. A total of 1447 tea samples collected from various areas in 2014 (660 samples) and 2015 (787 samples) were detected by FT-NIR. Seven classifiers trained on the 2014 dataset all succeeded to trace origins of samples collected in 2014; however, they all failed to predict origins for the 2015 samples due to different data distributions and imbalanced dataset. Three outlier detection based undersampling approaches—one-class SVM (OC-SVM), isolation forest and elliptic envelope—were then proposed; as a result, the highest macro average recall (MAR) for the 2015 dataset was improved from 56.86% to 73.95% (by SVM). A model updating approach was also applied, and the prediction MAR was significantly improved with increase in the updating rate. The best MAR (90.31%) was first achieved by the OC-SVM combined SVM classifier at a 50% rate.

Details

Title
Tracing Geographical Origins of Teas Based on FT-NIR Spectroscopy: Introduction of Model Updating and Imbalanced Data Handling Approaches
Author
Xue-Zhen, Hong 1 ; Xian-Shu Fu 2 ; Zheng-Liang, Wang 2 ; Zhang, Li 3 ; Xiao-Ping, Yu 2 ; Zi-Hong, Ye 2   VIAFID ORCID Logo 

 College of Quality & Safety Engineering, China Jiliang University, Xueyuan Street, Xiasha Higher Education District, Hangzhou 310018, China; BioCircuits Institute, University of California, La Jolla, San Diego, CA 92093, USA 
 Zhejiang Provincial Key Laboratory of Biometrology and Inspection & Quarantine, College of Life Sciences, China Jiliang University, Xueyuan Street, Xiasha Higher Education District, Hangzhou 310018, China 
 Department of Computer Science, Zhejiang University, Hangzhou 310027, China 
Editor
Andrey Bogomolov
Publication year
2019
Publication date
2019
Publisher
John Wiley & Sons, Inc.
ISSN
20908865
e-ISSN
20908873
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2166678593
Copyright
Copyright © 2019 Xue-Zhen Hong et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. http://creativecommons.org/licenses/by/4.0/