Abstract

General machine learning requires a large number of training samples, meaning a lot of manpower and material resources. To solve this problem, active learning and its algorithms are often used. Active learning is a form of semi-supervised machine learning where the algorithm can choose which data it wants to learn from and then use the smallest and most effective labeled data set to make predictions or classifications. This article will show an example of using active learning to predict red wine quality. The predictive modeling approach the author chose was the K-Nearest Neighbor and the active learning algorithm was ranked batch-mode sampling. By observing the learning curve, the author found that generally the prediction accuracy of active learning would increase as the number of iterations increased. The author compared the experiment with another case using classic iris flower data set, and concluded that the prediction accuracy of active learning for different data sets depends on many factors, such as the correlation between the independent and dependant variables, the size of the data set and the number of iterations.

Details

Title
Red wine quality prediction through active learning
Author
Zhou Tingwei 1 

 University of Electronic Science and Technology of China, Chengdu, Sichuan, 611731, China 
Publication year
2021
Publication date
Jul 2021
Publisher
IOP Publishing
ISSN
17426588
e-ISSN
17426596
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2550677015
Copyright
© 2021. This work is published under http://creativecommons.org/licenses/by/3.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.