Content area

Abstract

Currently machine learning approaches used in Quantitative Structure Activity Relationship (QSAR) model generation impose restrictions and/or make assumptions on how the training set descriptors correlate with a target activity. kScore has been developed as the first machine learning approach that does not require the training data to conform to a defined kernel, accommodates uneven data point distributions in the descriptor space, and optimizes the weight of each dimension in the descriptor space in order to identify the descriptors most relevant to the target property. The ability of kScore to adapt to virtually any correlation makes it essential that generalization terms be included to inhibit overtraining. The Structural Risk Minimization principle and the linear ε-insensitive loss terms have been added to the kScore optimization function. The resulting kScore algorithm has proven to be quite universal across several datasets and either produces results similar to or outperforms the most predictive machine learning algorithms tested, such as SVM, kNN, Recursive Partitioning, Neural Networks, Gaussian Process, and the Bayesian Classifier.[PUBLICATION ABSTRACT]

Details

Title
kScore: a novel machine learning approach that is not dependent on the data structure of the training set
Author
Oloff, Scott; Muegge, Ingo
Pages
87-95
Publication year
2007
Publication date
Jan 2007
Publisher
Springer Nature B.V.
ISSN
0920654X
e-ISSN
15734951
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
737242968
Copyright
Springer Science+Business Media, LLC 2007