Content area

Abstract

A central problem in forming accurate regression equations in QSAR studies isthe selection of appropriate descriptors for the compounds under study. Wedescribe a novel procedure for using inductive logic programming (ILP) todiscover new indicator variables (attributes) for QSAR problems, and show thatthese improve the accuracy of the derived regression equations. ILP techniqueshave previously been shown to work well on drug design problems where thereis a large structural component or where clear comprehensible rules arerequired. However, ILP techniques have had the disadvantage of only being ableto make qualitative predictions (e.g. active, inactive) and not to predictreal numbers (regression). We unify ILP and linear regression techniques togive a QSAR method that has the strength of ILP at describing stericstructure, with the familiarity and power of linear regression. We evaluatedthe utility of this new QSAR technique by examining the prediction ofbiological activity with and without the addition of new structural indicatorvariables formed by ILP. In three out of five datasets examined the additionof ILP variables produced statistically better results (P < 0.01) over theoriginal description. The new ILP variables did not increase the overallcomplexity of the derived QSAR equations and added insight into possiblemechanisms of action. We conclude that ILP can aid in the process of drugdesign.[PUBLICATION ABSTRACT]

Details

Title
The discovery of indicator variables for QSAR using inductive logic programming
Author
King, Ross D; Srinivasan, Ashwin
Pages
571-80
Publication year
1997
Publication date
Nov 1997
Publisher
Springer Nature B.V.
ISSN
0920654X
e-ISSN
15734951
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
737866212
Copyright
Kluwer Academic Publishers 1997