Ant colony optimization as a feature selection method in the QSAR modeling of anti-HIV-1 activities of 3-(3,5-dimethylbenzyl)uracil derivatives using MLR, PLS and SVM regression

Mohammad Goodarzi, Matheus P. Freitas, Richard Jensen

Research output: Contribution to journalArticlepeer-review

62 Citations (Scopus)

Abstract

A quantitative structure–activity relationship (QSAR) modeling was carried out for the anti-HIV-1 activities of 3-(3,5-dimethylbenzyl)uracil derivatives. The ant colony optimization (ACO) strategy was used as a feature selection (descriptor selection) and model development method. Modeling of the relationship between selected molecular descriptors and pEC50 data was achieved by linear (multiple linear regression—MLR, and partial least squares regression—PLS) and nonlinear (support-vector machine regression; SVMR) methods. The QSAR models were validated by cross-validation, as well as through the prediction of activities of an external set of compounds. Both linear and nonlinear methods were found to be better than a PLS-based method using forward stepwise selection, resulting in accurate predictions, especially for the SVM regression. The squared correlation coefficients of experimental versus predicted activities for the test set obtained by MLR, PLS and SVMR models using ACO feature selection were 0.942, 0.945 and 0.991, respectively.
Original languageEnglish
Pages (from-to)123-129
Number of pages7
JournalChemometrics and Intelligent Laboratory Systems
Volume89
Issue number2
DOIs
Publication statusPublished - 15 Oct 2009

Keywords

  • QSAR
  • Anti-HIV-1 activities
  • 3-(3,5-Dimethylbenzyl)uracil derivatives
  • Ant colony optimization
  • Linear and nonlinear regression methods

Fingerprint

Dive into the research topics of 'Ant colony optimization as a feature selection method in the QSAR modeling of anti-HIV-1 activities of 3-(3,5-dimethylbenzyl)uracil derivatives using MLR, PLS and SVM regression'. Together they form a unique fingerprint.

Cite this