logo
CONTENT TYPES

Figure 1Loading Img

Exploiting Multiple Descriptor Sets in QSAR Studies

View Author Information
Department of Computer and Mathematical Sciences, University of Toronto Scarborough, Toronto, Ontario M1C 1A4, Canada
Department of Statistics, The University of British Columbia, Vancouver, British Columbia V6T 1Z4, Canada
*E-mail: [email protected]. Phone: +1-604-822-3339. Fax: +1-604-822-6960.
Cite this: J. Chem. Inf. Model. 2016, 56, 3, 501–509
Publication Date (Web):February 23, 2016
https://doi.org/10.1021/acs.jcim.5b00663
Copyright © 2016 American Chemical Society
Article Views
568
Altmetric
-
Citations
LEARN ABOUT THESE METRICS

Article Views are the COUNTER-compliant sum of full text article downloads since November 2008 (both PDF and HTML) across all institutions and individuals. These metrics are regularly updated to reflect usage leading up to the last few days.

Citations are the number of other articles citing this article, calculated by Crossref and updated daily. Find more information about Crossref citation counts.

The Altmetric Attention Score is a quantitative measure of the attention that a research article has received online. Clicking on the donut icon will load a page at altmetric.com with additional details about the score and the social media presence for the given article. Find more information on the Altmetric Attention Score and how the score is calculated.

Read OnlinePDF (1014 KB)

Abstract

Abstract Image

A quantitative structure–activity relationship (QSAR) is a model relating a specific biological response to the chemical structures of compounds. There are many descriptor sets available to characterize chemical structure, raising the question of how to choose among them or how to use all of them for training a QSAR model. Making efficient use of all sets of descriptors is particularly problematic when active compounds are rare among the assay response data. We consider various strategies to make use of the richness of multiple descriptor sets when assay data are poor in active compounds. Comparisons are made using data from four bioassays, each with five sets of molecular descriptors. The recommended method takes all available descriptors from all sets and uses an algorithm to partition them into groups called phalanxes. Distinct statistical models are trained, each based on only the descriptors in one phalanx, and the models are then averaged in an ensemble of models. By giving the descriptors a chance to contribute in different models, the recommended method uses more of the descriptors in model averaging. This results in better ranking of active compounds to identify a shortlist of drug candidates for development.

Cited By


This article is cited by 6 publications.

  1. Xiaoyi Zhang, Wenling Niu, Tang Tang, Chengfei Hou, Yajie Guo, Ren Kong. A Strategy to Find Novel Candidate DKAs Inhibitors Using Modified QSAR Model with Favorable Druggability Properties. Chemical Research in Chinese Universities 2019, 35 (6) , 1111-1118. https://doi.org/10.1007/s40242-019-9183-5
  2. Hwanho Choi, Hongsuk Kang, Kee-Choo Chung, Hwangseo Park. Development and application of a comprehensive machine learning program for predicting molecular biochemical and pharmacological properties. Physical Chemistry Chemical Physics 2019, 21 (9) , 5189-5199. https://doi.org/10.1039/C8CP07002D
  3. Govindan Subramanian, Gennady Poda. In silico ligand-based modeling of h BACE-1 inhibitors. Chemical Biology & Drug Design 2018, 91 (3) , 817-827. https://doi.org/10.1111/cbdd.13147
  4. Anne Tromelin, Claire Chabanet, Karine Audouze, Florian Koensgen, Elisabeth Guichard. Multivariate statistical analysis of a large odorants database aimed at revealing similarities and links between odorants and odors. Flavour and Fragrance Journal 2018, 33 (1) , 106-126. https://doi.org/10.1002/ffj.3430
  5. Miao Qi, Ting Wang, Yugen Yi, Na Gao, Jun Kong, Jianzhong Wang. Joint L2,1 Norm and Fisher Discrimination Constrained Feature Selection for Rational Synthesis of Microporous Aluminophosphates. Molecular Informatics 2017, 36 (4) , 1600076. https://doi.org/10.1002/minf.201600076
  6. Igor I. Baskin, David Winkler, Igor V. Tetko. A renaissance of neural networks in drug discovery. Expert Opinion on Drug Discovery 2016, 11 (8) , 785-795. https://doi.org/10.1080/17460441.2016.1201262

Pair your accounts.

Export articles to Mendeley

Get article recommendations from ACS based on references in your Mendeley library.

Pair your accounts.

Export articles to Mendeley

Get article recommendations from ACS based on references in your Mendeley library.

You’ve supercharged your research process with ACS and Mendeley!

STEP 1:
Click to create an ACS ID

Please note: If you switch to a different device, you may be asked to login again with only your ACS ID.

Please note: If you switch to a different device, you may be asked to login again with only your ACS ID.

Please note: If you switch to a different device, you may be asked to login again with only your ACS ID.

OOPS

You have to login with your ACS ID befor you can login with your Mendeley account.

MENDELEY PAIRING EXPIRED
Your Mendeley pairing has expired. Please reconnect

This website uses cookies to improve your user experience. By continuing to use the site, you are accepting our use of cookies. Read the ACS privacy policy.

CONTINUE