Article
Random Forest Prediction of Mutagenicity from Empirical Physicochemical Descriptors
Purchase the full-text
- PDF/HTML,
figures/images,
references and tables,
(where available)
Abstract

Fast-to-calculate empirical physicochemical descriptors were investigated for their ability to predict mutagenicity (positive or negative Ames test) from the molecular structure. Fast methods are highly desired for the screening of large libraries of compounds. Global molecular descriptors and MOLMAP descriptors of bond properties were used to train random forests. Error percentages as low as 15% and 16% were achieved for an external test set with 472 compounds and for the training set with 4083 structures, respectively. High sensitivity and specificity were observed. Random forests were able to associate meaningful probabilities to the predictions and to explain the predictions in terms of similarities between query structures and compounds in the training set.
Citing Articles
Citation data is made available by participants in CrossRef's Cited-by Linking service. For a more comprehensive list of citations to this article, users are encouraged to perform a search in SciFinder.
This article has been cited by 7 ACS Journal articles (5 most recent appear below).

Self Organizing Maps for Analysis of Polycyclic Aromatic Hydrocarbons 3-Way Data from Spilled Oils
R. Fernández-Varela, M. P. Gómez-Carracedo, D. Ballabio, J. M. Andrade, V. Consonni and R. TodeschiniAnalytical Chemistry2010 82 (10), 4264-4271Self Organizing Maps for Analysis of Polycyclic Aromatic Hydrocarbons 3-Way Data from Spilled Oils
R. Fernández-Varela, M. P. Gómez-Carracedo, D. Ballabio, J. M. Andrade, V. Consonni and R. TodeschiniAnalytical Chemistry2010 82 (10), 4264-4271In this paper, the application of a new method based on self-organizing maps (SOM; termed MOLMAP, molecular map of atom-level properties) to handle 3-way data generated in a monitoring environmental study is presented. The study comprised 50 polycyclic ...

The Ensemble Bridge Algorithm: A New Modeling Tool for Drug Discovery Problems
Mark Culp, Kjell Johnson, George MichailidisJournal of Chemical Information and Modeling2010 50 (2), 309-316The Ensemble Bridge Algorithm: A New Modeling Tool for Drug Discovery Problems
Mark Culp, Kjell Johnson, George MichailidisJournal of Chemical Information and Modeling2010 50 (2), 309-316Ensemble algorithms have been historically categorized into two separate paradigms, boosting and random forests, which differ significantly in the way each ensemble is constructed. Boosting algorithms represent one extreme, where an iterative greedy ...

Current Trends in Ligand-Based Virtual Screening: Molecular Representations, Data Mining Methods, New Application Areas, and Performance Evaluation
Hanna Geppert, Martin Vogt and Jürgen BajorathJournal of Chemical Information and Modeling2010 50 (2), 205-216Current Trends in Ligand-Based Virtual Screening: Molecular Representations, Data Mining Methods, New Application Areas, and Performance Evaluation
Hanna Geppert, Martin Vogt and Jürgen BajorathJournal of Chemical Information and Modeling2010 50 (2), 205-216

Application of Random Forest Approach to QSAR Prediction of Aquatic Toxicity
Pavel G. Polishchuk, Eugene N. Muratov, Anatoly G. Artemenko, Oleg G. Kolumbin, Nail N. Muratov and Victor E. Kuz’minJournal of Chemical Information and Modeling2009 49 (11), 2481-2488Application of Random Forest Approach to QSAR Prediction of Aquatic Toxicity
Pavel G. Polishchuk, Eugene N. Muratov, Anatoly G. Artemenko, Oleg G. Kolumbin, Nail N. Muratov and Victor E. Kuz’minJournal of Chemical Information and Modeling2009 49 (11), 2481-2488This work is devoted to the application of the random forest approach to QSAR analysis of aquatic toxicity of chemical compounds tested on Tetrahymena pyriformis. The simplex representation of the molecular structure approach implemented in HiT QSAR ...

Practical Outcomes of Applying Ensemble Machine Learning Classifiers to High-Throughput Screening (HTS) Data Analysis and Screening
Kirk Simmons, John Kinney, Aaron Owens, Daniel A. Kleier, Karen Bloch, Dave Argentar, Alicia Walsh and Ganesh VaidyanathanJournal of Chemical Information and Modeling2008 48 (11), 2196-2206Practical Outcomes of Applying Ensemble Machine Learning Classifiers to High-Throughput Screening (HTS) Data Analysis and Screening
Kirk Simmons, John Kinney, Aaron Owens, Daniel A. Kleier, Karen Bloch, Dave Argentar, Alicia Walsh and Ganesh VaidyanathanJournal of Chemical Information and Modeling2008 48 (11), 2196-2206Over the years numerous papers have presented the effectiveness of various machine learning methods in analyzing drug discovery biological screening data. The predictive performance of models developed using these methods has traditionally been evaluated ...
Tools
-
Add to Favorites
-
Download Citation
-
Email a Colleague -
Permalink
Order Reprints
Rights & Permissions
Citation Alerts
History
- Published In Issue January 22, 2007
- Received November 28, 2005
Cart

ACS
Network






