Comparison of Cellular Morphological Descriptors and Molecular Fingerprints for the Prediction of Cytotoxicity- and Proliferation-Related Assays
- Srijit Seal, Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
- Hongbin Yang, Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
- Luis Vollmers, Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
- Andreas Bender* (Email: [email protected]), Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
Abstract

Cell morphology features, such as those from the Cell Painting assay, can be generated at relatively low cost and represent versatile biological descriptors of a system and thereby of compound response. In this study, we explored cell morphology descriptors and molecular fingerprints, separately and in combination, for the prediction of cytotoxicity- and proliferation-related in vitro assay endpoints. We selected 135 compounds from the MoleculeNet ToxCast benchmark data set that were annotated with Cell Painting readouts; the relatively small size of the data set is due to the overlap of required annotations. We trained Random Forest classification models with nested cross-validation on Cell Painting descriptors, Morgan and ErG fingerprints, and their combinations. Under leave-one-cluster-out cross-validation (with clusters based on physicochemical descriptors), models using Cell Painting descriptors achieved higher average performance over all assays (balanced accuracy (BA) of 0.65, Matthews correlation coefficient (MCC) of 0.28, and AUC-ROC of 0.71) than models using ErG fingerprints alone (BA 0.55, MCC 0.09, and AUC-ROC 0.60) or Morgan fingerprints alone (BA 0.54, MCC 0.06, and AUC-ROC 0.56). Under random shuffle splits, combining Cell Painting descriptors with ErG and Morgan fingerprints further improved balanced accuracy on average by 8.9% (in 9 out of 12 assays) and 23.4% (in 8 out of 12 assays) compared to using only ErG and Morgan fingerprints, respectively. Regarding feature importance, Cell Painting descriptors related to nuclei texture, granularity of cells, and cytoplasm, as well as cell neighbors and radial distributions, were identified as contributing most, which is plausible given the endpoints considered.
We conclude that cell morphological descriptors contain complementary information to molecular fingerprints which can be used to improve the performance of predictive cytotoxicity models, in particular in areas of novel structural space.
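
For readers unfamiliar with the evaluation terms used above, the following is a minimal sketch of how balanced accuracy, the Matthews correlation coefficient, and a leave-one-cluster-out split can be computed. It is not the authors' code: the function names, the toy labels, and the cluster assignments are all hypothetical and chosen only for illustration, and the sketch uses the Python standard library rather than any particular machine learning package.

```python
from math import sqrt


def confusion_counts(y_true, y_pred):
    """Return (tp, tn, fp, fn) for binary labels coded as 0 (inactive) / 1 (active)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return tp, tn, fp, fn


def balanced_accuracy(y_true, y_pred):
    """Mean of sensitivity and specificity; robust to class imbalance."""
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return (sensitivity + specificity) / 2


def matthews_corrcoef(y_true, y_pred):
    """Matthews correlation coefficient; returns 0.0 when it is undefined."""
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


def leave_one_cluster_out(cluster_ids):
    """Yield (held_out_cluster, train_indices, test_indices).

    Every compound of one cluster is held out together, so the test
    compounds are never from a cluster seen during training.
    """
    for held_out in sorted(set(cluster_ids)):
        train = [i for i, c in enumerate(cluster_ids) if c != held_out]
        test = [i for i, c in enumerate(cluster_ids) if c == held_out]
        yield held_out, train, test


# Hypothetical toy data: 8 compounds, 3 clusters, binary assay outcome.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 0, 0, 1]
clusters = [0, 0, 1, 1, 1, 2, 2, 2]

ba = balanced_accuracy(y_true, y_pred)
mcc = matthews_corrcoef(y_true, y_pred)
folds = list(leave_one_cluster_out(clusters))

if __name__ == "__main__":
    print(f"BA={ba:.3f} MCC={mcc:.3f} folds={len(folds)}")
```

In a real workflow, a classifier would be trained on the `train` indices of each fold and scored on the `test` indices, and the per-fold metrics would then be averaged. Holding out whole clusters is what makes the estimate informative about structurally novel compounds, in contrast to random shuffle splits.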