CGRdb2.0: A Python Database Management System for Molecules, Reactions, and Chemical DataClick to copy article linkArticle link copied!
- Timur GimadievTimur GimadievInstitute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Kita 21 Nishi 10, Kita-ku, 001-0021 Sapporo, JapanMore by Timur Gimadiev
- Ramil NugmanovRamil NugmanovLaboratory of Chemoinformatics and Molecular Modeling, Butlerov Institute of Chemistry, Kazan Federal University, 18, Kremlyovskaya str., 420008 Kazan, RussiaMore by Ramil Nugmanov
- Aigul KhakimovaAigul KhakimovaJSC ≪BIOCAD≫, Petrodvortsoviy District, Strelna, Svyazi st., Bld. 34, Liter A, 198515 St. Petersburg, RussiaMore by Aigul Khakimova
- Adeliya FatykhovaAdeliya FatykhovaLaboratory of Chemoinformatics and Molecular Modeling, Butlerov Institute of Chemistry, Kazan Federal University, 18, Kremlyovskaya str., 420008 Kazan, RussiaMore by Adeliya Fatykhova
- Timur Madzhidov*Timur Madzhidov*Email: [email protected]Laboratory of Chemoinformatics and Molecular Modeling, Butlerov Institute of Chemistry, Kazan Federal University, 18, Kremlyovskaya str., 420008 Kazan, RussiaMore by Timur Madzhidov
- Pavel Sidorov*Pavel Sidorov*Email: [email protected]Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Kita 21 Nishi 10, Kita-ku, 001-0021 Sapporo, JapanMore by Pavel Sidorov
- Alexandre Varnek*Alexandre Varnek*Email: [email protected]Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Kita 21 Nishi 10, Kita-ku, 001-0021 Sapporo, JapanLaboratory of Chemoinformatics, UMR 7140 CNRS, University of Strasbourg, 4, Blaise Pascal Str., 67081 Strasbourg, FranceMore by Alexandre Varnek
Abstract

This work introduces CGRdb2.0─an open-source database management system for molecules, reactions, and chemical data. CGRdb2.0 is a Python package connecting to a PostgreSQL database that enables native searches for molecules and reactions without complicated SQL syntax. The library provides out-of-the-box implementations for similarity and substructure searches for molecules, as well as similarity and substructure searches for reactions in two ways─based on reaction components and based on the Condensed Graph of Reaction approach, the latter significantly accelerating the performance. In benchmarking studies with the RDKit database cartridge, we demonstrate that CGRdb2.0 performs searches faster for smaller data sets, while allowing for interactive access to the retrieved data.
Cited By
Smart citations by scite.ai include citation statements extracted from the full text of the citing article. The number of the statements may be higher than the number of citations provided by ACS Publications if one paper cites another multiple times or lower if scite has not yet processed some of the citing articles.
This article is cited by 8 publications.
- Mohammad Ali, Yuta Mizuno, Seiji Akiyama, Yuuya Nagata, Tamiki Komatsuzaki. Enumeration Approach to Atom-to-Atom Mapping Accelerated by Ising Computing. Journal of Chemical Information and Modeling 2025, Article ASAP.
- Ramil Nugmanov. PaCh (Packed Chemicals): Computationally Effective Binary Format for Chemical Structure Encoding. Journal of Chemical Information and Modeling 2024, 64
(8)
, 3173-3179. https://doi.org/10.1021/acs.jcim.3c01720
- Ramil Nugmanov, Natalia Dyubankova, Andrey Gedich, Joerg Kurt Wegner. Bidirectional Graphormer for Reactivity Understanding: Neural Network Trained to Reaction Atom-to-Atom Mapping Task. Journal of Chemical Information and Modeling 2022, 62
(14)
, 3307-3315. https://doi.org/10.1021/acs.jcim.2c00344
- Matthias Rarey, Marc C. Nicklaus, Wendy Warr. Special Issue on Reaction Informatics and Chemical Space. Journal of Chemical Information and Modeling 2022, 62
(9)
, 2009-2010. https://doi.org/10.1021/acs.jcim.2c00390
- Eddie T. C. Huang, Jai-Sing Yang, Ken Y. K. Liao, Warren C. W. Tseng, C. K. Lee, Michelle Gill, Colin Compas, Simon See, Fuu-Jen Tsai. Predicting blood–brain barrier permeability of molecules with a large language model and machine learning. Scientific Reports 2024, 14
(1)
https://doi.org/10.1038/s41598-024-66897-y
- Fedor V. Ryzhkov, Yuliya E. Ryzhkova, Michail N. Elinson. Python tools for structural tasks in chemistry. Molecular Diversity 2024, 11 https://doi.org/10.1007/s11030-024-10889-7
- Pavel Sidorov, Nobuya Tsuji. A Primer on 2D Descriptors in Selectivity Modeling for Asymmetric Catalysis. Chemistry – A European Journal 2024, 30
(10)
https://doi.org/10.1002/chem.202302837
- Lung-Yi Chen, Yi-Pei Li. Machine learning-guided strategies for reaction conditions design and optimization. Beilstein Journal of Organic Chemistry 2024, 20 , 2476-2492. https://doi.org/10.3762/bjoc.20.212
Article Views are the COUNTER-compliant sum of full text article downloads since November 2008 (both PDF and HTML) across all institutions and individuals. These metrics are regularly updated to reflect usage leading up to the last few days.
Citations are the number of other articles citing this article, calculated by Crossref and updated daily. Find more information about Crossref citation counts.
The Altmetric Attention Score is a quantitative measure of the attention that a research article has received online. Clicking on the donut icon will load a page at altmetric.com with additional details about the score and the social media presence for the given article. Find more information on the Altmetric Attention Score and how the score is calculated.