Graph Comparison of Molecular Crystals in Band Gap Prediction Using Neural NetworksClick to copy article linkArticle link copied!
- Takuya Taniguchi*Takuya Taniguchi*Email: [email protected]Center for Data Science, Waseda University, 1-6-1 Nishiwaseda, Shinjuku-ku, Tokyo 169-8050, JapanMore by Takuya Taniguchi
- Mayuko HosokawaMayuko HosokawaDepartment of Advanced Science and Engineering, Graduate School of Advanced Science and Engineering, Waseda University, 3-4-1 Okubo, Shinjuku-Ku, Tokyo 169-8555, JapanMore by Mayuko Hosokawa
- Toru AsahiToru AsahiDepartment of Advanced Science and Engineering, Graduate School of Advanced Science and Engineering, Waseda University, 3-4-1 Okubo, Shinjuku-Ku, Tokyo 169-8555, JapanMore by Toru Asahi
Abstract
In material informatics, the representation of the material structure is fundamentally essential to obtaining better prediction results, and graph representation has attracted much attention in recent years. Molecular crystals can be graphically represented in molecular and crystal representations, but a comparison of which representation is more effective has not been examined. In this study, we compared the prediction accuracy between molecular and crystal graphs for band gap prediction. The results showed that the prediction accuracies using crystal graphs were better than those obtained using molecular graphs. While this result is not surprising, error analysis quantitatively evaluated that the error of the crystal graph was 0.4 times that of the molecular graph with moderate correlation. The novelty of this study lies in the comparison of molecular crystal representations and in the quantitative evaluation of the contribution of crystal structures to the band gap.
This publication is licensed under
License Summary*
You are free to share(copy and redistribute) this article in any medium or format within the parameters below:
Creative Commons (CC): This is a Creative Commons license.
Attribution (BY): Credit must be given to the creator.
Non-Commercial (NC): Only non-commercial uses of the work are permitted.
No Derivatives (ND): Derivative works may be created for non-commercial purposes, but sharing is prohibited.
*Disclaimer
This summary highlights only some of the key features and terms of the actual license. It is not a license and has no legal value. Carefully review the actual license before using these materials.
License Summary*
You are free to share(copy and redistribute) this article in any medium or format within the parameters below:
Creative Commons (CC): This is a Creative Commons license.
Attribution (BY): Credit must be given to the creator.
Non-Commercial (NC): Only non-commercial uses of the work are permitted.
No Derivatives (ND): Derivative works may be created for non-commercial purposes, but sharing is prohibited.
*Disclaimer
This summary highlights only some of the key features and terms of the actual license. It is not a license and has no legal value. Carefully review the actual license before using these materials.
License Summary*
You are free to share(copy and redistribute) this article in any medium or format within the parameters below:
Creative Commons (CC): This is a Creative Commons license.
Attribution (BY): Credit must be given to the creator.
Non-Commercial (NC): Only non-commercial uses of the work are permitted.
No Derivatives (ND): Derivative works may be created for non-commercial purposes, but sharing is prohibited.
*Disclaimer
This summary highlights only some of the key features and terms of the actual license. It is not a license and has no legal value. Carefully review the actual license before using these materials.
License Summary*
You are free to share(copy and redistribute) this article in any medium or format within the parameters below:
Creative Commons (CC): This is a Creative Commons license.
Attribution (BY): Credit must be given to the creator.
Non-Commercial (NC): Only non-commercial uses of the work are permitted.
No Derivatives (ND): Derivative works may be created for non-commercial purposes, but sharing is prohibited.
*Disclaimer
This summary highlights only some of the key features and terms of the actual license. It is not a license and has no legal value. Carefully review the actual license before using these materials.
License Summary*
You are free to share(copy and redistribute) this article in any medium or format within the parameters below:
Creative Commons (CC): This is a Creative Commons license.
Attribution (BY): Credit must be given to the creator.
Non-Commercial (NC): Only non-commercial uses of the work are permitted.
No Derivatives (ND): Derivative works may be created for non-commercial purposes, but sharing is prohibited.
*Disclaimer
This summary highlights only some of the key features and terms of the actual license. It is not a license and has no legal value. Carefully review the actual license before using these materials.
Introduction
Results and Discussion
Data Set and Structure Representations
Regression on Band Gap
representation | GNN | R2 | MAE | RMSE |
---|---|---|---|---|
MolGraph | SchNet | 0.732 (0.004) | 0.401 (0.002) | 0.527 (0.004) |
MEGNet | 0.776 (0.006) | 0.360 (0.003) | 0.482 (0.007) | |
CGCNN | 0.754 (0.011) | 0.375 (0.011) | 0.505 (0.011) | |
MolGraphH | SchNet | 0.693 (0.006) | 0.427 (0.005) | 0.564 (0.006) |
MEGNet | 0.758 (0.014) | 0.379 (0.013) | 0.501 (0.015) | |
CGCNN | 0.750 (0.007) | 0.385 (0.007) | 0.509 (0.007) | |
simple CrystGraph | SchNet | 0.862 (0.008) | 0.279 (0.009) | 0.378 (0.011) |
MEGNet | 0.892 (0.006) | 0.249 (0.014) | 0.335 (0.010) | |
CGCNN | 0.858 (0.007) | 0.283 (0.005) | 0.383 (0.009) | |
medium CrystGraph | SchNet | 0.833 (0.055) | 0.308 (0.055) | 0.412 (0.066) |
MEGNet | 0.895 (0.004) | 0.240 (0.005) | 0.329 (0.006) | |
CGCNN | 0.856 (0.010) | 0.280 (0.014) | 0.386 (0.013) | |
complicated CrystGraph | SchNet | 0.810 (0.084) | 0.331 (0.072) | 0.437 (0.094) |
MEGNet | 0.879 (0.007) | 0.259 (0.006) | 0.354 (0.010) | |
CGCNN | 0.870 (0.004) | 0.277 (0.004) | 0.367 (0.006) |
Each value is the average, and the bracket represents standard deviation.
Generalization Ability and Large Screening
Conclusions
Methods
Data Collecting
Molecular and Crystal Representations
Graph Neural Networks
Supporting Information
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acsomega.3c05224.
Hyperparameters, data set for generalization test, data distribution (PDF)
Terms & Conditions
Most electronic Supporting Information files are available without a subscription to ACS Web Editions. Such files may be downloaded by article for research use (if there is a public use license linked to the relevant article, that license may permit other uses). Permission may be obtained from ACS for other uses through requests via the RightsLink permission system: http://pubs.acs.org/page/copyright/permissions.html.
Acknowledgments
This work was partly executed under the cooperation of organization between Waseda University and ENEOS Corporation. This work was financially supported by the JSPS Grant-in-Aid (22K14747) and Waseda University Grant for Special Research Projects (2020C-530, 2021C-404, and 2022C-313).
References
This article references 35 other publications.
- 1Ramprasad, R.; Batra, R.; Pilania, G.; Mannodi-Kanakkithodi, A.; Kim, C. Machine learning in materials informatics: recent applications and prospects. npj Comput. Mater. 2017, 3, 54, DOI: 10.1038/s41524-017-0056-5Google ScholarThere is no corresponding record for this reference.
- 2Butler, K. T.; Davies, D. W.; Cartwright, H.; Isayev, O.; Walsh, A. Machine learning for molecular and materials science. Nature 2018, 559, 547– 555, DOI: 10.1038/s41586-018-0337-2Google Scholar2https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXhtl2jt7vL&md5=13d36f27db8d59f558fe28e946b4b009Machine learning for molecular and materials scienceButler, Keith T.; Davies, Daniel W.; Cartwright, Hugh; Isayev, Olexandr; Walsh, AronNature (London, United Kingdom) (2018), 559 (7715), 547-555CODEN: NATUAS; ISSN:0028-0836. (Nature Research)Here we summarize recent progress in machine learning for the chem. sciences. We outline machine-learning techniques that are suitable for addressing research questions in this domain, as well as future directions for the field. We envisage a future in which the design, synthesis, characterization and application of mols. and materials is accelerated by artificial intelligence.
- 3Schmidt, J.; Marques, M. R.; Botti, S.; Marques, M. A. Recent advances and applications of machine learning in solid-state materials science. npj Comput. Mater. 2019, 5, 83, DOI: 10.1038/s41524-019-0221-0Google ScholarThere is no corresponding record for this reference.
- 4Rogers, D.; Hahn, M. Extended-connectivity fingerprints. J. Chem. Inf. Model. 2010, 50, 742– 754, DOI: 10.1021/ci100050tGoogle Scholar4https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3cXlt1Onsbg%253D&md5=cd6c736cd7a3d280b67f5316acce8006Extended-Connectivity FingerprintsRogers, David; Hahn, MathewJournal of Chemical Information and Modeling (2010), 50 (5), 742-754CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)Extended-connectivity fingerprints (ECFPs) are a novel class of topol. fingerprints for mol. characterization. Historically, topol. fingerprints were developed for substructure and similarity searching. ECFPs were developed specifically for structure-activity modeling. ECFPs are circular fingerprints with a no. of useful qualities: they can be very rapidly calcd.; they are not predefined and can represent an essentially infinite no. of different mol. features (including stereochem. information); their features represent the presence of particular substructures, allowing easier interpretation of anal. results; and the ECFP algorithm can be tailored to generate different types of circular fingerprints, optimized for different uses. While the use of ECFPs has been widely adopted and validated, a description of their implementation has not previously been presented in the literature.
- 5Musil, F.; Grisafi, A.; Bartók, A. P.; Ortner, C.; Csányi, G.; Ceriotti, M. Physics-inspired structural representations for molecules and materials. Chem. Rev. 2021, 121, 9759– 9815, DOI: 10.1021/acs.chemrev.1c00021Google Scholar5https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3MXhs1aisL3J&md5=b75f32ea20fa0d83fb028415f540e1f3Physics-inspired structural representations for molecules and materialsMusil, Felix; Grisafi, Andrea; Bartok, Albert P.; Ortner, Christoph; Csanyi, Gabor; Ceriotti, MicheleChemical Reviews (Washington, DC, United States) (2021), 121 (16), 9759-9815CODEN: CHREAY; ISSN:0009-2665. (American Chemical Society)A review. The first step in the construction of a regression model or a data-driven anal., aiming to predict or elucidate the relationship between the at. scale structure of matter and its properties, involves transforming the Cartesian coordinates of the atoms into a suitable representation. The development of at.-scale representations has played, and continues to play, a central role in the success of machine-learning methods for chem. and materials science. This review summarizes the current understanding of the nature and characteristics of the most commonly used structural and chem. descriptions of atomistic structures, highlighting the deep underlying connections between different frameworks, and the ideas that lead to computationally efficient and universally applicable models. It emphasizes the link between properties, structures, their phys. chem. and their math. description, provides examples of recent applications to a diverse set of chem. and materials science problems, and outlines the open questions and the most promising research directions in the field.
- 6Schütt, K. T.; Sauceda, H. E.; Kindermans, P. J.; Tkatchenko, A.; Müller, K. R. Schnet–a deep learning architecture for molecules and materials. J. Chem. Phys. 2018, 148, 241722 DOI: 10.1063/1.5019779Google Scholar6https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXms1Ggurs%253D&md5=988638d520a423f529a16b35031243aaSchNet - A deep learning architecture for molecules and materialsSchuett, K. T.; Sauceda, H. E.; Kindermans, P.-J.; Tkatchenko, A.; Mueller, K.-R.Journal of Chemical Physics (2018), 148 (24), 241722/1-241722/11CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)Deep learning has led to a paradigm shift in artificial intelligence, including web, text, and image search, speech recognition, as well as bioinformatics, with growing impact in chem. physics. Machine learning, in general, and deep learning, in particular, are ideally suitable for representing quantum-mech. interactions, enabling us to model nonlinear potential-energy surfaces or enhancing the exploration of chem. compd. space. Here, we present the deep learning architecture SchNet that is specifically designed to model atomistic systems by making use of continuous-filter convolutional layers. We demonstrate the capabilities of SchNet by accurately predicting a range of properties across chem. space for mols. and materials, where our model learns chem. plausible embeddings of atom types across the periodic table. Finally, we employ SchNet to predict potential-energy surfaces and energy-conserving force fields for mol. dynamics simulations of small mols. and perform an exemplary study on the quantum-mech. properties of C20-fullerene that would have been infeasible with regular ab initio mol. dynamics. (c) 2018 American Institute of Physics.
- 7Xie, T.; Grossman, J. C. Crystal graph convolutional neural networks for an accurate and interpretable prediction of material properties. Phys. Rev. Lett. 2018, 120, 145301 DOI: 10.1103/PhysRevLett.120.145301Google Scholar7https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXltFSnu7c%253D&md5=93beb5675af86cf95e07c82c136f3511Crystal Graph Convolutional Neural Networks for an Accurate and Interpretable Prediction of Material PropertiesXie, Tian; Grossman, Jeffrey C.Physical Review Letters (2018), 120 (14), 145301CODEN: PRLTAO; ISSN:1079-7114. (American Physical Society)The use of machine learning methods for accelerating the design of cryst. materials usually requires manually constructed feature vectors or complex transformation of atom coordinates to input the crystal structure, which either constrains the model to certain crystal types or makes it difficult to provide chem. insights. Here, we develop a crystal graph convolutional neural networks framework to directly learn material properties from the connection of atoms in the crystal, providing a universal and interpretable representation of cryst. materials. Our method provides a highly accurate prediction of d. functional theory calcd. properties for eight different properties of crystals with various structure types and compns. after being trained with 104 data points. Further, our framework is interpretable because one can ext. the contributions from local chem. environments to global properties. Using an example of perovskites, we show how this information can be utilized to discover empirical rules for materials design.
- 8Chen, C.; Ye, W.; Zuo, Y.; Zheng, C.; Ong, S. P. Graph networks as a universal machine learning framework for molecules and crystals. Chem. Mater. 2019, 31, 3564– 3572, DOI: 10.1021/acs.chemmater.9b01294Google Scholar8https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXntFaqt7g%253D&md5=c924547f17083428802d5698682d5971Graph Networks as a Universal Machine Learning Framework for Molecules and CrystalsChen, Chi; Ye, Weike; Zuo, Yunxing; Zheng, Chen; Ong, Shyue PingChemistry of Materials (2019), 31 (9), 3564-3572CODEN: CMATEX; ISSN:0897-4756. (American Chemical Society)Graph networks are a new machine learning (ML) paradigm that supports both relational reasoning and combinatorial generalization. Here, we develop universal MatErials Graph Network (MEGNet) models for accurate property prediction in both mols. and crystals. We demonstrate that the MEGNet models outperform prior ML models such as the SchNet in 11 out of 13 properties of the QM9 mol. data set. Similarly, we show that MEGNet models trained on ∼60 000 crystals in the Materials Project substantially outperform prior ML models in the prediction of the formation energies, band gaps, and elastic moduli of crystals, achieving better than d. functional theory accuracy over a much larger data set. We present two new strategies to address data limitations common in materials science and chem. First, we demonstrate a phys. intuitive approach to unify four sep. mol. MEGNet models for the internal energy at 0 K and room temp., enthalpy, and Gibbs free energy into a single free energy MEGNet model by incorporating the temp., pressure, and entropy as global state inputs. Second, we show that the learned element embeddings in MEGNet models encode periodic chem. trends and can be transfer-learned from a property model trained on a larger data set (formation energies) to improve property models with smaller amts. of data (band gaps and elastic moduli).
- 9Jiang, Y.; Yang, Z.; Guo, J.; Li, H.; Liu, Y.; Guo, Y.; Li, M.; Pu, X. Coupling complementary strategy to flexible graph neural network for quick discovery of coformer in diverse co-crystal materials. Nat. Commun. 2021, 12, 5950 DOI: 10.1038/s41467-021-26226-7Google Scholar9https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3MXit1aqsbjE&md5=e877fd347df52869466130c176db61ecCoupling complementary strategy to flexible graph neural network for quick discovery of coformer in diverse co-crystal materialsJiang, Yuanyuan; Yang, Zongwei; Guo, Jiali; Li, Hongzhen; Liu, Yijing; Guo, Yanzhi; Li, Menglong; Pu, XuemeiNature Communications (2021), 12 (1), 5950CODEN: NCAOBW; ISSN:2041-1723. (Nature Research)Cocrystal engineering have been widely applied in pharmaceutical, chem. and material fields. However, how to effectively choose coformer has been a challenging task on expts. Here we develop a graph neural network (GNN) based deep learning framework to quickly predict formation of the cocrystal. In order to capture main driving force to crystn. from 6819 pos. and 1052 neg. samples reported by expts., a feasible GNN framework is explored to integrate important prior knowledge into end-to-end learning on the mol. graph. The model is strongly validated against seven competitive models and three challenging independent test sets involving pharmaceutical cocrystals, π-π cocrystals and energetic cocrystals, exhibiting superior performance with accuracy higher than 96%, confirming its robustness and generalization. Furthermore, one new energetic cocrystal predicted is successfully synthesized, showcasing high potential of the model in practice. All the data and source codes are available at https://github.com/Saoge123/ccgnet for aiding cocrystal community.
- 10Park, C. W.; Wolverton, C. Developing an improved crystal graph convolutional neural network framework for accelerated materials discovery. Phys. Rev. Mater. 2020, 4, 063801 DOI: 10.1103/PhysRevMaterials.4.063801Google Scholar10https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3cXhsFKis7fP&md5=0b5903b05d47535d7c2967b7d9d6fc6bDeveloping an improved crystal graph convolutional neural network framework for accelerated materials discoveryPark, Cheol Woo; Wolverton, ChrisPhysical Review Materials (2020), 4 (6), 063801CODEN: PRMHBS; ISSN:2475-9953. (American Physical Society)The recently proposed crystal graph convolutional neural network (CGCNN) offers a highly versatile and accurate machine learning (ML) framework by learning material properties directly from graphlike representations of crystal structures ("crystal graphs"). Here, we develop an improved variant of the CGCNN model (iCGCNN) that outperforms the original by incorporating information of the Voronoi tessellated crystal structure, explicit three-body correlations of neighboring constituent atoms, and an optimized chem. representation of interat. bonds in the crystal graphs. We demonstrate the accuracy of the improved framework in two distinct illustrations: First, when trained/validated on 180 000/20 000 d. functional theory (DFT) calcd. thermodn. stability entries taken from the Open Quantum Materials Database (OQMD) and evaluated on a sep. test set of 230 000 entries, iCGCNN achieves a predictive accuracy that is significantly improved, i.e., 20% higher than that of the original CGCNN. Second, when used to assist a high-throughput search for materials in the ThCr2Si2 structure-type, iCGCNN exhibited a success rate of 31% which is 155 times higher than an undirected high-throughput search and 2.4 times higher than that of the original CGCNN. Using both CGCNN and iCGCNN, we screened 132 600 compds. with elemental decorations of the ThCr2Si2 prototype crystal structure and identified a total of 97 unique stable compds. by performing 757 DFT calcns., accelerating the computational time of the high-throughput search by a factor of 65. Our results suggest that the iCGCNN can be used to accelerate high-throughput discoveries of new materials by quickly and accurately identifying cryst. compds. with properties of interest.
- 11Louis, S. Y.; Zhao, Y.; Nasiri, A.; Wang, X.; Song, Y.; Liu, F.; Hu, J. Graph convolutional neural networks with global attention for improved materials property prediction. Phys. Chem. Chem. Phys. 2020, 22, 18141– 18148, DOI: 10.1039/D0CP01474EGoogle Scholar11https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3cXhsFSmu7bL&md5=9190661d65053bc86d36c270be20cf0dGraph convolutional neural networks with global attention for improved materials property predictionLouis, Steph-Yves; Zhao, Yong; Nasiri, Alireza; Wang, Xiran; Song, Yuqi; Liu, Fei; Hu, JianjunPhysical Chemistry Chemical Physics (2020), 22 (32), 18141-18148CODEN: PPCPFQ; ISSN:1463-9076. (Royal Society of Chemistry)The development of an efficient and powerful machine learning (ML) model for materials property prediction (MPP) remains an important challenge in materials science. While various techniques have been proposed to ext. physicochem. features in MPP, graph neural networks (GNN) have also shown very strong capability in capturing effective features for high-performance MPP. Nevertheless, current GNN models do not effectively differentiate the contributions from different atoms. In this paper we develop a novel graph neural network model called GATGNN for predicting properties of inorg. materials. GATGNN is characterized by its compn. of augmented graph-attention layers (AGAT) and a global attention layer. The application of AGAT layers and global attention layers resp. learn the local relationship among neighboring atoms and overall contribution of the atoms to the material's property; together making our framework achieve considerably better prediction performance on various tested properties. Through extensive expts., we show that our method is able to outperform existing state-of-the-art GNN models while it can also provide a measurable insight into the correlation between the atoms and their material property.
- 12Cheng, J.; Zhang, C.; Dong, L. A geometric-information-enhanced crystal graph network for predicting properties of materials. Commun. Mater. 2021, 2, 92, DOI: 10.1038/s43246-021-00194-3Google Scholar12https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB38XotVCku78%253D&md5=44ae02fdbbde3d1eef3d2e2fe77fb860A geometric-information-enhanced crystal graph network for predicting properties of materialsCheng, Jiucheng; Zhang, Chunkai; Dong, LifengCommunications Materials (2021), 2 (1), 92CODEN: CMOAGE; ISSN:2662-4443. (Nature Portfolio)Graph neural networks (GNNs) have been used previously for identifying new cryst. materials. However, geometric structure is not usually taken into consideration, or only partially. Here, we develop a geometric-information-enhanced crystal graph neural network (GeoCGNN) to predict the properties of cryst. materials. By considering the distance vector between each node and its neighbors, our model can learn full topol. and spatial geometric structure information. Furthermore, we incorporate an effective method based on the mixed basis functions to encode the geometric information into our model, which outperforms other GNN methods in a variety of databases. For example, for predicting formation energy our model is 25.6%, 14.3% and 35.7% more accurate than CGCNN, MEGNet and iCGCNN models, resp. For band gap, our model outperforms CGCNN by 27.6% and MEGNet by 12.4%.
- 13Choudhary, K.; DeCost, B. Atomistic line graph neural network for improved materials property predictions. npj Comput. Mater. 2021, 7, 185, DOI: 10.1038/s41524-021-00650-1Google ScholarThere is no corresponding record for this reference.
- 14Pandey, S.; Qu, J.; Stevanović, V.; John, P. S.; Gorai, P. Predicting energy and stability of known and hypothetical crystals using graph neural network. Patterns 2021, 2, 100361 DOI: 10.1016/j.patter.2021.100361Google Scholar14https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A280%3ADC%252BB2cfntlejtA%253D%253D&md5=fc71858fe7691e1e930bf632ef025975Predicting energy and stability of known and hypothetical crystals using graph neural networkPandey Shubham; Stevanovic Vladan; Gorai Prashun; Qu Jiaxing; St John PeterPatterns (New York, N.Y.) (2021), 2 (11), 100361 ISSN:.The discovery of new inorganic materials in unexplored chemical spaces necessitates calculating total energy quickly and with sufficient accuracy. Machine learning models that provide such a capability for both ground-state (GS) and higher-energy structures would be instrumental in accelerated screening. Here, we demonstrate the importance of a balanced training dataset of GS and higher-energy structures to accurately predict total energies using a generic graph neural network architecture. Using [Formula: see text] 16,500 density functional theory calculations from the National Renewable Energy Laboratory (NREL) Materials Database and [Formula: see text] 11,000 calculations for hypothetical structures as our training database, we demonstrate that our model satisfactorily ranks the structures in the correct order of total energies for a given composition. Furthermore, we present a thorough error analysis to explain failure modes of the model, including both prediction outliers and occasional inconsistencies in the training data. By examining intermediate layers of the model, we analyze how the model represents learned structures and properties.
- 15Frey, N. C.; Akinwande, D.; Jariwala, D.; Shenoy, V. B. Machine learning-enabled design of point defects in 2d materials for quantum and neuromorphic information processing. ACS Nano 2020, 14, 13406– 13417, DOI: 10.1021/acsnano.0c05267Google Scholar15https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3cXhslKqtLfF&md5=1ade77df47fb8faf2dbe58746ccb4788Machine Learning-Enabled Design of Point Defects in 2D Materials for Quantum and Neuromorphic Information ProcessingFrey, Nathan C.; Akinwande, Deji; Jariwala, Deep; Shenoy, Vivek B.ACS Nano (2020), 14 (10), 13406-13417CODEN: ANCAC3; ISSN:1936-0851. (American Chemical Society)Engineered point defects in two-dimensional (2D) materials offer an attractive platform for solid-state devices that exploit tailored optoelectronic, quantum emission, and resistive properties. Naturally occurring defects are also unavoidably important contributors to material properties and performance. The immense variety and complexity of possible defects make it challenging to exptl. control, probe, or understand at.-scale defect-property relationships. Here, we develop an approach based on deep transfer learning, machine learning, and first-principles calcns. to rapidly predict key properties of point defects in 2D materials. We use physics-informed featurization to generate a minimal description of defect structures and present a general picture of defects across materials systems. We identify over one hundred promising, unexplored dopant defect structures in layered metal chalcogenides, hexagonal nitrides, and metal halides. These defects are prime candidates for quantum emission, resistive switching, and neuromorphic computing.
- 16Fung, V.; Zhang, J.; Juarez, E.; Sumpter, B. G. Benchmarking graph neural networks for materials chemistry. npj Comput. Mater. 2021, 7, 84, DOI: 10.1038/s41524-021-00554-0Google Scholar16https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3MXhvFehs7zM&md5=e49c3e0b3e69dda5e5b41a33e3eba307Benchmarking graph neural networks for materials chemistryFung, Victor; Zhang, Jiaxin; Juarez, Eric; Sumpter, Bobby G.npj Computational Materials (2021), 7 (1), 84CODEN: NCMPCS; ISSN:2057-3960. (Nature Research)Graph neural networks (GNNs) have received intense interest as a rapidly expanding class of machine learning models remarkably well-suited for materials applications. To date, a no. of successful GNNs have been proposed and demonstrated for systems ranging from crystal stability to electronic property prediction and to surface chem. and heterogeneous catalysis. However, a consistent benchmark of these models remains lacking, hindering the development and consistent evaluation of new models in the materials field. Here, we present a workflow and testing platform, MatDeepLearn, for quickly and reproducibly assessing and comparing GNNs and other machine learning models. We use this platform to optimize and evaluate a selection of top performing GNNs on several representative datasets in computational materials chem. From our investigations we note the importance of hyperparameter selection and find roughly similar performances for the top models once optimized. We identify several strengths in GNNs over conventional models in cases with compositionally diverse datasets and in its overall flexibility with respect to inputs, due to learned rather than defined representations. Meanwhile several weaknesses of GNNs are also obsd. including high data requirements, and suggestions for further improvement for applications in materials chem. are discussed.
- 17Jalali, M.; Tsotsalas, M.; Wöll, C. MOFSocialNet: Exploiting metal-organic framework relationships via social network analysis. Nanomaterials 2022, 12, 704, DOI: 10.3390/nano12040704Google Scholar17https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB38XlslWmtLo%253D&md5=1bb836c9d57b2127944c9e1b8a45050eMOFSocialNet: Exploiting Metal-Organic Framework Relationships via Social Network AnalysisJalali, Mehrdad; Tsotsalas, Manuel; Woell, ChristofNanomaterials (2022), 12 (4), 704CODEN: NANOKO; ISSN:2079-4991. (MDPI AG)The no. of metal-org. frameworks (MOF) as well as the no. of applications of this material are growing rapidly. With the no. of characterized compds. exceeding 100,000, manual sorting becomes impossible. At the same time, the increasing computer power and established use of automated machine learning approaches makes data science tools available, that provide an overview of the MOF chem. space and support the selection of suitable MOFs for a desired application. Among the different data science tools, graph theory approaches, where data generated from numerous real-world applications is represented as a graph (network) of interconnected objects, has been widely used in a variety of scientific fields such as social sciences, health informatics, biol. sciences, agricultural sciences and economics. We describe the application of a particular graph theory approach known as social network anal. to MOF materials and highlight the importance of community (group) detection and graph node centrality. In this first application of the social network anal. approach to MOF chem. space, we created MOFSocialNet. This social network is based on the geometrical descriptors of MOFs available in the CoRE-MOFs database. MOFSocialNet can discover communities with similar MOFs structures and identify the most representative MOFs within a given community. In addn., anal. of MOFSocialNet using social network anal. methods can predict MOF properties more accurately than conventional ML tools. The latter advantage is demonstrated for the prediction of gas storage properties, the most important property of these porous reticular networks.
- 18Chen, C.; Ong, S. P. A universal graph deep learning interatomic potential for the periodic table. Nat. Comput. Sci. 2022, 2, 718– 728, DOI: 10.1038/s43588-022-00349-3Google ScholarThere is no corresponding record for this reference.https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=&md5=9874b665cc7a056b8e2f928dd3112440
- 19Egorova, O.; Hafizi, R.; Woods, D. C.; Day, G. M. Multifidelity statistical machine learning for molecular crystal structure prediction. J. Phys. Chem. A 2020, 124, 8065– 8078, DOI: 10.1021/acs.jpca.0c05006Google Scholar19https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3cXhslequ7jL&md5=15ef21ac6c7aafa85f5e4fc3bb3e0fceMultifidelity Statistical Machine Learning for Molecular Crystal Structure PredictionEgorova, Olga; Hafizi, Roohollah; Woods, David C.; Day, Graeme M.Journal of Physical Chemistry A (2020), 124 (39), 8065-8078CODEN: JPCAFH; ISSN:1089-5639. (American Chemical Society)Statistical machine learning was applied to predict expensive hybrid functional DFT (PBE0) calcns. of crystal structures using a multifidelity approach to reevaluate the energies of crystal structures predicted with an inexpensive force field. The method uses an autoregressive Gaussian process, making use of less expensive GGA DFT (PBE) calcns. to bridge the gap between the force field and PBE0 energies. The method is benchmarked on the crystal structure landscapes of 3 small, H-bonded org. mols. and produces accurate predictions of energies and crystal structure ranking using small nos. of the most expensive calcns.; the PBE0 energies can be predicted with errors of <1 kJ mol-1 with between 4.2 and 6.8% of the cost of the full calcns. As the developed model is probabilistic, how the uncertainties in predicted energies impact the assessment of the energetic ranking of crystal structures are discussed.
- 20Han, Y.; Ali, I.; Wang, Z.; Cai, J.; Wu, S.; Tang, J.; Zhang, L.; Ren, J.; Xiao, R.; Lu, Q.; Hang, L. Machine learning accelerates quantum mechanics predictions of molecular crystals. Phys. Rep. 2021, 934, 1– 71, DOI: 10.1016/j.physrep.2021.08.002Google Scholar20https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3MXhvFWiurzI&md5=829b4c028e46feb7be228b2062790582Machine learning accelerates quantum mechanics predictions of molecular crystalsHan, Yanqiang; Ali, Imran; Wang, Zhilong; Cai, Junfei; Wu, Sicheng; Tang, Jiequn; Zhang, Lin; Ren, Jiahao; Xiao, Rui; Lu, Qianqian; Hang, Lei; Luo, Hongyuan; Li, JinjinPhysics Reports (2021), 934 (), 1-71CODEN: PRPLCM; ISSN:0370-1573. (Elsevier B.V.)A review. Quantum mechanics (QM) approaches (DFT, MP2, CCSD(T), etc.) play an important role in calcg. mols. and crystals with a high accuracy and acceptable efficiency. In recent years, with the development of artificial intelligence technol., machine learning (ML) has played an increasingly essential role in accelerating the QM calcns. and predictions of mol. crystals, as well as the discovery of novel materials. This review provides state-of-the-art information and prospects for QM theories, fragment-based methods and ML methods, as well as their up-to-date applications in predicting small inorg. mols., large drug mols. and relevant mol. crystals. The discussed applications include ML potential energy surface (PES) construction, crystal structure prediction (CSP), chem. reaction prediction and predictions of a series of properties, such as structure, energy, at. force, bond length, chem. shift, supercond., super-hardness, vibrational spectra, phase transition and diagram. This work also reviews software and packages built recently based on ML methods for property predictions and PES constructions in the field of physics and chem. For the three discussed methods, the most time-consuming one is the high-level all-atom QM method, which is capable of describing electronic structures with high accuracy and thus predicts properties that are consistent with the exptl. results. The second one, fragment-based QM method, requires less computational time than all-atom QM, which can accelerate all-atom QM calcns. for large systems by dividing the entire system into subsystems, presenting a considerable efficiency increase. The computational complexities for fragment-based QM and all-atom QM are N - N2 and N5-N7 (N is the size of the system), resp. A well-trained ML model can make the above predictions within seconds while ensuring a high prediction accuracy, where its prediction cost and accuracy are detd. by the training data and the training process. Therefore, it is challenging for ML applications in physics and chem. to generate highly accurate and powerful ML models while ensuring sufficient datasets. This work not only provides an overview of the recent progress in QM theories, fragment-based methods, ML methods and several ML-based software programs and applications on small inorg. mols., large drug mols. and relevant crystals, but also shed light on ML methods in accelerating QM prediction, optimization and novel crystal material design.
- 21Ishizaki, K.; Sugimoto, R.; Hagiwara, Y.; Koshima, H.; Taniguchi, T.; Asahi, T. Actuation performance of a photo-bending crystal modeled by machine learning-based regression. CrystEngComm 2021, 23, 5839– 5847, DOI: 10.1039/D1CE00208BGoogle Scholar21https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3MXmtVSqu7k%253D&md5=3ac98d93899abba3690d8a0f9114e0b6Actuation performance of a photo-bending crystal modeled by machine learning-based regressionIshizaki, Kazuki; Sugimoto, Ryota; Hagiwara, Yuki; Koshima, Hideko; Taniguchi, Takuya; Asahi, ToruCrystEngComm (2021), 23 (34), 5839-5847CODEN: CRECF4; ISSN:1466-8033. (Royal Society of Chemistry)Photomech. mol. crystals are a family of mech. responsive materials that have been recognized as a novel type of actuator. The actuation properties of photomech. crystals should be characterized. However, deflection and force, which are crucial for actuators, are dependent on exptl. conditions such as light intensity and crystal size, and thus the no. of combinations under different conditions is infinite. This causes difficulty in obtaining the relationship between the exptl. conditions and actuation outputs. To solve this problem, this paper presents a machine learning-based regression approach for modeling the deflection and force of a photo-bending crystal. The deflection and force were exptl. measured for various light intensities and crystal sizes. These time-series data of deflection and force were represented by feature values via exponential fitting for bending and unbending processes. The features of the max. value and speed of deflection and force were analyzed by polynomial regression and variable selection. Through this process, the most fitted models were constructed for deflection and force and most of them were interpreted by material mechanics. This statistical strategy will potentially be applied to control or optimize other functions of stimuli-responsive crystals.
- 22Groom, C. R.; Bruno, I. J.; Lightfoot, M. P.; Ward, S. C. The Cambridge structural database. Acta Crystallogr., Sect. B 2016, 72, 171– 179, DOI: 10.1107/S2052520616003954Google Scholar22https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC28Xls1Kntro%253D&md5=f9c65ab86fc9db429588c95b0da3f9b2The Cambridge Structural DatabaseGroom, Colin R.; Bruno, Ian J.; Lightfoot, Matthew P.; Ward, Suzanna C.Acta Crystallographica, Section B: Structural Science, Crystal Engineering and Materials (2016), 72 (2), 171-179CODEN: ACSBDA; ISSN:2052-5206. (International Union of Crystallography)The Cambridge Structural Database (CSD) contains a complete record of all published org. and metal-org. small-mol. crystal structures. The database has been in operation for over 50 years and continues to be the primary means of sharing structural chem. data and knowledge across disciplines. As well as structures that are made public to support scientific articles, it includes many structures published directly as CSD Communications. All structures are processed both computationally and by expert structural chem. editors prior to entering the database. A key component of this processing is the reliable assocn. of the chem. identity of the structure studied with the exptl. data. This important step helps ensure that data is widely discoverable and readily reusable. Content is further enriched through selective inclusion of addnl. exptl. data. Entries are available to anyone through free CSD community web services. Linking services developed and maintained by the CCDC, combined with the use of std. identifiers, facilitate discovery from other resources. Data can also be accessed through CCDC and third party software applications and through an application programming interface.
- 23Gražulis, S.; Daškevič, A.; Merkys, A.; Chateigner, D.; Lutterotti, L.; Quirós, M.; Serebryanaya, N. R.; Moeck, P.; Downs, R. T.; Bail, A. L. Crystallography Open Database (COD): an open-access collection of crystal structures and platform for world-wide collaboration. Nucleic Acids Res. 2012, 40, D420– D427, DOI: 10.1093/nar/gkr900Google Scholar23https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3MXhs12htbnP&md5=670b6cc1f2758728c303c4ad89e1cc6fCrystallography Open Database (COD): an open-access collection of crystal structures and platform for world-wide collaborationGrazulis, Saulius; Daskevic, Adriana; Merkys, Andrius; Chateigner, Daniel; Lutterotti, Luca; Quiros, Miguel; Serebryanaya, Nadezhda R.; Moeck, Peter; Downs, Robert T.; Le Bail, ArmelNucleic Acids Research (2012), 40 (D1), D420-D427CODEN: NARHAD; ISSN:0305-1048. (Oxford University Press)Using an open-access distribution model, the Crystallog. Open Database (COD, http://www.crystallog.net) collects all known small mol. / small to medium sized unit cell' crystal structures and makes them available freely on the Internet. As of today, the COD has aggregated ∼150 000 structures, offering basic search capabilities and the possibility to download the whole database, or parts thereof using a variety of std. open communication protocols. A newly developed website provides capabilities for all registered users to deposit published and so far unpublished structures as personal communications or pre-publication depositions. Such a setup enables extension of the COD database by many users simultaneously. This increases the possibilities for growth of the COD database, and is the first step towards establishing a world wide Internet-based collaborative platform dedicated to the collection and curation of structural knowledge.
- 24Olsthoorn, B.; Geilhufe, R. M.; Borysov, S. S.; Balatsky, A. V. Band gap prediction for large organic crystal structures with machine learning. Adv. Quantum Technol. 2019, 2, 1900023 DOI: 10.1002/qute.201900023Google ScholarThere is no corresponding record for this reference.
- 25Bartók, A. P.; Kondor, R.; Csányi, G. On representing chemical environments. Phys. Rev. B 2013, 87, 184115 DOI: 10.1103/PhysRevB.87.184115Google Scholar25https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3sXpvFClu7Y%253D&md5=f7739275562b8e77d4532f00da8814fbOn representing chemical environmentsBartok, Albert P.; Kondor, Risi; Csanyi, GaborPhysical Review B: Condensed Matter and Materials Physics (2013), 87 (18), 184115/1-184115/16CODEN: PRBMDO; ISSN:1098-0121. (American Physical Society)We review some recently published methods to represent at. neighborhood environments, and analyze their relative merits in terms of their faithfulness and suitability for fitting potential energy surfaces. The crucial properties that such representations (sometimes called descriptors) must have are differentiability with respect to moving the atoms and invariance to the basic symmetries of physics: rotation, reflection, translation, and permutation of atoms of the same species. We demonstrate that certain widely used descriptors that initially look quite different are specific cases of a general approach, in which a finite set of basis functions with increasing angular wave nos. are used to expand the at. neighborhood d. function. Using the example system of small clusters, we quant. show that this expansion needs to be carried to higher and higher wave nos. as the no. of neighbors increases in order to obtain a faithful representation, and that variants of the descriptors converge at very different rates. We also propose an altogether different approach, called Smooth Overlap of Atomic Positions, that sidesteps these difficulties by directly defining the similarity between any two neighborhood environments, and show that it is still closely connected to the invariant descriptors. We test the performance of the various representations by fitting models to the potential energy surface of small silicon clusters and the bulk crystal.
- 26Wengert, S.; Csányi, G.; Reuter, K.; Margraf, J. T. Data-efficient machine learning for molecular crystal structure prediction. Chem. Sci. 2021, 12, 4536– 4546, DOI: 10.1039/D0SC05765GGoogle Scholar26https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3MXjslWrtr4%253D&md5=b168772ffa3e022a39d1e57a954804ffData-efficient machine learning for molecular crystal structure predictionWengert, Simon; Csanyi, Gabor; Reuter, Karsten; Margraf, Johannes T.Chemical Science (2021), 12 (12), 4536-4546CODEN: CSHCCN; ISSN:2041-6520. (Royal Society of Chemistry)The combination of modern machine learning (ML) approaches with high-quality data from quantum mech. (QM) calcns. can yield models with an unrivalled accuracy/cost ratio. However, such methods are ultimately limited by the computational effort required to produce the ref. data. In particular, ref. calcns. for periodic systems with many atoms can become prohibitively expensive for higher levels of theory. This trade-off is crit. in the context of org. crystal structure prediction (CSP). Here, a data-efficient ML approach would be highly desirable, since screening a huge space of possible polymorphs in a narrow energy range requires the assessment of a large no. of trial structures with high accuracy. In this contribution, we present tailored Δ-ML models that allow screening a wide range of crystal candidates while adequately describing the subtle interplay between intermol. interactions such as H-bonding and many-body dispersion effects. This is achieved by enhancing a physics-based description of long-range interactions at the d. functional tight binding (DFTB) level-for which an efficient implementation is available-with a short-range ML model trained on high-quality first-principles ref. data. The presented workflow is broadly applicable to different mol. materials, without the need for a single periodic calcn. at the ref. level of theory. We show that this even allows the use of wavefunction methods in CSP.
- 27Takagi, D.; Ishizaki, K.; Asahi, T.; Taniguchi, T. Molecular screening for solid–solid phase transition by machine learning. Digital Discovery 2023, 2, 1126, DOI: 10.1039/D3DD00034FGoogle Scholar27https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3sXhvVygurvF&md5=027166e2d42022ecf59c4e8d6846490dMolecular screening for solid-solid phase transitions by machine learningTakagi, Daisuke; Ishizaki, Kazuki; Asahi, Toru; Taniguchi, TakuyaDigital Discovery (2023), 2 (4), 1126-1133CODEN: DDIIAI; ISSN:2635-098X. (Royal Society of Chemistry)The solid-solid phase transition in mol. crystals is generally found by chance empirically. In this study, we constructed a machine learning framework to screen mols. that will exhibit solid-solid phase transitions in their cryst. states, based on pos.-unlabeled learning. We trained classification models using the pos. dataset we constructed manually and the unlabeled data extd. from the Cambridge Structural Database. The best classifier works as a suggester, and 9 substances among the suggested 113 mols. were found to exhibit solid-solid phase transitions according to the literature and expts. The finding probability of 8.0% is much higher than the probability of phase transition in the database, suggesting the effectiveness of mol. selection by this workflow. We also found that the mol. structure is weakly related to the transition temp. by regression anal. The findings of this study are useful for designing functional mol. crystals with solid-solid phase transitions.
- 28Musil, F.; De, S.; Yang, J.; Campbell, J. E.; Day, G. M.; Ceriotti, M. Machine learning for the structure–energy–property landscapes of molecular crystals. Chem. Sci. 2018, 9, 1289– 1300, DOI: 10.1039/C7SC04665KGoogle Scholar28https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2sXhvFGnsbfE&md5=48b184722478d1a988a2090f765135e2Machine learning for the structure-energy-property landscapes of molecular crystalsMusil, Felix; De, Sandip; Yang, Jack; Campbell, Joshua E.; Day, Graeme M.; Ceriotti, MicheleChemical Science (2018), 9 (5), 1289-1300CODEN: CSHCCN; ISSN:2041-6520. (Royal Society of Chemistry)Mol. crystals play an important role in several fields of science and technol. They frequently crystallize in different polymorphs with substantially different phys. properties. To help guide the synthesis of candidate materials, at.-scale modeling can be used to enumerate the stable polymorphs and to predict their properties, as well as to propose heuristic rules to rationalize the correlations between crystal structure and materials properties. Here we show how a recently-developed machine-learning (ML) framework can be used to achieve inexpensive and accurate predictions of the stability and properties of polymorphs, and a data-driven classification that is less biased and more flexible than typical heuristic rules. We discuss, as examples, the lattice energy and property landscapes of pentacene and two azapentacene isomers that are of interest as org. semiconductor materials. We show that we can est. force field or DFT lattice energies with sub-kJ mol-1 accuracy, using only a few hundred ref. configurations, and reduce by a factor of ten the computational effort needed to predict charge mobility in the crystal structures. The automatic structural classification of the polymorphs reveals a more detailed picture of mol. packing than that provided by conventional heuristics, and helps disentangle the role of hydrogen bonded and π-stacking interactions in detg. mol. self-assembly. This observation demonstrates that ML is not just a black-box scheme to interpolate between ref. calcns., but can also be used as a tool to gain intuitive insights into structure-property relations in mol. crystal engineering.
- 29Quirós, M.; Gražulis, S.; Girdzijauskaitė, S.; Merkys, A.; Vaitkus, A. Using SMILES strings for the description of chemical connectivity in the Crystallography Open Database. J. Cheminf. 2018, 10, 23, DOI: 10.1186/s13321-018-0279-6Google Scholar29https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXmtFWiurk%253D&md5=99cd2cb8206426aae7de48ea932fe169Using SMILES strings for the description of chemical connectivity in the Crystallography Open DatabaseQuiros, Miguel; Grazulis, Saulius; Girdzijauskaite, Saule; Merkys, Andrius; Vaitkus, AntanasJournal of Cheminformatics (2018), 10 (), 23/1-23/17CODEN: JCOHB3; ISSN:1758-2946. (Chemistry Central Ltd.)Computer descriptions of chem. mol. connectivity are necessary for searching chem. databases and for predicting chem. properties from mol. structure. In this article, the ongoing work to describe the chem. connectivity of entries contained in the Crystallog. Open Database (COD) in SMILES format is reported. This collection of SMILES is publicly available for chem. (substructure) search or for any other purpose on an open-access basis, as is the COD itself. The conventions that have been followed for the representation of compds. that do not fit into the valence bond theory are outlined for the most frequently found cases. The procedure for getting the SMILES out of the CIF files starts with checking whether the atoms in the asym. unit are a chem. acceptable image of the compd. When they are not (mol. in a symmetry element, disorder, polymeric species, etc.), the previously published cif_mol. program is used to get such image in many cases. The program package Open Babel is then applied to get SMILES strings from the CIF files (either those directly taken from the COD or those produced by cif_mol. when applicable). The results are then checked and/or fixed by a human editor, in a computer-aided task that at present still consumes a great deal of human time. Even if the procedure still needs to be improved to make it more automatic (and hence faster), it has already yielded more than 160,000 curated chem. structures and the purpose of this article is to announce the existence of this work to the chem. community as well as to spread the use of its results.
- 30Feng, X.; Becke, A. D.; Johnson, E. R. Theoretical investigation of polymorph-and coformer-dependent photoluminescence in molecular crystals. CrystEngComm 2021, 23, 4264– 4271, DOI: 10.1039/D1CE00383FGoogle Scholar30https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3MXhtFSkurrK&md5=4789ac696c9e43357bc15af8988abfedTheoretical investigation of polymorph- and coformer-dependent photoluminescence in molecular crystalsFeng, Xibo; Becke, Axel D.; Johnson, Erin R.CrystEngComm (2021), 23 (24), 4264-4271CODEN: CRECF4; ISSN:1466-8033. (Royal Society of Chemistry)Polymorph- and coformer-dependent photoluminescence (PL) are among the variety of novel solid-state PL phenomena recently obsd. in many mol. crystals. They are of particular research interest due to their direct connections to two heavily investigated topics in crystal engineering: polymorphism and cocrystn. Herein, we apply a novel computational methodol., initially proposed and successfully applied in our previous investigation of piezochromism, to theor. modeling of the polymorph- and coformer-dependent PL in the well-known ROY polymorphs and the recently synthesized 9-acetylanthracene (9-ACA) cocrystals, resp. Our methodol. offers satisfactory prediction of the exptl. obsd. color zoning for the ROY polymorphs and provides good qual. and quant. accuracy for the emission (fluorescence) energies of the 9-ACA cocrystals, although the results in both cases may be adversely affected by delocalization error in the d.-functional methods employed. While the polymorph-dependent PL in ROY is found to be controlled by the intramol. geometry, modeling of the periodic crystal environment is necessary for accurate prediction of the coformer-dependent PL in the 9-ACA cocrystals, which is driven by charge transfer.
- 31Aziz, A.; Sidat, A.; Talati, P.; Crespo-Otero, R. Understanding the solid state luminescence and piezochromic properties in polymorphs of an anthracene derivative. Phys. Chem. Chem. Phys. 2022, 24, 2832– 2842, DOI: 10.1039/D1CP05192JGoogle Scholar31https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB38XhsF2ls7g%253D&md5=7bb4c15ceff7e88447a96420c707f5a2Understanding the solid state luminescence and piezochromic properties in polymorphs of an anthracene derivativeAziz, Alex; Sidat, Amir; Talati, Priyesh; Crespo-Otero, RachelPhysical Chemistry Chemical Physics (2022), 24 (5), 2832-2842CODEN: PPCPFQ; ISSN:1463-9076. (Royal Society of Chemistry)Luminescent mol. crystals have gained significant research interest for optoelectronic applications. However, fully understanding their structural and electronic relationships in the condensed phase and under external stimuli remains a significant challenge. Here, piezochromism in the mol. crystal 9,10-bis((E)-2-(pyridin-4-yl)vinyl)anthracene (BP4VA) is studied using a combination of d. functional theory (DFT) and time-dependent TD-DFT. We investigate the effects that mol. packing and geometry have on the electronic and phonon structure and the excited state properties in this archetypal system. We find that the luminescence properties are red-shifted with the transition from a herringbone to a sheet packing arrangement. An almost continuous red-shift in the band gap is found with the application of an external pressure through the enhancement of π-π and CH-π interactions, and is a mechanism in fine tuning an emissive response. The anal. of the phonon structure of the mol. crystal suggests restriction of motion in the herringbone packing arrangement, with motion restricted at higher pressure. This is supported by the Huang-Rhys factors which show a decrease in the reorganisation energy with the application of pressure. Ultimately, a balance between the decrease in reorganisation energies and the increase in exciton coupling will det. whether nonradiative decay is enhanced or decreased with the increase in pressure in these systems.
- 32Odom, S. A.; Caruso, M. M.; Finke, A. D.; Prokup, A. M.; Ritchey, J. A.; Leonard, J. H.; White, S. R.; Sottos, N. R.; Moore, J. S. Restoration of conductivity with TTF-TCNQ charge-transfer salts. Adv. Funct. Mater. 2010, 20, 1721– 1727, DOI: 10.1002/adfm.201000159Google Scholar32https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3cXntVyju7Y%253D&md5=a590d7aeaab369f76599fcd14cf8ab70Restoration of Conductivity with TTF-TCNQ Charge-Transfer SaltsOdom, Susan A.; Caruso, Mary M.; Finke, Aaron D.; Prokup, Alex M.; Ritchey, Joshua A.; Leonard, John H.; White, Scott R.; Sottos, Nancy R.; Moore, Jeffrey S.Advanced Functional Materials (2010), 20 (11), 1721-1727CODEN: AFMDC6; ISSN:1616-301X. (Wiley-VCH Verlag GmbH & Co. KGaA)The formation of the conductive TTF-TCNQ (tetrathiafulvalene-tetracyanoquinodimethane) charge-transfer salt via rupture of microencapsulated solns. of its individual components is reported. Solns. of TTF and TCNQ in various solvents are sep. incorporated into poly(urea-formaldehyde) core-shell microcapsules. Rupture of a mixt. of TTF-contg. microcapsules and TCNQ-contg. microcapsules gave the cryst. salt, as verified by FTIR spectroscopy and powder x-ray diffraction. Preliminary measurements demonstrate the partial restoration of cond. of severed gold electrodes in the presence of TTF-TCNQ derived in situ. This is the 1st microcapsule system for the restoration of cond. in mech. damaged electronic devices in which the repairing agent is not conductive until its release.
- 33Blöchl, P. E. Projector augmented-wave method. Phys. Rev. B 1994, 50, 17953, DOI: 10.1103/PhysRevB.50.17953Google Scholar33https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A280%3ADC%252BC2sfjslSntA%253D%253D&md5=1853d67af808af2edab58beaab5d3051Projector augmented-wave methodBlochlPhysical review. B, Condensed matter (1994), 50 (24), 17953-17979 ISSN:0163-1829.There is no expanded citation for this reference.
- 34Kresse, G.; Joubert, D. From ultrasoft pseudopotentials to the projector augmented-wave method. Phys. Rev. B 1999, 59, 1758, DOI: 10.1103/PhysRevB.59.1758Google Scholar34https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADyaK1MXkt12nug%253D%253D&md5=78a73e92a93f995982fc481715729b14From ultrasoft pseudopotentials to the projector augmented-wave methodKresse, G.; Joubert, D.Physical Review B: Condensed Matter and Materials Physics (1999), 59 (3), 1758-1775CODEN: PRBMDO; ISSN:0163-1829. (American Physical Society)The formal relationship between ultrasoft (US) Vanderbilt-type pseudopotentials and Blochl's projector augmented wave (PAW) method is derived. The total energy functional for US pseudopotentials can be obtained by linearization of two terms in a slightly modified PAW total energy functional. The Hamilton operator, the forces, and the stress tensor are derived for this modified PAW functional. A simple way to implement the PAW method in existing plane-wave codes supporting US pseudopotentials is pointed out. In addn., crit. tests are presented to compare the accuracy and efficiency of the PAW and the US pseudopotential method with relaxed-core all-electron methods. These tests include small mols. (H2, H2O, Li2, N2, F2, BF3, SiF4) and several bulk systems (diamond, Si, V, Li, Ca, CaF2, Fe, Co, Ni). Particular attention is paid to the bulk properties and magnetic energies of Fe, Co, and Ni.
- 35Perdew, J. P.; Burke, K.; Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett. 1996, 77, 3865, DOI: 10.1103/PhysRevLett.77.3865Google Scholar35https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADyaK28XmsVCgsbs%253D&md5=55943538406ee74f93aabdf882cd4630Generalized gradient approximation made simplePerdew, John P.; Burke, Kieron; Ernzerhof, MatthiasPhysical Review Letters (1996), 77 (18), 3865-3868CODEN: PRLTAO; ISSN:0031-9007. (American Physical Society)Generalized gradient approxns. (GGA's) for the exchange-correlation energy improve upon the local spin d. (LSD) description of atoms, mols., and solids. We present a simple derivation of a simple GGA, in which all parameters (other than those in LSD) are fundamental consts. Only general features of the detailed construction underlying the Perdew-Wang 1991 (PW91) GGA are invoked. Improvements over PW91 include an accurate description of the linear response of the uniform electron gas, correct behavior under uniform scaling, and a smoother potential.
Cited By
This article is cited by 1 publications.
- Xinxin Niu, Qian Zhang, Yanfeng Dang, Wenping Hu, Yajing Sun. MolPackL: Quantification and Interpretation of Intermolecular Interactions Driven by Molecular Packing. Journal of the American Chemical Society 2024, 146
(34)
, 24075-24084. https://doi.org/10.1021/jacs.4c08132
Article Views are the COUNTER-compliant sum of full text article downloads since November 2008 (both PDF and HTML) across all institutions and individuals. These metrics are regularly updated to reflect usage leading up to the last few days.
Citations are the number of other articles citing this article, calculated by Crossref and updated daily. Find more information about Crossref citation counts.
The Altmetric Attention Score is a quantitative measure of the attention that a research article has received online. Clicking on the donut icon will load a page at altmetric.com with additional details about the score and the social media presence for the given article. Find more information on the Altmetric Attention Score and how the score is calculated.
Recommended Articles
References
This article references 35 other publications.
- 1Ramprasad, R.; Batra, R.; Pilania, G.; Mannodi-Kanakkithodi, A.; Kim, C. Machine learning in materials informatics: recent applications and prospects. npj Comput. Mater. 2017, 3, 54, DOI: 10.1038/s41524-017-0056-5There is no corresponding record for this reference.
- 2Butler, K. T.; Davies, D. W.; Cartwright, H.; Isayev, O.; Walsh, A. Machine learning for molecular and materials science. Nature 2018, 559, 547– 555, DOI: 10.1038/s41586-018-0337-22https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXhtl2jt7vL&md5=13d36f27db8d59f558fe28e946b4b009Machine learning for molecular and materials scienceButler, Keith T.; Davies, Daniel W.; Cartwright, Hugh; Isayev, Olexandr; Walsh, AronNature (London, United Kingdom) (2018), 559 (7715), 547-555CODEN: NATUAS; ISSN:0028-0836. (Nature Research)Here we summarize recent progress in machine learning for the chem. sciences. We outline machine-learning techniques that are suitable for addressing research questions in this domain, as well as future directions for the field. We envisage a future in which the design, synthesis, characterization and application of mols. and materials is accelerated by artificial intelligence.
- 3Schmidt, J.; Marques, M. R.; Botti, S.; Marques, M. A. Recent advances and applications of machine learning in solid-state materials science. npj Comput. Mater. 2019, 5, 83, DOI: 10.1038/s41524-019-0221-0There is no corresponding record for this reference.
- 4Rogers, D.; Hahn, M. Extended-connectivity fingerprints. J. Chem. Inf. Model. 2010, 50, 742– 754, DOI: 10.1021/ci100050t4https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3cXlt1Onsbg%253D&md5=cd6c736cd7a3d280b67f5316acce8006Extended-Connectivity FingerprintsRogers, David; Hahn, MathewJournal of Chemical Information and Modeling (2010), 50 (5), 742-754CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)Extended-connectivity fingerprints (ECFPs) are a novel class of topol. fingerprints for mol. characterization. Historically, topol. fingerprints were developed for substructure and similarity searching. ECFPs were developed specifically for structure-activity modeling. ECFPs are circular fingerprints with a no. of useful qualities: they can be very rapidly calcd.; they are not predefined and can represent an essentially infinite no. of different mol. features (including stereochem. information); their features represent the presence of particular substructures, allowing easier interpretation of anal. results; and the ECFP algorithm can be tailored to generate different types of circular fingerprints, optimized for different uses. While the use of ECFPs has been widely adopted and validated, a description of their implementation has not previously been presented in the literature.
- 5Musil, F.; Grisafi, A.; Bartók, A. P.; Ortner, C.; Csányi, G.; Ceriotti, M. Physics-inspired structural representations for molecules and materials. Chem. Rev. 2021, 121, 9759– 9815, DOI: 10.1021/acs.chemrev.1c000215https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3MXhs1aisL3J&md5=b75f32ea20fa0d83fb028415f540e1f3Physics-inspired structural representations for molecules and materialsMusil, Felix; Grisafi, Andrea; Bartok, Albert P.; Ortner, Christoph; Csanyi, Gabor; Ceriotti, MicheleChemical Reviews (Washington, DC, United States) (2021), 121 (16), 9759-9815CODEN: CHREAY; ISSN:0009-2665. (American Chemical Society)A review. The first step in the construction of a regression model or a data-driven anal., aiming to predict or elucidate the relationship between the at. scale structure of matter and its properties, involves transforming the Cartesian coordinates of the atoms into a suitable representation. The development of at.-scale representations has played, and continues to play, a central role in the success of machine-learning methods for chem. and materials science. This review summarizes the current understanding of the nature and characteristics of the most commonly used structural and chem. descriptions of atomistic structures, highlighting the deep underlying connections between different frameworks, and the ideas that lead to computationally efficient and universally applicable models. It emphasizes the link between properties, structures, their phys. chem. and their math. description, provides examples of recent applications to a diverse set of chem. and materials science problems, and outlines the open questions and the most promising research directions in the field.
- 6Schütt, K. T.; Sauceda, H. E.; Kindermans, P. J.; Tkatchenko, A.; Müller, K. R. Schnet–a deep learning architecture for molecules and materials. J. Chem. Phys. 2018, 148, 241722 DOI: 10.1063/1.50197796https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXms1Ggurs%253D&md5=988638d520a423f529a16b35031243aaSchNet - A deep learning architecture for molecules and materialsSchuett, K. T.; Sauceda, H. E.; Kindermans, P.-J.; Tkatchenko, A.; Mueller, K.-R.Journal of Chemical Physics (2018), 148 (24), 241722/1-241722/11CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)Deep learning has led to a paradigm shift in artificial intelligence, including web, text, and image search, speech recognition, as well as bioinformatics, with growing impact in chem. physics. Machine learning, in general, and deep learning, in particular, are ideally suitable for representing quantum-mech. interactions, enabling us to model nonlinear potential-energy surfaces or enhancing the exploration of chem. compd. space. Here, we present the deep learning architecture SchNet that is specifically designed to model atomistic systems by making use of continuous-filter convolutional layers. We demonstrate the capabilities of SchNet by accurately predicting a range of properties across chem. space for mols. and materials, where our model learns chem. plausible embeddings of atom types across the periodic table. Finally, we employ SchNet to predict potential-energy surfaces and energy-conserving force fields for mol. dynamics simulations of small mols. and perform an exemplary study on the quantum-mech. properties of C20-fullerene that would have been infeasible with regular ab initio mol. dynamics. (c) 2018 American Institute of Physics.
- 7Xie, T.; Grossman, J. C. Crystal graph convolutional neural networks for an accurate and interpretable prediction of material properties. Phys. Rev. Lett. 2018, 120, 145301 DOI: 10.1103/PhysRevLett.120.1453017https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXltFSnu7c%253D&md5=93beb5675af86cf95e07c82c136f3511Crystal Graph Convolutional Neural Networks for an Accurate and Interpretable Prediction of Material PropertiesXie, Tian; Grossman, Jeffrey C.Physical Review Letters (2018), 120 (14), 145301CODEN: PRLTAO; ISSN:1079-7114. (American Physical Society)The use of machine learning methods for accelerating the design of cryst. materials usually requires manually constructed feature vectors or complex transformation of atom coordinates to input the crystal structure, which either constrains the model to certain crystal types or makes it difficult to provide chem. insights. Here, we develop a crystal graph convolutional neural networks framework to directly learn material properties from the connection of atoms in the crystal, providing a universal and interpretable representation of cryst. materials. Our method provides a highly accurate prediction of d. functional theory calcd. properties for eight different properties of crystals with various structure types and compns. after being trained with 104 data points. Further, our framework is interpretable because one can ext. the contributions from local chem. environments to global properties. Using an example of perovskites, we show how this information can be utilized to discover empirical rules for materials design.
- 8Chen, C.; Ye, W.; Zuo, Y.; Zheng, C.; Ong, S. P. Graph networks as a universal machine learning framework for molecules and crystals. Chem. Mater. 2019, 31, 3564– 3572, DOI: 10.1021/acs.chemmater.9b012948https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXntFaqt7g%253D&md5=c924547f17083428802d5698682d5971Graph Networks as a Universal Machine Learning Framework for Molecules and CrystalsChen, Chi; Ye, Weike; Zuo, Yunxing; Zheng, Chen; Ong, Shyue PingChemistry of Materials (2019), 31 (9), 3564-3572CODEN: CMATEX; ISSN:0897-4756. (American Chemical Society)Graph networks are a new machine learning (ML) paradigm that supports both relational reasoning and combinatorial generalization. Here, we develop universal MatErials Graph Network (MEGNet) models for accurate property prediction in both mols. and crystals. We demonstrate that the MEGNet models outperform prior ML models such as the SchNet in 11 out of 13 properties of the QM9 mol. data set. Similarly, we show that MEGNet models trained on ∼60 000 crystals in the Materials Project substantially outperform prior ML models in the prediction of the formation energies, band gaps, and elastic moduli of crystals, achieving better than d. functional theory accuracy over a much larger data set. We present two new strategies to address data limitations common in materials science and chem. First, we demonstrate a phys. intuitive approach to unify four sep. mol. MEGNet models for the internal energy at 0 K and room temp., enthalpy, and Gibbs free energy into a single free energy MEGNet model by incorporating the temp., pressure, and entropy as global state inputs. Second, we show that the learned element embeddings in MEGNet models encode periodic chem. trends and can be transfer-learned from a property model trained on a larger data set (formation energies) to improve property models with smaller amts. of data (band gaps and elastic moduli).
- 9Jiang, Y.; Yang, Z.; Guo, J.; Li, H.; Liu, Y.; Guo, Y.; Li, M.; Pu, X. Coupling complementary strategy to flexible graph neural network for quick discovery of coformer in diverse co-crystal materials. Nat. Commun. 2021, 12, 5950 DOI: 10.1038/s41467-021-26226-79https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3MXit1aqsbjE&md5=e877fd347df52869466130c176db61ecCoupling complementary strategy to flexible graph neural network for quick discovery of coformer in diverse co-crystal materialsJiang, Yuanyuan; Yang, Zongwei; Guo, Jiali; Li, Hongzhen; Liu, Yijing; Guo, Yanzhi; Li, Menglong; Pu, XuemeiNature Communications (2021), 12 (1), 5950CODEN: NCAOBW; ISSN:2041-1723. (Nature Research)Cocrystal engineering have been widely applied in pharmaceutical, chem. and material fields. However, how to effectively choose coformer has been a challenging task on expts. Here we develop a graph neural network (GNN) based deep learning framework to quickly predict formation of the cocrystal. In order to capture main driving force to crystn. from 6819 pos. and 1052 neg. samples reported by expts., a feasible GNN framework is explored to integrate important prior knowledge into end-to-end learning on the mol. graph. The model is strongly validated against seven competitive models and three challenging independent test sets involving pharmaceutical cocrystals, π-π cocrystals and energetic cocrystals, exhibiting superior performance with accuracy higher than 96%, confirming its robustness and generalization. Furthermore, one new energetic cocrystal predicted is successfully synthesized, showcasing high potential of the model in practice. All the data and source codes are available at https://github.com/Saoge123/ccgnet for aiding cocrystal community.
- 10Park, C. W.; Wolverton, C. Developing an improved crystal graph convolutional neural network framework for accelerated materials discovery. Phys. Rev. Mater. 2020, 4, 063801 DOI: 10.1103/PhysRevMaterials.4.06380110https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3cXhsFKis7fP&md5=0b5903b05d47535d7c2967b7d9d6fc6bDeveloping an improved crystal graph convolutional neural network framework for accelerated materials discoveryPark, Cheol Woo; Wolverton, ChrisPhysical Review Materials (2020), 4 (6), 063801CODEN: PRMHBS; ISSN:2475-9953. (American Physical Society)The recently proposed crystal graph convolutional neural network (CGCNN) offers a highly versatile and accurate machine learning (ML) framework by learning material properties directly from graphlike representations of crystal structures ("crystal graphs"). Here, we develop an improved variant of the CGCNN model (iCGCNN) that outperforms the original by incorporating information of the Voronoi tessellated crystal structure, explicit three-body correlations of neighboring constituent atoms, and an optimized chem. representation of interat. bonds in the crystal graphs. We demonstrate the accuracy of the improved framework in two distinct illustrations: First, when trained/validated on 180 000/20 000 d. functional theory (DFT) calcd. thermodn. stability entries taken from the Open Quantum Materials Database (OQMD) and evaluated on a sep. test set of 230 000 entries, iCGCNN achieves a predictive accuracy that is significantly improved, i.e., 20% higher than that of the original CGCNN. Second, when used to assist a high-throughput search for materials in the ThCr2Si2 structure-type, iCGCNN exhibited a success rate of 31% which is 155 times higher than an undirected high-throughput search and 2.4 times higher than that of the original CGCNN. Using both CGCNN and iCGCNN, we screened 132 600 compds. with elemental decorations of the ThCr2Si2 prototype crystal structure and identified a total of 97 unique stable compds. by performing 757 DFT calcns., accelerating the computational time of the high-throughput search by a factor of 65. Our results suggest that the iCGCNN can be used to accelerate high-throughput discoveries of new materials by quickly and accurately identifying cryst. compds. with properties of interest.
- 11Louis, S. Y.; Zhao, Y.; Nasiri, A.; Wang, X.; Song, Y.; Liu, F.; Hu, J. Graph convolutional neural networks with global attention for improved materials property prediction. Phys. Chem. Chem. Phys. 2020, 22, 18141– 18148, DOI: 10.1039/D0CP01474E11https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3cXhsFSmu7bL&md5=9190661d65053bc86d36c270be20cf0dGraph convolutional neural networks with global attention for improved materials property predictionLouis, Steph-Yves; Zhao, Yong; Nasiri, Alireza; Wang, Xiran; Song, Yuqi; Liu, Fei; Hu, JianjunPhysical Chemistry Chemical Physics (2020), 22 (32), 18141-18148CODEN: PPCPFQ; ISSN:1463-9076. (Royal Society of Chemistry)The development of an efficient and powerful machine learning (ML) model for materials property prediction (MPP) remains an important challenge in materials science. While various techniques have been proposed to ext. physicochem. features in MPP, graph neural networks (GNN) have also shown very strong capability in capturing effective features for high-performance MPP. Nevertheless, current GNN models do not effectively differentiate the contributions from different atoms. In this paper we develop a novel graph neural network model called GATGNN for predicting properties of inorg. materials. GATGNN is characterized by its compn. of augmented graph-attention layers (AGAT) and a global attention layer. The application of AGAT layers and global attention layers resp. learn the local relationship among neighboring atoms and overall contribution of the atoms to the material's property; together making our framework achieve considerably better prediction performance on various tested properties. Through extensive expts., we show that our method is able to outperform existing state-of-the-art GNN models while it can also provide a measurable insight into the correlation between the atoms and their material property.
- 12Cheng, J.; Zhang, C.; Dong, L. A geometric-information-enhanced crystal graph network for predicting properties of materials. Commun. Mater. 2021, 2, 92, DOI: 10.1038/s43246-021-00194-312https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB38XotVCku78%253D&md5=44ae02fdbbde3d1eef3d2e2fe77fb860A geometric-information-enhanced crystal graph network for predicting properties of materialsCheng, Jiucheng; Zhang, Chunkai; Dong, LifengCommunications Materials (2021), 2 (1), 92CODEN: CMOAGE; ISSN:2662-4443. (Nature Portfolio)Graph neural networks (GNNs) have been used previously for identifying new cryst. materials. However, geometric structure is not usually taken into consideration, or only partially. Here, we develop a geometric-information-enhanced crystal graph neural network (GeoCGNN) to predict the properties of cryst. materials. By considering the distance vector between each node and its neighbors, our model can learn full topol. and spatial geometric structure information. Furthermore, we incorporate an effective method based on the mixed basis functions to encode the geometric information into our model, which outperforms other GNN methods in a variety of databases. For example, for predicting formation energy our model is 25.6%, 14.3% and 35.7% more accurate than CGCNN, MEGNet and iCGCNN models, resp. For band gap, our model outperforms CGCNN by 27.6% and MEGNet by 12.4%.
- 13Choudhary, K.; DeCost, B. Atomistic line graph neural network for improved materials property predictions. npj Comput. Mater. 2021, 7, 185, DOI: 10.1038/s41524-021-00650-1There is no corresponding record for this reference.
- 14Pandey, S.; Qu, J.; Stevanović, V.; John, P. S.; Gorai, P. Predicting energy and stability of known and hypothetical crystals using graph neural network. Patterns 2021, 2, 100361 DOI: 10.1016/j.patter.2021.10036114https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A280%3ADC%252BB2cfntlejtA%253D%253D&md5=fc71858fe7691e1e930bf632ef025975Predicting energy and stability of known and hypothetical crystals using graph neural networkPandey Shubham; Stevanovic Vladan; Gorai Prashun; Qu Jiaxing; St John PeterPatterns (New York, N.Y.) (2021), 2 (11), 100361 ISSN:.The discovery of new inorganic materials in unexplored chemical spaces necessitates calculating total energy quickly and with sufficient accuracy. Machine learning models that provide such a capability for both ground-state (GS) and higher-energy structures would be instrumental in accelerated screening. Here, we demonstrate the importance of a balanced training dataset of GS and higher-energy structures to accurately predict total energies using a generic graph neural network architecture. Using [Formula: see text] 16,500 density functional theory calculations from the National Renewable Energy Laboratory (NREL) Materials Database and [Formula: see text] 11,000 calculations for hypothetical structures as our training database, we demonstrate that our model satisfactorily ranks the structures in the correct order of total energies for a given composition. Furthermore, we present a thorough error analysis to explain failure modes of the model, including both prediction outliers and occasional inconsistencies in the training data. By examining intermediate layers of the model, we analyze how the model represents learned structures and properties.
- 15Frey, N. C.; Akinwande, D.; Jariwala, D.; Shenoy, V. B. Machine learning-enabled design of point defects in 2d materials for quantum and neuromorphic information processing. ACS Nano 2020, 14, 13406– 13417, DOI: 10.1021/acsnano.0c0526715https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3cXhslKqtLfF&md5=1ade77df47fb8faf2dbe58746ccb4788Machine Learning-Enabled Design of Point Defects in 2D Materials for Quantum and Neuromorphic Information ProcessingFrey, Nathan C.; Akinwande, Deji; Jariwala, Deep; Shenoy, Vivek B.ACS Nano (2020), 14 (10), 13406-13417CODEN: ANCAC3; ISSN:1936-0851. (American Chemical Society)Engineered point defects in two-dimensional (2D) materials offer an attractive platform for solid-state devices that exploit tailored optoelectronic, quantum emission, and resistive properties. Naturally occurring defects are also unavoidably important contributors to material properties and performance. The immense variety and complexity of possible defects make it challenging to exptl. control, probe, or understand at.-scale defect-property relationships. Here, we develop an approach based on deep transfer learning, machine learning, and first-principles calcns. to rapidly predict key properties of point defects in 2D materials. We use physics-informed featurization to generate a minimal description of defect structures and present a general picture of defects across materials systems. We identify over one hundred promising, unexplored dopant defect structures in layered metal chalcogenides, hexagonal nitrides, and metal halides. These defects are prime candidates for quantum emission, resistive switching, and neuromorphic computing.
- 16Fung, V.; Zhang, J.; Juarez, E.; Sumpter, B. G. Benchmarking graph neural networks for materials chemistry. npj Comput. Mater. 2021, 7, 84, DOI: 10.1038/s41524-021-00554-016https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3MXhvFehs7zM&md5=e49c3e0b3e69dda5e5b41a33e3eba307Benchmarking graph neural networks for materials chemistryFung, Victor; Zhang, Jiaxin; Juarez, Eric; Sumpter, Bobby G.npj Computational Materials (2021), 7 (1), 84CODEN: NCMPCS; ISSN:2057-3960. (Nature Research)Graph neural networks (GNNs) have received intense interest as a rapidly expanding class of machine learning models remarkably well-suited for materials applications. To date, a no. of successful GNNs have been proposed and demonstrated for systems ranging from crystal stability to electronic property prediction and to surface chem. and heterogeneous catalysis. However, a consistent benchmark of these models remains lacking, hindering the development and consistent evaluation of new models in the materials field. Here, we present a workflow and testing platform, MatDeepLearn, for quickly and reproducibly assessing and comparing GNNs and other machine learning models. We use this platform to optimize and evaluate a selection of top performing GNNs on several representative datasets in computational materials chem. From our investigations we note the importance of hyperparameter selection and find roughly similar performances for the top models once optimized. We identify several strengths in GNNs over conventional models in cases with compositionally diverse datasets and in its overall flexibility with respect to inputs, due to learned rather than defined representations. Meanwhile several weaknesses of GNNs are also obsd. including high data requirements, and suggestions for further improvement for applications in materials chem. are discussed.
- 17Jalali, M.; Tsotsalas, M.; Wöll, C. MOFSocialNet: Exploiting metal-organic framework relationships via social network analysis. Nanomaterials 2022, 12, 704, DOI: 10.3390/nano1204070417https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB38XlslWmtLo%253D&md5=1bb836c9d57b2127944c9e1b8a45050eMOFSocialNet: Exploiting Metal-Organic Framework Relationships via Social Network AnalysisJalali, Mehrdad; Tsotsalas, Manuel; Woell, ChristofNanomaterials (2022), 12 (4), 704CODEN: NANOKO; ISSN:2079-4991. (MDPI AG)The no. of metal-org. frameworks (MOF) as well as the no. of applications of this material are growing rapidly. With the no. of characterized compds. exceeding 100,000, manual sorting becomes impossible. At the same time, the increasing computer power and established use of automated machine learning approaches makes data science tools available, that provide an overview of the MOF chem. space and support the selection of suitable MOFs for a desired application. Among the different data science tools, graph theory approaches, where data generated from numerous real-world applications is represented as a graph (network) of interconnected objects, has been widely used in a variety of scientific fields such as social sciences, health informatics, biol. sciences, agricultural sciences and economics. We describe the application of a particular graph theory approach known as social network anal. to MOF materials and highlight the importance of community (group) detection and graph node centrality. In this first application of the social network anal. approach to MOF chem. space, we created MOFSocialNet. This social network is based on the geometrical descriptors of MOFs available in the CoRE-MOFs database. MOFSocialNet can discover communities with similar MOFs structures and identify the most representative MOFs within a given community. In addn., anal. of MOFSocialNet using social network anal. methods can predict MOF properties more accurately than conventional ML tools. The latter advantage is demonstrated for the prediction of gas storage properties, the most important property of these porous reticular networks.
- 18Chen, C.; Ong, S. P. A universal graph deep learning interatomic potential for the periodic table. Nat. Comput. Sci. 2022, 2, 718– 728, DOI: 10.1038/s43588-022-00349-3There is no corresponding record for this reference.https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=&md5=9874b665cc7a056b8e2f928dd3112440
- 19Egorova, O.; Hafizi, R.; Woods, D. C.; Day, G. M. Multifidelity statistical machine learning for molecular crystal structure prediction. J. Phys. Chem. A 2020, 124, 8065– 8078, DOI: 10.1021/acs.jpca.0c0500619https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3cXhslequ7jL&md5=15ef21ac6c7aafa85f5e4fc3bb3e0fceMultifidelity Statistical Machine Learning for Molecular Crystal Structure PredictionEgorova, Olga; Hafizi, Roohollah; Woods, David C.; Day, Graeme M.Journal of Physical Chemistry A (2020), 124 (39), 8065-8078CODEN: JPCAFH; ISSN:1089-5639. (American Chemical Society)Statistical machine learning was applied to predict expensive hybrid functional DFT (PBE0) calcns. of crystal structures using a multifidelity approach to reevaluate the energies of crystal structures predicted with an inexpensive force field. The method uses an autoregressive Gaussian process, making use of less expensive GGA DFT (PBE) calcns. to bridge the gap between the force field and PBE0 energies. The method is benchmarked on the crystal structure landscapes of 3 small, H-bonded org. mols. and produces accurate predictions of energies and crystal structure ranking using small nos. of the most expensive calcns.; the PBE0 energies can be predicted with errors of <1 kJ mol-1 with between 4.2 and 6.8% of the cost of the full calcns. As the developed model is probabilistic, how the uncertainties in predicted energies impact the assessment of the energetic ranking of crystal structures are discussed.
- 20Han, Y.; Ali, I.; Wang, Z.; Cai, J.; Wu, S.; Tang, J.; Zhang, L.; Ren, J.; Xiao, R.; Lu, Q.; Hang, L. Machine learning accelerates quantum mechanics predictions of molecular crystals. Phys. Rep. 2021, 934, 1– 71, DOI: 10.1016/j.physrep.2021.08.00220https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3MXhvFWiurzI&md5=829b4c028e46feb7be228b2062790582Machine learning accelerates quantum mechanics predictions of molecular crystalsHan, Yanqiang; Ali, Imran; Wang, Zhilong; Cai, Junfei; Wu, Sicheng; Tang, Jiequn; Zhang, Lin; Ren, Jiahao; Xiao, Rui; Lu, Qianqian; Hang, Lei; Luo, Hongyuan; Li, JinjinPhysics Reports (2021), 934 (), 1-71CODEN: PRPLCM; ISSN:0370-1573. (Elsevier B.V.)A review. Quantum mechanics (QM) approaches (DFT, MP2, CCSD(T), etc.) play an important role in calcg. mols. and crystals with a high accuracy and acceptable efficiency. In recent years, with the development of artificial intelligence technol., machine learning (ML) has played an increasingly essential role in accelerating the QM calcns. and predictions of mol. crystals, as well as the discovery of novel materials. This review provides state-of-the-art information and prospects for QM theories, fragment-based methods and ML methods, as well as their up-to-date applications in predicting small inorg. mols., large drug mols. and relevant mol. crystals. The discussed applications include ML potential energy surface (PES) construction, crystal structure prediction (CSP), chem. reaction prediction and predictions of a series of properties, such as structure, energy, at. force, bond length, chem. shift, supercond., super-hardness, vibrational spectra, phase transition and diagram. This work also reviews software and packages built recently based on ML methods for property predictions and PES constructions in the field of physics and chem. For the three discussed methods, the most time-consuming one is the high-level all-atom QM method, which is capable of describing electronic structures with high accuracy and thus predicts properties that are consistent with the exptl. results. The second one, fragment-based QM method, requires less computational time than all-atom QM, which can accelerate all-atom QM calcns. for large systems by dividing the entire system into subsystems, presenting a considerable efficiency increase. The computational complexities for fragment-based QM and all-atom QM are N - N2 and N5-N7 (N is the size of the system), resp. A well-trained ML model can make the above predictions within seconds while ensuring a high prediction accuracy, where its prediction cost and accuracy are detd. by the training data and the training process. Therefore, it is challenging for ML applications in physics and chem. to generate highly accurate and powerful ML models while ensuring sufficient datasets. This work not only provides an overview of the recent progress in QM theories, fragment-based methods, ML methods and several ML-based software programs and applications on small inorg. mols., large drug mols. and relevant crystals, but also shed light on ML methods in accelerating QM prediction, optimization and novel crystal material design.
- 21Ishizaki, K.; Sugimoto, R.; Hagiwara, Y.; Koshima, H.; Taniguchi, T.; Asahi, T. Actuation performance of a photo-bending crystal modeled by machine learning-based regression. CrystEngComm 2021, 23, 5839– 5847, DOI: 10.1039/D1CE00208B21https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3MXmtVSqu7k%253D&md5=3ac98d93899abba3690d8a0f9114e0b6Actuation performance of a photo-bending crystal modeled by machine learning-based regressionIshizaki, Kazuki; Sugimoto, Ryota; Hagiwara, Yuki; Koshima, Hideko; Taniguchi, Takuya; Asahi, ToruCrystEngComm (2021), 23 (34), 5839-5847CODEN: CRECF4; ISSN:1466-8033. (Royal Society of Chemistry)Photomech. mol. crystals are a family of mech. responsive materials that have been recognized as a novel type of actuator. The actuation properties of photomech. crystals should be characterized. However, deflection and force, which are crucial for actuators, are dependent on exptl. conditions such as light intensity and crystal size, and thus the no. of combinations under different conditions is infinite. This causes difficulty in obtaining the relationship between the exptl. conditions and actuation outputs. To solve this problem, this paper presents a machine learning-based regression approach for modeling the deflection and force of a photo-bending crystal. The deflection and force were exptl. measured for various light intensities and crystal sizes. These time-series data of deflection and force were represented by feature values via exponential fitting for bending and unbending processes. The features of the max. value and speed of deflection and force were analyzed by polynomial regression and variable selection. Through this process, the most fitted models were constructed for deflection and force and most of them were interpreted by material mechanics. This statistical strategy will potentially be applied to control or optimize other functions of stimuli-responsive crystals.
- 22Groom, C. R.; Bruno, I. J.; Lightfoot, M. P.; Ward, S. C. The Cambridge structural database. Acta Crystallogr., Sect. B 2016, 72, 171– 179, DOI: 10.1107/S205252061600395422https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC28Xls1Kntro%253D&md5=f9c65ab86fc9db429588c95b0da3f9b2The Cambridge Structural DatabaseGroom, Colin R.; Bruno, Ian J.; Lightfoot, Matthew P.; Ward, Suzanna C.Acta Crystallographica, Section B: Structural Science, Crystal Engineering and Materials (2016), 72 (2), 171-179CODEN: ACSBDA; ISSN:2052-5206. (International Union of Crystallography)The Cambridge Structural Database (CSD) contains a complete record of all published org. and metal-org. small-mol. crystal structures. The database has been in operation for over 50 years and continues to be the primary means of sharing structural chem. data and knowledge across disciplines. As well as structures that are made public to support scientific articles, it includes many structures published directly as CSD Communications. All structures are processed both computationally and by expert structural chem. editors prior to entering the database. A key component of this processing is the reliable assocn. of the chem. identity of the structure studied with the exptl. data. This important step helps ensure that data is widely discoverable and readily reusable. Content is further enriched through selective inclusion of addnl. exptl. data. Entries are available to anyone through free CSD community web services. Linking services developed and maintained by the CCDC, combined with the use of std. identifiers, facilitate discovery from other resources. Data can also be accessed through CCDC and third party software applications and through an application programming interface.
- 23Gražulis, S.; Daškevič, A.; Merkys, A.; Chateigner, D.; Lutterotti, L.; Quirós, M.; Serebryanaya, N. R.; Moeck, P.; Downs, R. T.; Bail, A. L. Crystallography Open Database (COD): an open-access collection of crystal structures and platform for world-wide collaboration. Nucleic Acids Res. 2012, 40, D420– D427, DOI: 10.1093/nar/gkr90023https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3MXhs12htbnP&md5=670b6cc1f2758728c303c4ad89e1cc6fCrystallography Open Database (COD): an open-access collection of crystal structures and platform for world-wide collaborationGrazulis, Saulius; Daskevic, Adriana; Merkys, Andrius; Chateigner, Daniel; Lutterotti, Luca; Quiros, Miguel; Serebryanaya, Nadezhda R.; Moeck, Peter; Downs, Robert T.; Le Bail, ArmelNucleic Acids Research (2012), 40 (D1), D420-D427CODEN: NARHAD; ISSN:0305-1048. (Oxford University Press)Using an open-access distribution model, the Crystallog. Open Database (COD, http://www.crystallog.net) collects all known small mol. / small to medium sized unit cell' crystal structures and makes them available freely on the Internet. As of today, the COD has aggregated ∼150 000 structures, offering basic search capabilities and the possibility to download the whole database, or parts thereof using a variety of std. open communication protocols. A newly developed website provides capabilities for all registered users to deposit published and so far unpublished structures as personal communications or pre-publication depositions. Such a setup enables extension of the COD database by many users simultaneously. This increases the possibilities for growth of the COD database, and is the first step towards establishing a world wide Internet-based collaborative platform dedicated to the collection and curation of structural knowledge.
- 24Olsthoorn, B.; Geilhufe, R. M.; Borysov, S. S.; Balatsky, A. V. Band gap prediction for large organic crystal structures with machine learning. Adv. Quantum Technol. 2019, 2, 1900023 DOI: 10.1002/qute.201900023There is no corresponding record for this reference.
- 25Bartók, A. P.; Kondor, R.; Csányi, G. On representing chemical environments. Phys. Rev. B 2013, 87, 184115 DOI: 10.1103/PhysRevB.87.18411525https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3sXpvFClu7Y%253D&md5=f7739275562b8e77d4532f00da8814fbOn representing chemical environmentsBartok, Albert P.; Kondor, Risi; Csanyi, GaborPhysical Review B: Condensed Matter and Materials Physics (2013), 87 (18), 184115/1-184115/16CODEN: PRBMDO; ISSN:1098-0121. (American Physical Society)We review some recently published methods to represent at. neighborhood environments, and analyze their relative merits in terms of their faithfulness and suitability for fitting potential energy surfaces. The crucial properties that such representations (sometimes called descriptors) must have are differentiability with respect to moving the atoms and invariance to the basic symmetries of physics: rotation, reflection, translation, and permutation of atoms of the same species. We demonstrate that certain widely used descriptors that initially look quite different are specific cases of a general approach, in which a finite set of basis functions with increasing angular wave nos. are used to expand the at. neighborhood d. function. Using the example system of small clusters, we quant. show that this expansion needs to be carried to higher and higher wave nos. as the no. of neighbors increases in order to obtain a faithful representation, and that variants of the descriptors converge at very different rates. We also propose an altogether different approach, called Smooth Overlap of Atomic Positions, that sidesteps these difficulties by directly defining the similarity between any two neighborhood environments, and show that it is still closely connected to the invariant descriptors. We test the performance of the various representations by fitting models to the potential energy surface of small silicon clusters and the bulk crystal.
- 26Wengert, S.; Csányi, G.; Reuter, K.; Margraf, J. T. Data-efficient machine learning for molecular crystal structure prediction. Chem. Sci. 2021, 12, 4536– 4546, DOI: 10.1039/D0SC05765G26https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3MXjslWrtr4%253D&md5=b168772ffa3e022a39d1e57a954804ffData-efficient machine learning for molecular crystal structure predictionWengert, Simon; Csanyi, Gabor; Reuter, Karsten; Margraf, Johannes T.Chemical Science (2021), 12 (12), 4536-4546CODEN: CSHCCN; ISSN:2041-6520. (Royal Society of Chemistry)The combination of modern machine learning (ML) approaches with high-quality data from quantum mech. (QM) calcns. can yield models with an unrivalled accuracy/cost ratio. However, such methods are ultimately limited by the computational effort required to produce the ref. data. In particular, ref. calcns. for periodic systems with many atoms can become prohibitively expensive for higher levels of theory. This trade-off is crit. in the context of org. crystal structure prediction (CSP). Here, a data-efficient ML approach would be highly desirable, since screening a huge space of possible polymorphs in a narrow energy range requires the assessment of a large no. of trial structures with high accuracy. In this contribution, we present tailored Δ-ML models that allow screening a wide range of crystal candidates while adequately describing the subtle interplay between intermol. interactions such as H-bonding and many-body dispersion effects. This is achieved by enhancing a physics-based description of long-range interactions at the d. functional tight binding (DFTB) level-for which an efficient implementation is available-with a short-range ML model trained on high-quality first-principles ref. data. The presented workflow is broadly applicable to different mol. materials, without the need for a single periodic calcn. at the ref. level of theory. We show that this even allows the use of wavefunction methods in CSP.
- 27Takagi, D.; Ishizaki, K.; Asahi, T.; Taniguchi, T. Molecular screening for solid–solid phase transition by machine learning. Digital Discovery 2023, 2, 1126, DOI: 10.1039/D3DD00034F27https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3sXhvVygurvF&md5=027166e2d42022ecf59c4e8d6846490dMolecular screening for solid-solid phase transitions by machine learningTakagi, Daisuke; Ishizaki, Kazuki; Asahi, Toru; Taniguchi, TakuyaDigital Discovery (2023), 2 (4), 1126-1133CODEN: DDIIAI; ISSN:2635-098X. (Royal Society of Chemistry)The solid-solid phase transition in mol. crystals is generally found by chance empirically. In this study, we constructed a machine learning framework to screen mols. that will exhibit solid-solid phase transitions in their cryst. states, based on pos.-unlabeled learning. We trained classification models using the pos. dataset we constructed manually and the unlabeled data extd. from the Cambridge Structural Database. The best classifier works as a suggester, and 9 substances among the suggested 113 mols. were found to exhibit solid-solid phase transitions according to the literature and expts. The finding probability of 8.0% is much higher than the probability of phase transition in the database, suggesting the effectiveness of mol. selection by this workflow. We also found that the mol. structure is weakly related to the transition temp. by regression anal. The findings of this study are useful for designing functional mol. crystals with solid-solid phase transitions.
- 28Musil, F.; De, S.; Yang, J.; Campbell, J. E.; Day, G. M.; Ceriotti, M. Machine learning for the structure–energy–property landscapes of molecular crystals. Chem. Sci. 2018, 9, 1289– 1300, DOI: 10.1039/C7SC04665K28https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2sXhvFGnsbfE&md5=48b184722478d1a988a2090f765135e2Machine learning for the structure-energy-property landscapes of molecular crystalsMusil, Felix; De, Sandip; Yang, Jack; Campbell, Joshua E.; Day, Graeme M.; Ceriotti, MicheleChemical Science (2018), 9 (5), 1289-1300CODEN: CSHCCN; ISSN:2041-6520. (Royal Society of Chemistry)Mol. crystals play an important role in several fields of science and technol. They frequently crystallize in different polymorphs with substantially different phys. properties. To help guide the synthesis of candidate materials, at.-scale modeling can be used to enumerate the stable polymorphs and to predict their properties, as well as to propose heuristic rules to rationalize the correlations between crystal structure and materials properties. Here we show how a recently-developed machine-learning (ML) framework can be used to achieve inexpensive and accurate predictions of the stability and properties of polymorphs, and a data-driven classification that is less biased and more flexible than typical heuristic rules. We discuss, as examples, the lattice energy and property landscapes of pentacene and two azapentacene isomers that are of interest as org. semiconductor materials. We show that we can est. force field or DFT lattice energies with sub-kJ mol-1 accuracy, using only a few hundred ref. configurations, and reduce by a factor of ten the computational effort needed to predict charge mobility in the crystal structures. The automatic structural classification of the polymorphs reveals a more detailed picture of mol. packing than that provided by conventional heuristics, and helps disentangle the role of hydrogen bonded and π-stacking interactions in detg. mol. self-assembly. This observation demonstrates that ML is not just a black-box scheme to interpolate between ref. calcns., but can also be used as a tool to gain intuitive insights into structure-property relations in mol. crystal engineering.
- 29Quirós, M.; Gražulis, S.; Girdzijauskaitė, S.; Merkys, A.; Vaitkus, A. Using SMILES strings for the description of chemical connectivity in the Crystallography Open Database. J. Cheminf. 2018, 10, 23, DOI: 10.1186/s13321-018-0279-629https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXmtFWiurk%253D&md5=99cd2cb8206426aae7de48ea932fe169Using SMILES strings for the description of chemical connectivity in the Crystallography Open DatabaseQuiros, Miguel; Grazulis, Saulius; Girdzijauskaite, Saule; Merkys, Andrius; Vaitkus, AntanasJournal of Cheminformatics (2018), 10 (), 23/1-23/17CODEN: JCOHB3; ISSN:1758-2946. (Chemistry Central Ltd.)Computer descriptions of chem. mol. connectivity are necessary for searching chem. databases and for predicting chem. properties from mol. structure. In this article, the ongoing work to describe the chem. connectivity of entries contained in the Crystallog. Open Database (COD) in SMILES format is reported. This collection of SMILES is publicly available for chem. (substructure) search or for any other purpose on an open-access basis, as is the COD itself. The conventions that have been followed for the representation of compds. that do not fit into the valence bond theory are outlined for the most frequently found cases. The procedure for getting the SMILES out of the CIF files starts with checking whether the atoms in the asym. unit are a chem. acceptable image of the compd. When they are not (mol. in a symmetry element, disorder, polymeric species, etc.), the previously published cif_mol. program is used to get such image in many cases. The program package Open Babel is then applied to get SMILES strings from the CIF files (either those directly taken from the COD or those produced by cif_mol. when applicable). The results are then checked and/or fixed by a human editor, in a computer-aided task that at present still consumes a great deal of human time. Even if the procedure still needs to be improved to make it more automatic (and hence faster), it has already yielded more than 160,000 curated chem. structures and the purpose of this article is to announce the existence of this work to the chem. community as well as to spread the use of its results.
- 30Feng, X.; Becke, A. D.; Johnson, E. R. Theoretical investigation of polymorph-and coformer-dependent photoluminescence in molecular crystals. CrystEngComm 2021, 23, 4264– 4271, DOI: 10.1039/D1CE00383F30https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3MXhtFSkurrK&md5=4789ac696c9e43357bc15af8988abfedTheoretical investigation of polymorph- and coformer-dependent photoluminescence in molecular crystalsFeng, Xibo; Becke, Axel D.; Johnson, Erin R.CrystEngComm (2021), 23 (24), 4264-4271CODEN: CRECF4; ISSN:1466-8033. (Royal Society of Chemistry)Polymorph- and coformer-dependent photoluminescence (PL) are among the variety of novel solid-state PL phenomena recently obsd. in many mol. crystals. They are of particular research interest due to their direct connections to two heavily investigated topics in crystal engineering: polymorphism and cocrystn. Herein, we apply a novel computational methodol., initially proposed and successfully applied in our previous investigation of piezochromism, to theor. modeling of the polymorph- and coformer-dependent PL in the well-known ROY polymorphs and the recently synthesized 9-acetylanthracene (9-ACA) cocrystals, resp. Our methodol. offers satisfactory prediction of the exptl. obsd. color zoning for the ROY polymorphs and provides good qual. and quant. accuracy for the emission (fluorescence) energies of the 9-ACA cocrystals, although the results in both cases may be adversely affected by delocalization error in the d.-functional methods employed. While the polymorph-dependent PL in ROY is found to be controlled by the intramol. geometry, modeling of the periodic crystal environment is necessary for accurate prediction of the coformer-dependent PL in the 9-ACA cocrystals, which is driven by charge transfer.
- 31Aziz, A.; Sidat, A.; Talati, P.; Crespo-Otero, R. Understanding the solid state luminescence and piezochromic properties in polymorphs of an anthracene derivative. Phys. Chem. Chem. Phys. 2022, 24, 2832– 2842, DOI: 10.1039/D1CP05192J31https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB38XhsF2ls7g%253D&md5=7bb4c15ceff7e88447a96420c707f5a2Understanding the solid state luminescence and piezochromic properties in polymorphs of an anthracene derivativeAziz, Alex; Sidat, Amir; Talati, Priyesh; Crespo-Otero, RachelPhysical Chemistry Chemical Physics (2022), 24 (5), 2832-2842CODEN: PPCPFQ; ISSN:1463-9076. (Royal Society of Chemistry)Luminescent mol. crystals have gained significant research interest for optoelectronic applications. However, fully understanding their structural and electronic relationships in the condensed phase and under external stimuli remains a significant challenge. Here, piezochromism in the mol. crystal 9,10-bis((E)-2-(pyridin-4-yl)vinyl)anthracene (BP4VA) is studied using a combination of d. functional theory (DFT) and time-dependent TD-DFT. We investigate the effects that mol. packing and geometry have on the electronic and phonon structure and the excited state properties in this archetypal system. We find that the luminescence properties are red-shifted with the transition from a herringbone to a sheet packing arrangement. An almost continuous red-shift in the band gap is found with the application of an external pressure through the enhancement of π-π and CH-π interactions, and is a mechanism in fine tuning an emissive response. The anal. of the phonon structure of the mol. crystal suggests restriction of motion in the herringbone packing arrangement, with motion restricted at higher pressure. This is supported by the Huang-Rhys factors which show a decrease in the reorganisation energy with the application of pressure. Ultimately, a balance between the decrease in reorganisation energies and the increase in exciton coupling will det. whether nonradiative decay is enhanced or decreased with the increase in pressure in these systems.
- 32Odom, S. A.; Caruso, M. M.; Finke, A. D.; Prokup, A. M.; Ritchey, J. A.; Leonard, J. H.; White, S. R.; Sottos, N. R.; Moore, J. S. Restoration of conductivity with TTF-TCNQ charge-transfer salts. Adv. Funct. Mater. 2010, 20, 1721– 1727, DOI: 10.1002/adfm.20100015932https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3cXntVyju7Y%253D&md5=a590d7aeaab369f76599fcd14cf8ab70Restoration of Conductivity with TTF-TCNQ Charge-Transfer SaltsOdom, Susan A.; Caruso, Mary M.; Finke, Aaron D.; Prokup, Alex M.; Ritchey, Joshua A.; Leonard, John H.; White, Scott R.; Sottos, Nancy R.; Moore, Jeffrey S.Advanced Functional Materials (2010), 20 (11), 1721-1727CODEN: AFMDC6; ISSN:1616-301X. (Wiley-VCH Verlag GmbH & Co. KGaA)The formation of the conductive TTF-TCNQ (tetrathiafulvalene-tetracyanoquinodimethane) charge-transfer salt via rupture of microencapsulated solns. of its individual components is reported. Solns. of TTF and TCNQ in various solvents are sep. incorporated into poly(urea-formaldehyde) core-shell microcapsules. Rupture of a mixt. of TTF-contg. microcapsules and TCNQ-contg. microcapsules gave the cryst. salt, as verified by FTIR spectroscopy and powder x-ray diffraction. Preliminary measurements demonstrate the partial restoration of cond. of severed gold electrodes in the presence of TTF-TCNQ derived in situ. This is the 1st microcapsule system for the restoration of cond. in mech. damaged electronic devices in which the repairing agent is not conductive until its release.
- 33Blöchl, P. E. Projector augmented-wave method. Phys. Rev. B 1994, 50, 17953, DOI: 10.1103/PhysRevB.50.1795333https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A280%3ADC%252BC2sfjslSntA%253D%253D&md5=1853d67af808af2edab58beaab5d3051Projector augmented-wave methodBlochlPhysical review. B, Condensed matter (1994), 50 (24), 17953-17979 ISSN:0163-1829.There is no expanded citation for this reference.
- 34Kresse, G.; Joubert, D. From ultrasoft pseudopotentials to the projector augmented-wave method. Phys. Rev. B 1999, 59, 1758, DOI: 10.1103/PhysRevB.59.175834https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADyaK1MXkt12nug%253D%253D&md5=78a73e92a93f995982fc481715729b14From ultrasoft pseudopotentials to the projector augmented-wave methodKresse, G.; Joubert, D.Physical Review B: Condensed Matter and Materials Physics (1999), 59 (3), 1758-1775CODEN: PRBMDO; ISSN:0163-1829. (American Physical Society)The formal relationship between ultrasoft (US) Vanderbilt-type pseudopotentials and Blochl's projector augmented wave (PAW) method is derived. The total energy functional for US pseudopotentials can be obtained by linearization of two terms in a slightly modified PAW total energy functional. The Hamilton operator, the forces, and the stress tensor are derived for this modified PAW functional. A simple way to implement the PAW method in existing plane-wave codes supporting US pseudopotentials is pointed out. In addn., crit. tests are presented to compare the accuracy and efficiency of the PAW and the US pseudopotential method with relaxed-core all-electron methods. These tests include small mols. (H2, H2O, Li2, N2, F2, BF3, SiF4) and several bulk systems (diamond, Si, V, Li, Ca, CaF2, Fe, Co, Ni). Particular attention is paid to the bulk properties and magnetic energies of Fe, Co, and Ni.
- 35Perdew, J. P.; Burke, K.; Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett. 1996, 77, 3865, DOI: 10.1103/PhysRevLett.77.386535https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADyaK28XmsVCgsbs%253D&md5=55943538406ee74f93aabdf882cd4630Generalized gradient approximation made simplePerdew, John P.; Burke, Kieron; Ernzerhof, MatthiasPhysical Review Letters (1996), 77 (18), 3865-3868CODEN: PRLTAO; ISSN:0031-9007. (American Physical Society)Generalized gradient approxns. (GGA's) for the exchange-correlation energy improve upon the local spin d. (LSD) description of atoms, mols., and solids. We present a simple derivation of a simple GGA, in which all parameters (other than those in LSD) are fundamental consts. Only general features of the detailed construction underlying the Perdew-Wang 1991 (PW91) GGA are invoked. Improvements over PW91 include an accurate description of the linear response of the uniform electron gas, correct behavior under uniform scaling, and a smoother potential.
Supporting Information
Supporting Information
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acsomega.3c05224.
Hyperparameters, data set for generalization test, data distribution (PDF)
Terms & Conditions
Most electronic Supporting Information files are available without a subscription to ACS Web Editions. Such files may be downloaded by article for research use (if there is a public use license linked to the relevant article, that license may permit other uses). Permission may be obtained from ACS for other uses through requests via the RightsLink permission system: http://pubs.acs.org/page/copyright/permissions.html.