Machine Learning Diffusion Monte Carlo EnergiesClick to copy article linkArticle link copied!
- Kevin Ryczko*Kevin Ryczko*Email: [email protected]Good Chemistry Company, Vancouver, British ColumbiaV6E 4B1, CanadaMore by Kevin Ryczko
- Jaron T. KrogelJaron T. KrogelMaterials Science and Technology Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee37831, United StatesMore by Jaron T. Krogel
- Isaac Tamblyn*Isaac Tamblyn*Email: [email protected]Department of Physics, University of Ottawa, Ottawa, OntarioK1N 6N5, CanadaVector Institute for Artificial Intelligence, Toronto, OntarioM5G 1M1, CanadaMore by Isaac Tamblyn
Abstract

We present two machine learning methodologies that are capable of predicting diffusion Monte Carlo (DMC) energies with small data sets (≈60 DMC calculations in total). The first uses voxel deep neural networks (VDNNs) to predict DMC energy densities using Kohn–Sham density functional theory (DFT) electron densities as input. The second uses kernel ridge regression (KRR) to predict atomic contributions to the DMC total energy using atomic environment vectors as input (we used atom-centered symmetry functions, atomic environment vectors from the ANI models, and smooth overlap of atomic positions). We first compare the methodologies on pristine graphene lattices, where we find that the KRR methodology performs best in comparison to gradient boosted decision trees, random forest, Gaussian process regression, and multilayer perceptrons. In addition, KRR outperforms VDNNs by an order of magnitude. Afterward, we study the generalizability of KRR to predict the energy barrier associated with a Stone–Wales defect. Lastly, we move from 2D to 3D materials and use KRR to predict total energies of liquid water. In all cases, we find that the KRR models are more accurate than Kohn–Sham DFT and all mean absolute errors are less than chemical accuracy.
Cited By
Smart citations by scite.ai include citation statements extracted from the full text of the citing article. The number of the statements may be higher than the number of citations provided by ACS Publications if one paper cites another multiple times or lower if scite has not yet processed some of the citing articles.
This article is cited by 13 publications.
- Gopal R. Iyer, Noah Whelpley, Juha Tiihonen, Paul R. C. Kent, Jaron T. Krogel, Brenda M. Rubenstein. Force-Free Identification of Minimum-Energy Pathways and Transition States for Stochastic Electronic Structure Theories. Journal of Chemical Theory and Computation 2024, 20
(17)
, 7416-7429. https://doi.org/10.1021/acs.jctc.4c00214
- Tristan Maxson, Ademola Soyemi, Benjamin W. J. Chen, Tibor Szilvási. Enhancing the Quality and Reliability of Machine Learning Interatomic Potentials through Better Reporting Practices. The Journal of Physical Chemistry C 2024, 128
(16)
, 6524-6537. https://doi.org/10.1021/acs.jpcc.4c00028
- Michael S. Chen, Joonho Lee, Hong-Zhou Ye, Timothy C. Berkelbach, David R. Reichman, Thomas E. Markland. Data-Efficient Machine Learning Potentials from Transfer Learning of Periodic Correlated Electronic Structure Methods: Liquid Water at AFQMC, CCSD, and CCSD(T) Accuracy. Journal of Chemical Theory and Computation 2023, 19
(14)
, 4510-4519. https://doi.org/10.1021/acs.jctc.2c01203
- Cancan Huang, Brenda M. Rubenstein. Machine Learning Diffusion Monte Carlo Forces. The Journal of Physical Chemistry A 2023, 127
(1)
, 339-355. https://doi.org/10.1021/acs.jpca.2c05904
- Pranoy Ray, Kamal Choudhary, Surya R. Kalidindi. Lean CNNs for Mapping Electron Charge Density Fields to Material Properties. Integrating Materials and Manufacturing Innovation 2025, 14
(1)
, 1-13. https://doi.org/10.1007/s40192-024-00389-9
- Hao Xiao, Yingping Tian, Hengbo Gao, Xiaolei Cui, Shimin Dong, Qianlong Xue, Dongqi Yao. Analysis of the fatigue status of medical security personnel during the closed-loop period using multiple machine learning methods: a case study of the Beijing 2022 Olympic Winter Games. Scientific Reports 2024, 14
(1)
https://doi.org/10.1038/s41598-024-59397-6
- Edgar Josué Landinez Borda, Kenneth O. Berard, Annette Lopez, Brenda Rubenstein. Gaussian processes for finite size extrapolation of many-body simulations. Faraday Discussions 2024, 254 , 500-528. https://doi.org/10.1039/D4FD00051J
- Iddo Eliazar. Regular and anomalous diffusion: I. Foundations. Journal of Physics A: Mathematical and Theoretical 2024, 57
(23)
, 233002. https://doi.org/10.1088/1751-8121/ad4b7c
- Bin Lu, Yuze Xia, Yuqian Ren, Miaomiao Xie, Liguo Zhou, Giovanni Vinai, Simon A. Morton, Andrew T. S. Wee, Wilfred G. van der Wiel, Wen Zhang, Ping Kwan Johnny Wong. When Machine Learning Meets 2D Materials: A Review. Advanced Science 2024, 11
(13)
https://doi.org/10.1002/advs.202305277
- Tan Yang, Hai Yang, Yan Liu, Xiao Liu, Yi-Jie Ding, Run Li, An-Qiong Mao, Yue Huang, Xiao-Liang Li, Ying Zhang, Feng-Xu Yu. Postoperative delirium prediction after cardiac surgery using machine learning models. Computers in Biology and Medicine 2024, 169 , 107818. https://doi.org/10.1016/j.compbiomed.2023.107818
- William A. Wheeler, Shivesh Pathak, Kevin G. Kleiner, Shunyue Yuan, João N. B. Rodrigues, Cooper Lorsung, Kittithat Krongchon, Yueqing Chang, Yiqing Zhou, Brian Busemeyer, Kiel T. Williams, Alexander Muñoz, Chun Yu Chow, Lucas K. Wagner. PyQMC
: An all-Python real-space quantum Monte Carlo module in
PySCF. The Journal of Chemical Physics 2023, 158
(11)
https://doi.org/10.1063/5.0139024
- Choon Wee Kee. Molecular Understanding and Practical In Silico Catalyst Design in Computational Organocatalysis and Phase Transfer Catalysis—Challenges and Opportunities. Molecules 2023, 28
(4)
, 1715. https://doi.org/10.3390/molecules28041715
- Sampath Routu, Madhughnea Sai Adabala, G. Gopichand. Solutions to Diffusion Equations Using Neural Networks. 2023, 881-892. https://doi.org/10.1007/978-981-99-4634-1_69
Article Views are the COUNTER-compliant sum of full text article downloads since November 2008 (both PDF and HTML) across all institutions and individuals. These metrics are regularly updated to reflect usage leading up to the last few days.
Citations are the number of other articles citing this article, calculated by Crossref and updated daily. Find more information about Crossref citation counts.
The Altmetric Attention Score is a quantitative measure of the attention that a research article has received online. Clicking on the donut icon will load a page at altmetric.com with additional details about the score and the social media presence for the given article. Find more information on the Altmetric Attention Score and how the score is calculated.