ACS Publications. Most Trusted. Most Cited. Most Read
My Activity
CONTENT TYPES

Figure 1Loading Img
RETURN TO ISSUEPREVStructure PredictionNEXT

Dual-Level Training of Gaussian Processes with Physically Inspired Priors for Geometry Optimizations

  • Chong Teng
    Chong Teng
    Department of Chemistry, Boston College, Chestnut Hill, Massachusetts 02467, United States
    More by Chong Teng
  • Yang Wang
    Yang Wang
    Department of Chemistry, Boston College, Chestnut Hill, Massachusetts 02467, United States
    More by Yang Wang
  • Daniel Huang
    Daniel Huang
    Department of Computer Science, San Francisco State University, San Francisco, California 94132, United States
    More by Daniel Huang
  • Katherine Martin
    Katherine Martin
    Department of Chemistry, Boston College, Chestnut Hill, Massachusetts 02467, United States
  • Jean-Baptiste Tristan*
    Jean-Baptiste Tristan
    Department of Computer Science, Boston College, Chestnut Hill, Massachusetts 02467, United States
    *Email: [email protected]
  • , and 
  • Junwei Lucas Bao*
    Junwei Lucas Bao
    Department of Chemistry, Boston College, Chestnut Hill, Massachusetts 02467, United States
    *Email: [email protected]
Cite this: J. Chem. Theory Comput. 2022, 18, 9, 5739–5754
Publication Date (Web):August 8, 2022
https://doi.org/10.1021/acs.jctc.2c00546
Copyright © 2022 American Chemical Society

    Article Views

    565

    Altmetric

    -

    Citations

    LEARN ABOUT THESE METRICS
    Read OnlinePDF (3 MB)
    Supporting Info (1)»

    Abstract

    Abstract Image

    Gaussian process (GP) regression has been recently developed as an effective method in molecular geometry optimization. The prior mean function is one of the crucial parts of the GP. We design and validate two types of physically inspired prior mean functions: force-field-based priors and posterior-type priors. In this work, we implement a dual-level training (DLT) optimizer for the posterior-type priors. The DLT optimizers can be considered as a class of optimization algorithms that belong to the delta-machine learning paradigm but with several major differences compared to the previously proposed algorithms in the same paradigm. In the first level of the DLT, we incorporate the classical mechanical descriptions of the equilibrium geometries into the prior function, which enhances the performance of the GP optimizer as compared to the one using a constant (or zero) prior. In the second level, we utilize the surrogate potential energy surfaces (PESs), which incorporate the physics learned in the first-level training, as the prior function to refine the model performance further. We find that the force-field-based priors and posterior-type priors reduce the overall optimization steps by a factor of 2–3 when compared to the limited-memory Broyden–Fletcher–Goldfarb–Shanno (L-BFGS) optimizer as well as the constant-prior GP optimizer proposed in previous works. We also demonstrate the potential of recovering the real PESs with GP with a force-field prior. This work shows the importance of including domain knowledge as an ingredient in the GP, which offers a potentially robust learning model for molecular geometry optimization and for exploring molecular PESs.

    Supporting Information

    ARTICLE SECTIONS
    Jump To

    The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jctc.2c00546.

    • Cartesian coordinates (in Å) of the initial geometries for the Baker-25 dataset; Cartesian coordinates (in Å) of the initial geometries for the AA-20 dataset; implementation details of the modified CHARMM force field; dissociation energies (in kJ/mol) by bond type for the Morse potential used in the modified CHARMM force field; detailed comparisons of the molecular structures (with ids and abbreviations consistent with Table 1) optimized by the force-only GP (with the modified CHARMM force field as the prior mean function) and the L-BFGS optimizers; scripts for generating the harmonic-only, Morse-only, and the modified CHARMM force-field prior mean functions; input scripts of the single-level energy-and-force GP with the harmonic-only, Morse-only, and the modified CHARMM force-field prior mean functions; implementation details of the simplified UFF; implementation details of the DL-BFGS method; input scripts for the DLT force-only GP with the modified CHARMM prior mean function, with fmaxL1 = 0.1 and fmaxL2 = 0.01 eV/Å; input script for the single-level energy-and-force GP with the Emax + 10 prior (PDF)

    Terms & Conditions

    Most electronic Supporting Information files are available without a subscription to ACS Web Editions. Such files may be downloaded by article for research use (if there is a public use license linked to the relevant article, that license may permit other uses). Permission may be obtained from ACS for other uses through requests via the RightsLink permission system: http://pubs.acs.org/page/copyright/permissions.html.

    Cited By

    This article is cited by 1 publications.

    1. Chong Teng, Daniel Huang, Junwei Lucas Bao. A spur to molecular geometry optimization: Gradient-enhanced universal kriging with on-the-fly adaptive ab initio prior mean functions in curvilinear coordinates. The Journal of Chemical Physics 2023, 158 (2) , 024112. https://doi.org/10.1063/5.0133675

    Pair your accounts.

    Export articles to Mendeley

    Get article recommendations from ACS based on references in your Mendeley library.

    Pair your accounts.

    Export articles to Mendeley

    Get article recommendations from ACS based on references in your Mendeley library.

    You’ve supercharged your research process with ACS and Mendeley!

    STEP 1:
    Click to create an ACS ID

    Please note: If you switch to a different device, you may be asked to login again with only your ACS ID.

    Please note: If you switch to a different device, you may be asked to login again with only your ACS ID.

    Please note: If you switch to a different device, you may be asked to login again with only your ACS ID.

    MENDELEY PAIRING EXPIRED
    Your Mendeley pairing has expired. Please reconnect