Hamiltonian-Reservoir Replica Exchange and Machine Learning Potentials for Computational Organic Chemistry
- Raimon FabregatRaimon FabregatLaboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École polytechnique fédérale de Lausanne (EPFL), CH-1015 Lausanne, SwitzerlandMore by Raimon Fabregat
- ,
- Alberto FabrizioAlberto FabrizioLaboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École polytechnique fédérale de Lausanne (EPFL), CH-1015 Lausanne, SwitzerlandNational Centre for Computational Design and Discovery of Novel Materials (MARVEL), École polytechnique fédérale de Lausanne (EPFL), CH-1015 Lausanne, SwitzerlandMore by Alberto Fabrizio
- ,
- Benjamin MeyerBenjamin MeyerLaboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École polytechnique fédérale de Lausanne (EPFL), CH-1015 Lausanne, SwitzerlandNational Centre for Computational Design and Discovery of Novel Materials (MARVEL), École polytechnique fédérale de Lausanne (EPFL), CH-1015 Lausanne, SwitzerlandMore by Benjamin Meyer
- ,
- Daniel HollasDaniel HollasLaboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École polytechnique fédérale de Lausanne (EPFL), CH-1015 Lausanne, SwitzerlandMore by Daniel Hollas
- , and
- Clémence Corminboeuf*Clémence Corminboeuf*[email protected]Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École polytechnique fédérale de Lausanne (EPFL), CH-1015 Lausanne, SwitzerlandNational Centre for Computational Design and Discovery of Novel Materials (MARVEL), École polytechnique fédérale de Lausanne (EPFL), CH-1015 Lausanne, SwitzerlandMore by Clémence Corminboeuf
Abstract

This work combines a machine learning potential energy function with a modular enhanced sampling scheme to obtain statistically converged thermodynamical properties of flexible medium-size organic molecules at high ab initio level. We offer a modular environment in the python package MORESIM that allows custom design of replica exchange simulations with any level of theory including ML-based potentials. Our specific combination of Hamiltonian and reservoir replica exchange is shown to be a powerful technique to accelerate enhanced sampling simulations and explore free energy landscapes with a quantum chemical accuracy unattainable otherwise (e.g., DLPNO-CCSD(T)/CBS quality). This engine is used to demonstrate the relevance of accessing the ab initio free energy landscapes of molecules whose stability is determined by a subtle interplay between variations in the underlying potential energy and conformational entropy (i.e., a bridged asymmetrically polarized dithiacyclophane and a widely used organocatalyst) both in the gas phase and in solution (implicit solvent).
1. Introduction
Figure 1

Figure 1. (a) Dithiacyclophane and the collective variables used to characterize its global structure: the distance between the center of masses of each cyclic bulk and the angles between the average planes going through them. (b) Cinchona alkaloid organocatalyst and the two dihedral angles used to characterize its global structure.
2. Methods and Computational Details
Overview
Figure 2

Figure 2. Mind-map and workflow illustrating the proposed methodology.
2.1. Quantum Chemical Potentials: Targets and Baseline
2.2. Machine Learning Models
2.3. Hamiltonian-Reservoir Replica Exchange
Figure 3

Figure 3. Schematic depiction of Hamiltonian reservoir Replica Exchange.
Figure 4

Figure 4. (a) Free energy landscape (DFTB-SK/3OB level) of dithiacyclophane at 300 K (T-RE) projected on the 2D space generated by the collective variables visible in Figure 1a. (b) Projection of the data set made of 1500 dithiacyclophane structures extracted with farthest point sampling from the 300 K canonical ensemble of 40 000 structures and color coded on the basis of the single-point energy difference ΔE = ((DFTB-SK/3OB) – (DLPNO-CCSD(T)/CBS)). The continuous background is plotted using a Gaussian interpolation of the mean energy difference. The smooth histograms were constructed with a Gaussian Kernel Density Estimator (Gaussian KDE) using the SciPy (73) python library.
Figure 5

Figure 5. (a) Free energy landscape (DFTB-SK/3OB level) of the cinchona alkaloid organocatalyst at 300 K projected on the 2D space generated by the collective variables visible in Figure 1b. Constructed with canonical structures generated with T-RE simulations with DFTB-SK as potential energy. (b) Projection of the 1800 data set structures obtained with FPS from a canonical ensemble of 32 000 structures at 300 K canonical ensemble and color coded on the basis of the single point energy difference ΔE = ((DFTB-SK/3OB) – (DLPNO-CCSD(T)/CBS)). (c) Structures representing each of the four conformational regions (i.e., basins).
2.4. Technical Details
3. Results
3.1. Dithiacyclophane
Figure 6

Figure 6. Comparison between the DFTB-SK electronic energy and the ML-DLPNO-CCSD(T)/CBS predictions (i.e., DFTB-SK + ΔML correction) for the 40 000 structures in the reservoir.
Figure 7

Figure 7. Free energy landscapes at 300 K generated with the potential: (a) DFTB-SK; (b) ML-DLPNO-CCSD(T)/CBS; (c) ML-[PBE0-D3/(6-31G)(SMD Chloroform)]; (d) PBE0-D3/(6-31G); (e) ML-PBE0-D3/(6-31G). (f) Relative free energies by integration within the local minima. (24) The free energies are all given relative to the Disarticulated state except for the solvated system, where the open state is used as a reference. The striped columns correspond to the static relative free energy using the harmonic approximation (for the solvated system the harmonic free energies were computed with the true potential, and not with the machine learning version). All the free energies maps come from resH-RE expect for the direct PBE0, which uses T-RE, as described in the methods section.
3.2. Cinchona Alkaloid
Figure 8

Figure 8. Free energy landscapes at 300 K generated with the potential: (a) DFTB-SK; (b) ML-DLPNO-CCSD(T)/CBS; (c) ML-[PBE0-D3/(6-31G)(SMD Chloroform)]. (d) Free energies upon integration within the free energy basins. The free energies are all given relative to the Disarticulated state. The stripped columns are the free energy predictions of the basins using the static free energies using the harmonic correction.
4. Conclusions
Supporting Information
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jctc.0c00100.
Histogram of the cost of computations, learning curves, energy comparisons, basin potential energies, energies scatter plot, free energy landscapes and convergences, and detailed information on the training procedure (PDF)
Terms & Conditions
Most electronic Supporting Information files are available without a subscription to ACS Web Editions. Such files may be downloaded by article for research use (if there is a public use license linked to the relevant article, that license may permit other uses). Permission may be obtained from ACS for other uses through requests via the RightsLink permission system: http://pubs.acs.org/page/copyright/permissions.html.
Acknowledgments
This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (Grant agreement No 817977). AF. and B.M. acknowledge the National Centre of Competence in Research (NCCR) “Materials’ Revolution: Computational Design and Discovery of Novel Materials (MARVEL)” of the Swiss National Science Foundation (SNSF) for financial support.
References
This article references 77 other publications.
- 1Behler, J.; Parrinello, M. Generalized Neural-Network Representation of High-Dimensional Potential-Energy Surfaces. Phys. Rev. Lett. 2007, 98 (14), 146401, DOI: 10.1103/PhysRevLett.98.146401[Crossref], [PubMed], [CAS], Google Scholar1https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD2sXjvF2ls7w%253D&md5=579a6cbf503565205acbb86ade0ae86bGeneralized Neural-Network Representation of High-Dimensional Potential-Energy SurfacesBehler, Jorg; Parrinello, MichelePhysical Review Letters (2007), 98 (14), 146401/1-146401/4CODEN: PRLTAO; ISSN:0031-9007. (American Physical Society)The accurate description of chem. processes often requires the use of computationally demanding methods like d.-functional theory (DFT), making long simulations of large systems unfeasible. In this Letter we introduce a new kind of neural-network representation of DFT potential-energy surfaces, which provides the energy and forces as a function of all at. positions in systems of arbitrary size and is several orders of magnitude faster than DFT. The high accuracy of the method is demonstrated for bulk silicon and compared with empirical potentials and DFT. The method is general and can be applied to all types of periodic and nonperiodic systems.
- 2Behler, J.; Marto crnnák, R.; Donadio, D.; Parrinello, M. Metadynamics Simulations of the High-Pressure Phases of Silicon Employing a High-Dimensional Neural Network Potential. Phys. Rev. Lett. 2008, 100 (18), 185501, DOI: 10.1103/PhysRevLett.100.185501[Crossref], [PubMed], [CAS], Google Scholar2https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD1cXlslKnsro%253D&md5=e003c971c671233d15bbc3679dace5bbMetadynamics simulations of the high-pressure phases of silicon employing a high-dimensional neural network potentialBehler, Joerg; Martonak, Roman; Donadio, Davide; Parrinello, MichelePhysical Review Letters (2008), 100 (18), 185501/1-185501/4CODEN: PRLTAO; ISSN:0031-9007. (American Physical Society)We study in a systematic way the complex sequence of the high-pressure phases of silicon obtained upon compression by combining an accurate high-dimensional neural network representation of the d.-functional theory potential energy surface with the metadynamics scheme. Starting from the thermodynamically stable diamond structure at ambient conditions we are able to identify all structural phase transitions up to the highest-pressure fcc phase at about 100 GPa. The results are in excellent agreement with expt. The method developed promises to be of great value in the study of inorg. solids, including those having metallic phases.
- 3Khaliullin, R. Z.; Eshet, H.; Kühne, T. D.; Behler, J.; Parrinello, M. Graphite-Diamond Phase Coexistence Study Employing a Neural-Network Mapping of the Ab Initio Potential Energy Surface. Phys. Rev. B: Condens. Matter Mater. Phys. 2010, 81 (10), 100103, DOI: 10.1103/PhysRevB.81.100103[Crossref], [CAS], Google Scholar3https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3cXkt1Ojtb4%253D&md5=ea08109866c0f25f4950b2263fd68b64Graphite-diamond phase coexistence study employing a neural-network mapping of the ab initio potential energy surfaceKhaliullin, Rustam Z.; Eshet, Hagai; Kuhne, Thomas D.; Behler, Jorg; Parrinello, MichelePhysical Review B: Condensed Matter and Materials Physics (2010), 81 (10), 100103/1-100103/4CODEN: PRBMDO; ISSN:1098-0121. (American Physical Society)An interat. potential for the diamond and graphite phases of carbon has been created using a neural-network (NN) representation of the ab initio potential energy surface. The NN potential combines the accuracy of a first-principles description of both phases with the efficiency of empirical force fields and allows one to perform a mol.-dynamics study, of ab initio quality, of the thermodn. of graphite-diamond coexistence. Good agreement between the exptl. and calcd. coexistence curves is achieved if nuclear quantum effects are included in the simulation.
- 4Khaliullin, R. Z.; Eshet, H.; Kühne, T. D.; Behler, J.; Parrinello, M. Nucleation Mechanism for the Direct Graphite-to-Diamond Phase Transition. Nat. Mater. 2011, 10 (9), 693, DOI: 10.1038/nmat3078[Crossref], [PubMed], [CAS], Google Scholar4https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3MXptlaku7o%253D&md5=c1a307e718eaa060f02f14e3af4aa93eNucleation mechanism for the direct graphite-to-diamond phase transitionKhaliullin, Rustam Z.; Eshet, Hagai; Kuehne, Thomas D.; Behler, Joerg; Parrinello, MicheleNature Materials (2011), 10 (9), 693-697CODEN: NMAACR; ISSN:1476-1122. (Nature Publishing Group)Graphite and diamond have comparable free energies, yet forming diamond from graphite in the absence of a catalyst requires pressures that are significantly higher than those at equil. coexistence. At lower temps., the formation of the metastable hexagonal polymorph of diamond is favored instead of the more stable cubic diamond. These phenomena cannot be explained by the concerted mechanism suggested in previous theor. studies. Using an ab initio quality neural-network potential, we carried out a large-scale study of the graphite-to-diamond transition assuming that it occurs through nucleation. The nucleation mechanism accounts for the obsd. phenomenol. and reveals its microscopic origins. We demonstrate that the large lattice distortions that accompany the formation of diamond nuclei inhibit the phase transition at low pressure, and direct it towards the hexagonal diamond phase at higher pressure. The proposed nucleation mechanism should improve our understanding of structural transformations in a wide range of carbon-based materials.
- 5Eshet, H.; Khaliullin, R. Z.; Kühne, T. D.; Behler, J.; Parrinello, M. Ab Initio Quality Neural-Network Potential for Sodium. Phys. Rev. B: Condens. Matter Mater. Phys. 2010, 81 (18), 184107, DOI: 10.1103/PhysRevB.81.184107[Crossref], [CAS], Google Scholar5https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3cXmvVKjsL0%253D&md5=f56756ae8cfdb097f84ced73ba87baa8Ab initio quality neural-network potential for sodiumEshet, Hagai; Khaliullin, Rustam Z.; Kuhne, Thomas D.; Behler, Jorg; Parrinello, MichelePhysical Review B: Condensed Matter and Materials Physics (2010), 81 (18), 184107/1-184107/8CODEN: PRBMDO; ISSN:1098-0121. (American Physical Society)An interat. potential for high-pressure high-temp. (HPHT) cryst. and liq. phases of sodium is created using a neural-network (NN) representation of the ab initio potential-energy surface. It is demonstrated that the NN potential provides an ab initio quality description of multiple properties of liq. sodium and bcc, fcc, and cI16 crystal phases in the P-T region up to 120 GPa and 1200 K. The unique combination of computational efficiency of the NN potential and its ability to reproduce quant. exptl. properties of sodium in the wide P-T range enables mol.-dynamics simulations of physicochem. processes in HPHT sodium of unprecedented quality.
- 6Gastegger, M.; Kauffmann, C.; Behler, J.; Marquetand, P. Comparing the Accuracy of High-Dimensional Neural Network Potentials and the Systematic Molecular Fragmentation Method: A Benchmark Study for All-Trans Alkanes. J. Chem. Phys. 2016, 144 (19), 194110, DOI: 10.1063/1.4950815[Crossref], [PubMed], [CAS], Google Scholar6https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC28XotleksrY%253D&md5=a805eda60b32bf67bacfa6d3fdf40befComparing the accuracy of high-dimensional neural network potentials and the systematic molecular fragmentation method: A benchmark study for all-trans alkanesGastegger, Michael; Kauffmann, Clemens; Behler, Joerg; Marquetand, PhilippJournal of Chemical Physics (2016), 144 (19), 194110/1-194110/6CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)Many approaches, which have been developed to express the potential energy of large systems, exploit the locality of the at. interactions. A prominent example is the fragmentation methods in which the quantum chem. calcns. are carried out for overlapping small fragments of a given mol. that are then combined in a second step to yield the system's total energy. Here we compare the accuracy of the systematic mol. fragmentation approach with the performance of high-dimensional neural network (HDNN) potentials introduced by Behler and Parrinello. HDNN potentials are similar in spirit to the fragmentation approach in that the total energy is constructed as a sum of environment-dependent at. energies, which are derived indirectly from electronic structure calcns. As a benchmark set, we use all-trans alkanes contg. up to eleven carbon atoms at the coupled cluster level of theory. These mols. have been chosen because they allow to extrapolate reliable ref. energies for very long chains, enabling an assessment of the energies obtained by both methods for alkanes including up to 10 000 carbon atoms. We find that both methods predict high-quality energies with the HDNN potentials yielding smaller errors with respect to the coupled cluster ref. (c) 2016 American Institute of Physics.
- 7Gastegger, M.; Behler, J.; Marquetand, P. Machine Learning Molecular Dynamics for the Simulation of Infrared Spectra. Chem. Sci. 2017, 8 (10), 6924– 6935, DOI: 10.1039/C7SC02267K[Crossref], [PubMed], [CAS], Google Scholar7https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2sXhtlSrtL7J&md5=68f30b308f22e04fc416e62c7aa85eedMachine learning molecular dynamics for the simulation of infrared spectraGastegger, Michael; Behler, Joerg; Marquetand, PhilippChemical Science (2017), 8 (10), 6924-6935CODEN: CSHCCN; ISSN:2041-6520. (Royal Society of Chemistry)A review. Machine learning has emerged as an invaluable tool in many research areas. In the present work, we harness this power to predict highly accurate mol. IR spectra with unprecedented computational efficiency. To account for vibrational anharmonic and dynamical effects - typically neglected by conventional quantum chem. approaches - we base our machine learning strategy on ab initio mol. dynamics simulations. While these simulations are usually extremely time consuming even for small mols., we overcome these limitations by leveraging the power of a variety of machine learning techniques, not only accelerating simulations by several orders of magnitude, but also greatly extending the size of systems that can be treated. To this end, we develop a mol. dipole moment model based on environment dependent neural network charges and combine it with the neural network potential approach of Behler and Parrinello. Contrary to the prevalent big data philosophy, we are able to obtain very accurate machine learning models for the prediction of IR spectra based on only a few hundreds of electronic structure ref. points. This is made possible through the use of mol. forces during neural network potential training and the introduction of a fully automated sampling scheme. We demonstrate the power of our machine learning approach by applying it to model the IR spectra of a methanol mol., n-alkanes contg. up to 200 atoms and the protonated alanine tripeptide, which at the same time represents the first application of machine learning techniques to simulate the dynamics of a peptide. In all of these case studies we find an excellent agreement between the IR spectra predicted via machine learning models and the resp. theor. and exptl. spectra.
- 8Bartók, A. P.; Payne, M. C.; Kondor, R.; Csányi, G. Gaussian Approximation Potentials: the Accuracy of Quantum Mechanics, without the Electrons. Phys. Rev. Lett. 2010, 104 (13), 136403, DOI: 10.1103/PhysRevLett.104.136403[Crossref], [PubMed], [CAS], Google Scholar8https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3cXkt1Kqur8%253D&md5=0a468458554e85413b53816c082419f2Gaussian Approximation Potentials: The Accuracy of Quantum Mechanics, without the ElectronsBartok, Albert P.; Payne, Mike C.; Kondor, Risi; Csanyi, GaborPhysical Review Letters (2010), 104 (13), 136403/1-136403/4CODEN: PRLTAO; ISSN:0031-9007. (American Physical Society)We introduce a class of interat. potential models that can be automatically generated from data consisting of the energies and forces experienced by atoms, as derived from quantum mech. calcns. The models do not have a fixed functional form and hence are capable of modeling complex potential energy landscapes. They are systematically improvable with more data. We apply the method to bulk crystals, and test it by calcg. properties at high temps. Using the interat. potential to generate the long mol. dynamics trajectories required for such calcns. saves orders of magnitude in computational cost.
- 9Bartók, A. P.; De, S.; Poelking, C.; Bernstein, N.; Kermode, J. R.; Csányi, G.; Ceriotti, M. Machine Learning Unifies the Modeling of Materials and Molecules. Sci. Adv. 2017, 3 (12), e1701816 DOI: 10.1126/sciadv.1701816[Crossref], [PubMed], [CAS], Google Scholar9https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXisVWgsbjP&md5=e996f3746c995ed0304f33762f7da713Machine learning unifies the modeling of materials and moleculesBartok, Albert P.; De, Sandip; Poelking, Carl; Bernstein, Noam; Kermode, James R.; Csanyi, Gabor; Ceriotti, MicheleScience Advances (2017), 3 (12), e1701816/1-e1701816/8CODEN: SACDAF; ISSN:2375-2548. (American Association for the Advancement of Science)Detg. the stability of mols. and condensed phases is the cornerstone of atomistic modeling, underpinning our understanding of chem. and materials properties and transformations. We show that a machine-learning model, based on a local description of chem. environments and Bayesian statistical learning, provides a unified framework to predict at.-scale properties. It captures the quantum mech. effects governing the complex surface reconstructions of silicon, predicts the stability of different classes of mols. with chem. accuracy, and distinguishes active and inactive protein ligands with more than 99% reliability. The universality and the systematic nature of our framework provide new insight into the potential energy surface of materials and mols.
- 10Caro, M. A.; Deringer, V. L.; Koskinen, J.; Laurila, T.; Csányi, G. Growth Mechanism and Origin of High Sp3 Content in Tetrahedral Amorphous Carbon. Phys. Rev. Lett. 2018, 120, 166101, DOI: 10.1103/PhysRevLett.120.166101[Crossref], [PubMed], [CAS], Google Scholar10https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXltFSgt78%253D&md5=9b29a42ea8c0a1bccff5d98eed9713c0Growth Mechanism and Origin of High sp3 Content in Tetrahedral Amorphous CarbonCaro, Miguel A.; Deringer, Volker L.; Koskinen, Jari; Laurila, Tomi; Csanyi, GaborPhysical Review Letters (2018), 120 (16), 166101CODEN: PRLTAO; ISSN:1079-7114. (American Physical Society)We study the deposition of tetrahedral amorphous carbon (ta-C) films from mol. dynamics simulations based on a machine-learned interat. potential trained from d.-functional theory data. For the first time, the high sp3 fractions in excess of 85% obsd. exptl. are reproduced by means of computational simulation, and the deposition energy dependence of the film's characteristics is also accurately described. High confidence in the potential and direct access to the at. interactions allow us to infer the microscopic growth mechanism in this material. While the widespread view is that ta-C grows by "subplantation," we show that the so-called "peening" model is actually the dominant mechanism responsible for the high sp3 content. We show that pressure waves lead to bond rearrangement away from the impact site of the incident ion, and high sp3 fractions arise from a delicate balance of transitions between three- and fourfold coordinated carbon atoms. These results open the door for a microscopic understanding of carbon nanostructure formation with an unprecedented level of predictive power.
- 11Deringer, V. L.; Bernstein, N.; Bartók, A. P.; Cliffe, M. J.; Kerber, R. N.; Marbella, L. E.; Grey, C. P.; Elliott, S. R.; Csányi, G. Realistic Atomistic Structure of Amorphous Silicon from Machine-Learning-Driven Molecular Dynamics. J. Phys. Chem. Lett. 2018, 9 (11), 2879– 2885, DOI: 10.1021/acs.jpclett.8b00902[ACS Full Text
], [CAS], Google Scholar
11https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXptlyitrk%253D&md5=803bfce8641590478e89276ebb7fc81dRealistic Atomistic Structure of Amorphous Silicon from Machine-Learning-Driven Molecular DynamicsDeringer, Volker L.; Bernstein, Noam; Bartok, Albert P.; Cliffe, Matthew J.; Kerber, Rachel N.; Marbella, Lauren E.; Grey, Clare P.; Elliott, Stephen R.; Csanyi, GaborJournal of Physical Chemistry Letters (2018), 9 (11), 2879-2885CODEN: JPCLCD; ISSN:1948-7185. (American Chemical Society)Amorphous silicon (a-Si) is a widely studied noncryst. material, and yet the subtle details of its atomistic structure are still unclear. Here, we show that accurate structural models of a-Si can be obtained using a machine-learning-based interat. potential. Our best a-Si network is obtained by simulated cooling from the melt at a rate of 1011 K/s (i.e., on the 10 ns time scale), contains less than 2% defects, and agrees with expts. regarding excess energies, diffraction data, and 29Si NMR chem. shifts. We show that this level of quality is impossible to achieve with faster quench simulations. We then generate a 4096-atom system that correctly reproduces the magnitude of the first sharp diffraction peak (FSDP) in the structure factor, achieving the closest agreement with expts. to date. Our study demonstrates the broader impact of machine-learning potentials for elucidating structures and properties of technol. important amorphous materials. - 12Hu, D.; Xie, Y.; Li, X.; Li, L.; Lan, Z. Inclusion of Machine Learning Kernel Ridge Regression Potential Energy Surfaces in On-the-Fly Nonadiabatic Molecular Dynamics Simulation. J. Phys. Chem. Lett. 2018, 9 (11), 2725– 2732, DOI: 10.1021/acs.jpclett.8b00684[ACS Full Text
], [CAS], Google Scholar
12https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXptVequ7g%253D&md5=d7257373e2b1bc7791fdda02c8e18b47Inclusion of Machine Learning Kernel Ridge Regression Potential Energy Surfaces in On-the-Fly Nonadiabatic Molecular Dynamics SimulationHu, Deping; Xie, Yu; Li, Xusong; Li, Lingyue; Lan, ZhenggangJournal of Physical Chemistry Letters (2018), 9 (11), 2725-2732CODEN: JPCLCD; ISSN:1948-7185. (American Chemical Society)We discuss a theor. approach that employs machine learning potential energy surfaces (ML-PESs) in the nonadiabatic dynamics simulation of polyat. systems by taking 6-aminopyrimidine as a typical example. The Zhu-Nakamura theory is employed in the surface hopping dynamics, which does not require the calcn. of the nonadiabatic coupling vectors. The kernel ridge regression is used in the construction of the adiabatic PESs. In the nonadiabatic dynamics simulation, we use ML-PESs for most geometries and switch back to the electronic structure calcns. for a few geometries either near the S1/S0 conical intersections or in the out-of-confidence regions. The dynamics results based on ML-PESs are consistent with those based on CASSCF PESs. The ML-PESs are further used to achieve the highly efficient massive dynamics simulations with a large no. of trajectories. This work displays the powerful role of ML methods in the nonadiabatic dynamics simulation of polyat. systems. - 13Smith, J. S.; Isayev, O.; Roitberg, A. E. ANI-1: An Extensible Neural Network Potential with DFT Accuracy at Force Field Computational Cost. Chem. Sci. 2017, 8 (4), 3192– 3203, DOI: 10.1039/C6SC05720A[Crossref], [PubMed], [CAS], Google Scholar13https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2sXitlGnsrs%253D&md5=95b2f5106c620c6f09560966dba3559eANI-1: an extensible neural network potential with DFT accuracy at force field computational costSmith, J. S.; Isayev, O.; Roitberg, A. E.Chemical Science (2017), 8 (4), 3192-3203CODEN: CSHCCN; ISSN:2041-6520. (Royal Society of Chemistry)A review. Deep learning is revolutionizing many areas of science and technol., esp. image, text, and speech recognition. In this paper, we demonstrate how a deep neural network (NN) trained on quantum mech. (QM) DFT calcns. can learn an accurate and transferable potential for org. mols. We introduce ANAKIN-ME (Accurate NeurAl networK engINe for Mol. Energies) or ANI for short. ANI is a new method designed with the intent of developing transferable neural network potentials that utilize a highly-modified version of the Behler and Parrinello symmetry functions to build single-atom at. environment vectors (AEV) as a mol. representation. AEVs provide the ability to train neural networks to data that spans both configurational and conformational space, a feat not previously accomplished on this scale. We utilized ANI to build a potential called ANI-1, which was trained on a subset of the GDB databases with up to 8 heavy atoms in order to predict total energies for org. mols. contg. four atom types: H, C, N, and O. To obtain an accelerated but phys. relevant sampling of mol. potential surfaces, we also proposed a Normal Mode Sampling (NMS) method for generating mol. conformations. Through a series of case studies, we show that ANI-1 is chem. accurate compared to ref. DFT calcns. on much larger mol. systems (up to 54 atoms) than those included in the training data set.
- 14Smith, J. S.; Nebgen, B.; Lubbers, N.; Isayev, O.; Roitberg, A. E. Less Is More: Sampling Chemical Space with Active Learning. J. Chem. Phys. 2018, 148 (24), 241733, DOI: 10.1063/1.5023802[Crossref], [PubMed], [CAS], Google Scholar14https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXpvVOjurk%253D&md5=c6894cdd3c471d7ceed10d8b2c095a03Less is more: Sampling chemical space with active learningSmith, Justin S.; Nebgen, Ben; Lubbers, Nicholas; Isayev, Olexandr; Roitberg, Adrian E.Journal of Chemical Physics (2018), 148 (24), 241733/1-241733/10CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)The development of accurate and transferable machine learning (ML) potentials for predicting mol. energetics is a challenging task. The process of data generation to train such ML potentials is a task neither well understood nor researched in detail. In this work, we present a fully automated approach for the generation of datasets with the intent of training universal ML potentials. It is based on the concept of active learning (AL) via Query by Committee (QBC), which uses the disagreement between an ensemble of ML potentials to infer the reliability of the ensemble's prediction. QBC allows the presented AL algorithm to automatically sample regions of chem. space where the ML potential fails to accurately predict the potential energy. AL improves the overall fitness of ANAKIN-ME (ANI) deep learning potentials in rigorous test cases by mitigating human biases in deciding what new training data to use. AL also reduces the training set size to a fraction of the data required when using naive random sampling techniques. To provide validation of our AL approach, we develop the COmprehensive Machine-learning Potential (COMP6) benchmark (publicly available on GitHub) which contains a diverse set of org. mols. Active learning-based ANI potentials outperform the original random sampled ANI-1 potential with only 10% of the data, while the final active learning-based model vastly outperforms ANI-1 on the COMP6 benchmark after training to only 25% of the data. Finally, we show that our proposed AL technique develops a universal ANI potential (ANI-1x) that provides accurate energy and force predictions on the entire COMP6 benchmark. This universal ML potential achieves a level of accuracy on par with the best ML potentials for single mols. or materials, while remaining applicable to the general class of org. mols. composed of the elements CHNO. (c) 2018 American Institute of Physics.
- 15Smith, J. S.; Nebgen, B. T.; Zubatyuk, R.; Lubbers, N.; Devereux, C.; Barros, K.; Tretiak, S.; Isayev, O.; Roitberg, A. E. Approaching Coupled Cluster Accuracy with a General-Purpose Neural Network Potential through Transfer Learning. Nat. Commun. 2019, 10 (1), 2903, DOI: 10.1038/s41467-019-10827-4[Crossref], [PubMed], [CAS], Google Scholar15https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A280%3ADC%252BB3MzjtFaitg%253D%253D&md5=6476e866a59d1408cbfba321242a353dApproaching coupled cluster accuracy with a general-purpose neural network potential through transfer learningSmith Justin S; Devereux Christian; Roitberg Adrian E; Smith Justin S; Nebgen Benjamin T; Zubatyuk Roman; Lubbers Nicholas; Barros Kipton; Tretiak Sergei; Smith Justin S; Lubbers Nicholas; Nebgen Benjamin T; Tretiak Sergei; Zubatyuk Roman; Isayev OlexandrNature communications (2019), 10 (1), 2903 ISSN:.Computational modeling of chemical and biological systems at atomic resolution is a crucial tool in the chemist's toolset. The use of computer simulations requires a balance between cost and accuracy: quantum-mechanical methods provide high accuracy but are computationally expensive and scale poorly to large systems, while classical force fields are cheap and scalable, but lack transferability to new systems. Machine learning can be used to achieve the best of both approaches. Here we train a general-purpose neural network potential (ANI-1ccx) that approaches CCSD(T)/CBS accuracy on benchmarks for reaction thermochemistry, isomerization, and drug-like molecular torsions. This is achieved by training a network to DFT data then using transfer learning techniques to retrain on a dataset of gold standard QM calculations (CCSD(T)/CBS) that optimally spans chemical space. The resulting potential is broadly applicable to materials science, biology, and chemistry, and billions of times faster than CCSD(T)/CBS calculations.
- 16Chmiela, S.; Tkatchenko, A.; Sauceda, H. E.; Poltavsky, I.; Schütt, K. T.; Müller, K. R. Machine Learning of Accurate Energy-Conserving Molecular Force Fields. Sci. Adv. 2017, 3 (5), e1603015 DOI: 10.1126/sciadv.1603015[Crossref], [PubMed], [CAS], Google Scholar16https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXkvVGjsrg%253D&md5=f26ef9dd87735d5d8fdbf5cc1ab9ae7cMachine learning of accurate energy-conserving molecular force fieldsChmiela, Stefan; Tkatchenko, Alexandre; Sauceda, Huziel E.; Poltavsky, Igor; Schuett, Kristof T.; Mueller, Klaus-RobertScience Advances (2017), 3 (5), e1603015/1-e1603015/6CODEN: SACDAF; ISSN:2375-2548. (American Association for the Advancement of Science)Using conservation of energy-a fundamental property of closed classical and quantum mech. systems-we develop an efficient gradient-domain machine learning (GDML) approach to construct accurate mol. force fields using a restricted no. of samples from ab initio mol. dynamics (AIMD) trajectories. The GDML implementation is able to reproduce global potential energy surfaces of intermediate-sized mols. with an accuracy of 0.3 kcal mol-1 for energies and 1 kcal mol-1 Å-1 for at. forces using only 1000 conformational geometries for training. We demonstrate this accuracy for AIMD trajectories of mols., including benzene, toluene, naphthalene, ethanol, uracil, and aspirin. The challenge of constructing conservative force fields is accomplished in our work by learning in a Hilbert space of vector-valued functions that obey the law of energy conservation. The GDML approach enables quant. mol. dynamics simulations for mols. at a fraction of cost of explicit AIMD calcns., thereby allowing the construction of efficient force fields with the accuracy and transferability of high-level ab initio methods.
- 17Chmiela, S.; Sauceda, H. E.; Müller, K. R.; Tkatchenko, A. Towards Exact Molecular Dynamics Simulations with Machine-Learned Force Fields. Nat. Commun. 2018, 9 (1), 3887, DOI: 10.1038/s41467-018-06169-2[Crossref], [PubMed], [CAS], Google Scholar17https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A280%3ADC%252BB3czit1Oqsw%253D%253D&md5=90e14b965fe7f099602125037818aa32Towards exact molecular dynamics simulations with machine-learned force fieldsChmiela Stefan; Muller Klaus-Robert; Sauceda Huziel E; Muller Klaus-Robert; Muller Klaus-Robert; Tkatchenko AlexandreNature communications (2018), 9 (1), 3887 ISSN:.Molecular dynamics (MD) simulations employing classical force fields constitute the cornerstone of contemporary atomistic modeling in chemistry, biology, and materials science. However, the predictive power of these simulations is only as good as the underlying interatomic potential. Classical potentials often fail to faithfully capture key quantum effects in molecules and materials. Here we enable the direct construction of flexible molecular force fields from high-level ab initio calculations by incorporating spatial and temporal physical symmetries into a gradient-domain machine learning (sGDML) model in an automatic data-driven way. The developed sGDML approach faithfully reproduces global force fields at quantum-chemical CCSD(T) level of accuracy and allows converged molecular dynamics simulations with fully quantized electrons and nuclei. We present MD simulations, for flexible molecules with up to a few dozen atoms and provide insights into the dynamical behavior of these molecules. Our approach provides the key missing ingredient for achieving spectroscopic accuracy in molecular simulations.
- 18Sauceda, H. E.; Chmiela, S.; Poltavsky, I.; Müller, K. R.; Tkatchenko, A. Molecular Force Fields with Gradient-Domain Machine Learning: Construction and Application to Dynamics of Small Molecules with Coupled Cluster Forces. J. Chem. Phys. 2019, 150 (11), 114102, DOI: 10.1063/1.5078687[Crossref], [PubMed], [CAS], Google Scholar18https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXltFGgsrk%253D&md5=f7e80e4bb220c36c4e5fb7e44454657aMolecular force fields with gradient-domain machine learning: Construction and application to dynamics of small molecules with coupled cluster forcesSauceda, Huziel E.; Chmiela, Stefan; Poltavsky, Igor; Mueller, Klaus-Robert; Tkatchenko, AlexandreJournal of Chemical Physics (2019), 150 (11), 114102/1-114102/12CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)We present the construction of mol. force fields for small mols. (less than 25 atoms) using the recently developed symmetrized gradient-domain machine learning (sGDML) approach [Chmiela et al., Nat. Commun. 9, 3887 (2018) and Chmiela et al., Sci. Adv. 3, e1603015 (2017)]. This approach is able to accurately reconstruct complex high-dimensional potential-energy surfaces from just a few 100s of mol. conformations extd. from ab initio mol. dynamics trajectories. The data efficiency of the sGDML approach implies that at. forces for these conformations can be computed with high-level wavefunction-based approaches, such as the "gold std." coupled-cluster theory with single, double and perturbative triple excitations [CCSD(T)]. We demonstrate that the flexible nature of the sGDML model recovers local and non-local electronic interactions (e.g., H-bonding, proton transfer, lone pairs, changes in hybridization states, steric repulsion, and n → π* interactions) without imposing any restriction on the nature of interat. potentials. The anal. of sGDML mol. dynamics trajectories yields new qual. insights into dynamics and spectroscopy of small mols. close to spectroscopic accuracy. (c) 2019 American Institute of Physics.
- 19Schütt, K. T.; Arbabzadah, F.; Chmiela, S.; Müller, K. R.; Tkatchenko, A. Quantum-Chemical Insights from Deep Tensor Neural Networks. Nat. Commun. 2017, 8 (1), 13890, DOI: 10.1038/ncomms13890[Crossref], [PubMed], [CAS], Google Scholar19https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2sXpsVOnsg%253D%253D&md5=6007758e2a0029c5a854673a5451bc7fQuantum-chemical insights from deep tensor neural networksSchuett, Kristof T.; Arbabzadah, Farhad; Chmiela, Stefan; Mueller, Klaus R.; Tkatchenko, AlexandreNature Communications (2017), 8 (), 13890CODEN: NCAOBW; ISSN:2041-1723. (Nature Publishing Group)Learning from data has led to paradigm shifts in a multitude of disciplines, including web, text and image search, speech recognition, as well as bioinformatics. Can machine learning enable similar breakthroughs in understanding quantum many-body systems. Here we develop an efficient deep learning approach that enables spatially and chem. resolved insights into quantum-mech. observables of mol. systems. We unify concepts from many-body Hamiltonians with purpose-designed deep tensor neural networks, which leads to size-extensive and uniformly accurate (1 kcal mol-1) predictions in compositional and configurational chem. space for mols. of intermediate size. As an example of chem. relevance, the model reveals a classification of arom. rings with respect to their stability. Further applications of our model for predicting at. energies and local chem. potentials in mols., reliable isomer energies, and mols. with peculiar electronic structure demonstrate the potential of machine learning for revealing insights into complex quantum-chem. systems.
- 20Schütt, K. T.; Sauceda, H. E.; Kindermans, P.-J.; Tkatchenko, A.; Müller, K. R. SchNet-A Deep Learning Architecture for Molecules and Materials. J. Chem. Phys. 2018, 148 (24), 241722, DOI: 10.1063/1.5019779[Crossref], [PubMed], [CAS], Google Scholar20https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXms1Ggurs%253D&md5=988638d520a423f529a16b35031243aaSchNet - A deep learning architecture for molecules and materialsSchuett, K. T.; Sauceda, H. E.; Kindermans, P.-J.; Tkatchenko, A.; Mueller, K.-R.Journal of Chemical Physics (2018), 148 (24), 241722/1-241722/11CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)Deep learning has led to a paradigm shift in artificial intelligence, including web, text, and image search, speech recognition, as well as bioinformatics, with growing impact in chem. physics. Machine learning, in general, and deep learning, in particular, are ideally suitable for representing quantum-mech. interactions, enabling us to model nonlinear potential-energy surfaces or enhancing the exploration of chem. compd. space. Here, we present the deep learning architecture SchNet that is specifically designed to model atomistic systems by making use of continuous-filter convolutional layers. We demonstrate the capabilities of SchNet by accurately predicting a range of properties across chem. space for mols. and materials, where our model learns chem. plausible embeddings of atom types across the periodic table. Finally, we employ SchNet to predict potential-energy surfaces and energy-conserving force fields for mol. dynamics simulations of small mols. and perform an exemplary study on the quantum-mech. properties of C20-fullerene that would have been infeasible with regular ab initio mol. dynamics. (c) 2018 American Institute of Physics.
- 21Schütt, K. T.; Kessel, P.; Gastegger, M.; Nicoli, K. A.; Tkatchenko, A.; Müller, K. R. SchNetPack: A Deep Learning Toolbox for Atomistic Systems. J. Chem. Theory Comput. 2019, 15 (1), 448– 455, DOI: 10.1021/acs.jctc.8b00908[ACS Full Text
], [CAS], Google Scholar
21https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXitlalsLvN&md5=51b19679412f490ee9260a986371e66bSchNetPack: A Deep Learning Toolbox For Atomistic SystemsSchuett, K. T.; Kessel, P.; Gastegger, M.; Nicoli, K. A.; Tkatchenko, A.; Mueller, K.-R.Journal of Chemical Theory and Computation (2019), 15 (1), 448-455CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)SchNetPack is a toolbox for the development and application of deep neural networks that predict potential energy surfaces and other quantum-chem. properties of mols. and materials. It contains basic building blocks of atomistic neural networks, manages their training, and provides simple access to common benchmark datasets. This allows for an easy implementation and evaluation of new models. For now, SchNetPack includes implementations of (weighted) atom-centered symmetry functions and the deep tensor neural network SchNet, as well as ready-to-use scripts that allow one to train these models on mol. and material datasets. Based on the PyTorch deep learning framework, SchNetPack allows one to efficiently apply the neural networks to large datasets with millions of ref. calcns., as well as parallelize the model across multiple GPUs. Finally, SchNetPack provides an interface to the Atomic Simulation Environment in order to make trained models easily accessible to researchers that are not yet familiar with neural networks. - 22Unke, O. T.; Meuwly, M. PhysNet: A Neural Network for Predicting Energies, Forces, Dipole Moments and Partial Charges. J. Chem. Theory Comput. 2019, 15 (6), 3678– 3693, DOI: 10.1021/acs.jctc.9b00181[ACS Full Text
], [CAS], Google Scholar
22https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXosF2ms7g%253D&md5=77bea45c52d12b93267bc785d3d4b375PhysNet: A Neural Network for Predicting Energies, Forces, Dipole Moments, and Partial ChargesUnke, Oliver T.; Meuwly, MarkusJournal of Chemical Theory and Computation (2019), 15 (6), 3678-3693CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)In recent years, machine learning (ML) methods have become increasingly popular in computational chem. After being trained on appropriate ab initio ref. data, these methods allow for accurately predicting the properties of chem. systems, circumventing the need for explicitly solving the electronic Schrodinger equation. Because of their computational efficiency and scalability to large data sets, deep neural networks (DNNs) are a particularly promising ML algorithm for chem. applications. This work introduces PhysNet, a DNN architecture designed for predicting energies, forces, and dipole moments of chem. systems. PhysNet achieves state-of-the-art performance on the QM9, MD17, and ISO17 benchmarks. Further, two new data sets are generated in order to probe the performance of ML models for describing chem. reactions, long-range interactions, and condensed phase systems. It is shown that explicitly including electrostatics in energy predictions is crucial for a qual. correct description of the asymptotic regions of a potential energy surface (PES). PhysNet models trained on a systematically constructed set of small peptide fragments (at most eight heavy atoms) are able to generalize to considerably larger proteins like deca-alanine (Ala10): The optimized geometry of helical Ala10 predicted by PhysNet is virtually identical to ab initio results (RMSD = 0.21 Å). By running unbiased mol. dynamics (MD) simulations of Ala10 on the PhysNet-PES in gas phase, it is found that instead of a helical structure, Ala10 folds into a "wreath-shaped" configuration, which is more stable than the helical form by 0.46 kcal mol-1 according to the ref. ab initio calcns. - 23Brockherde, F.; Vogt, L.; Li, L.; Tuckerman, M. E.; Burke, K.; Müller, K. R. Bypassing the Kohn-Sham Equations with Machine Learning. Nat. Commun. 2017, 8 (1), 872, DOI: 10.1038/s41467-017-00839-3[Crossref], [PubMed], [CAS], Google Scholar23https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A280%3ADC%252BC1M%252Fps1GhsA%253D%253D&md5=c3e11bea0346fbb9ae06c3f67453f2a2Bypassing the Kohn-Sham equations with machine learningBrockherde Felix; Muller Klaus-Robert; Brockherde Felix; Vogt Leslie; Tuckerman Mark E; Li Li; Burke Kieron; Tuckerman Mark E; Tuckerman Mark E; Burke Kieron; Muller Klaus-Robert; Muller Klaus-RobertNature communications (2017), 8 (1), 872 ISSN:.Last year, at least 30,000 scientific papers used the Kohn-Sham scheme of density functional theory to solve electronic structure problems in a wide variety of scientific fields. Machine learning holds the promise of learning the energy functional via examples, bypassing the need to solve the Kohn-Sham equations. This should yield substantial savings in computer time, allowing larger systems and/or longer time-scales to be tackled, but attempts to machine-learn this functional have been limited by the need to find its derivative. The present work overcomes this difficulty by directly learning the density-potential and energy-density maps for test systems and various molecules. We perform the first molecular dynamics simulation with a machine-learned density functional on malonaldehyde and are able to capture the intramolecular proton transfer process. Learning density models now allows the construction of accurate density functionals for realistic molecular systems.Machine learning allows electronic structure calculations to access larger system sizes and, in dynamical simulations, longer time scales. Here, the authors perform such a simulation using a machine-learned density functional that avoids direct solution of the Kohn-Sham equations.
- 24Petraglia, R.; Nicolaï, A.; Wodrich, M. D.; Ceriotti, M.; Corminboeuf, C. Beyond Static Structures: Putting Forth REMD as a Tool to Solve Problems in Computational Organic Chemistry. J. Comput. Chem. 2016, 37 (1), 83– 92, DOI: 10.1002/jcc.24025[Crossref], [PubMed], [CAS], Google Scholar24https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2MXht1GltbjE&md5=046930a574db03f8e28453700de204aaBeyond static structures: Putting forth REMD as a tool to solve problems in computational organic chemistryPetraglia, Riccardo; Nicolai, Adrien; Wodrich, Matthew D.; Ceriotti, Michele; Corminboeuf, ClemenceJournal of Computational Chemistry (2016), 37 (1), 83-92CODEN: JCCHDD; ISSN:0192-8651. (John Wiley & Sons, Inc.)Computational studies of org. systems are frequently limited to static pictures that closely align with textbook style presentations of reaction mechanisms and isomerization processes. Of course, in reality chem. systems are dynamic entities where a multitude of mol. conformations exists on incredibly complex potential energy surfaces (PES). Here, we borrow a computational technique originally conceived to be used in the context of biol. simulations, together with empirical force fields, and apply it to org. chem. problems. Replica-exchange mol. dynamics (REMD) permits thorough exploration of the PES. We combined REMD with d. functional tight binding (DFTB), thereby establishing the level of accuracy necessary to analyze small mol. systems. Through the study of four prototypical problems: isomer identification, reaction mechanisms, temp.-dependent rotational processes, and catalysis, we reveal new insights and chem. that likely would be missed using static electronic structure computations. The REMD-DFTB methodol. at the heart of this study is powered by i-PI, which efficiently handles the interface between the DFTB and REMD codes. © 2015 Wiley Periodicals, Inc.
- 25Sugita, Y.; Okamoto, Y. Replica-Exchange Molecular Dynamics Method for Protein Folding. Chem. Phys. Lett. 1999, 314, 141– 151, DOI: 10.1016/S0009-2614(99)01123-9[Crossref], [CAS], Google Scholar25https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADyaK1MXotVKrsLc%253D&md5=0fec0ff81ca7806c1e1ac29e5f50ce19Replica-exchange molecular dynamics method for protein foldingSugita, Y.; Okamoto, Y.Chemical Physics Letters (1999), 314 (1,2), 141-151CODEN: CHPLBC; ISSN:0009-2614. (Elsevier Science B.V.)We have developed a formulation for mol. dynamics algorithm for the replica-exchange method. The effectiveness of the method for the protein-folding problem is tested with the penta-peptide Met-enkephalin. The method can overcome the multiple-min. problem by exchanging non-interacting replicas of the system at several temps. From only one simulation run, one can obtain probability distributions in canonical ensemble for a wide temp. range using multiple-histogram re-weighting techniques, which allows the calcn. of any thermodn. quantity as a function of temp. in that range.
- 26Gaus, M.; Cui, Q.; Elstner, M. DFTB3: Extension of the Self-Consistent-Charge Density-Functional Tight-Binding Method (SCC-DFTB). J. Chem. Theory Comput. 2011, 7 (4), 931– 948, DOI: 10.1021/ct100684s[ACS Full Text
], [CAS], Google Scholar
26https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3MXjtVKgu74%253D&md5=179659060fa503023375266a674d02e7DFTB3: Extension of the Self-Consistent-Charge Density-Functional Tight-Binding Method (SCC-DFTB)Gaus, Michael; Cui, Qiang; Elstner, MarcusJournal of Chemical Theory and Computation (2011), 7 (4), 931-948CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)The self-consistent-charge d.-functional tight-binding method (SCC-DFTB) is an approx. quantum chem. method derived from d. functional theory (DFT) based on a second-order expansion of the DFT total energy around a ref. d. In the present study, we combine earlier extensions and improve them consistently with, first, an improved Coulomb interaction between at. partial charges and, second, the complete third-order expansion of the DFT total energy. These modifications lead us to the next generation of the DFTB methodol. called DFTB3, which substantially improves the description of charged systems contg. elements C, H, N, O, and P, esp. regarding hydrogen binding energies and proton affinities. As a result, DFTB3 is particularly applicable to biomol. systems. Remaining challenges and possible solns. are also briefly discussed. - 27Gaus, M.; Goez, A.; Elstner, M. Parametrization and Benchmark of DFTB3 for Organic Molecules. J. Chem. Theory Comput. 2013, 9 (1), 338– 354, DOI: 10.1021/ct300849w[ACS Full Text
], [CAS], Google Scholar
27https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC38Xhs1ent77L&md5=2ddde03653e32a857d7416e755332515Parametrization and Benchmark of DFTB3 for Organic MoleculesGaus, Michael; Goez, Albrecht; Elstner, MarcusJournal of Chemical Theory and Computation (2013), 9 (1), 338-354CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)DFTB3 is a recent extension of the self-consistent-charge d.-functional tight-binding method (SCC-DFTB) and derived from a third order expansion of the d. functional theory (DFT) total energy around a given ref. d. Being applied in combination with the parametrization of its predecessor (MIO), DFTB3 improves for hydrogen binding energies, proton affinities, and hydrogen transfer barriers. In the present study, parameters esp. designed for DFTB3 are presented, and its performance is evaluated for small org. mols. focusing on thermochem., geometries, and vibrational frequencies from our own and several databases from literature. The new parameters remove significant overbinding errors, reduce errors for geometries of noncovalent interactions, and improve the overall performance. - 28Gaus, M.; Lu, X.; Elstner, M.; Cui, Q. Parameterization of DFTB3/3OB for Sulfur and Phosphorus for Chemical and Biological Applications. J. Chem. Theory Comput. 2014, 10 (4), 1518– 1537, DOI: 10.1021/ct401002w[ACS Full Text
], [CAS], Google Scholar
28https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2cXktFeitrw%253D&md5=6eda218757c17fc08cdf820c3c9452eaParameterization of DFTB3/3OB for Sulfur and Phosphorus for Chemical and Biological ApplicationsGaus, Michael; Lu, Xiya; Elstner, Marcus; Cui, QiangJournal of Chemical Theory and Computation (2014), 10 (4), 1518-1537CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)We report the parametrization of the approx. d. functional tight binding method, DFTB3, for sulfur and phosphorus. The parametrization is done in a framework consistent with our previous 3OB set established for O, N, C, and H, thus the resulting parameters can be used to describe a broad set of org. and biol. relevant mols. The 3d orbitals are included in the parametrization, and the electronic parameters are chosen to minimize errors in the atomization energies. The parameters are tested using a fairly diverse set of mols. of biol. relevance, focusing on the geometries, reaction energies, proton affinities, and hydrogen bonding interactions of these mols.; vibrational frequencies are also examd., although less systematically. The results of DFTB3/3OB are compared to those from DFT (B3LYP and PBE), ab initio (MP2, G3B3), and several popular semiempirical methods (PM6 and PDDG), as well as predictions of DFTB3 with the older parametrization (the MIO set). In general, DFTB3/3OB is a major improvement over the previous parametrization (DFTB3/MIO), and for the majority cases tested here, it also outperforms PM6 and PDDG, esp. for structural properties, vibrational frequencies, hydrogen bonding interactions, and proton affinities. For reaction energies, DFTB3/3OB exhibits major improvement over DFTB3/MIO, due mainly to significant redn. of errors in atomization energies; compared to PM6 and PDDG, DFTB3/3OB also generally performs better, although the magnitude of improvement is more modest. Compared to high-level calcns., DFTB3/3OB is most successful at predicting geometries; larger errors are found in the energies, although the results can be greatly improved by computing single point energies at a high level with DFTB3 geometries. There are several remaining issues with the DFTB3/3OB approach, most notably its difficulty in describing phosphate hydrolysis reactions involving a change in the coordination no. of the phosphorus, for which a specific parametrization (3OB/OPhyd) is developed as a temporary soln.; this suggests that the current DFTB3 methodol. has limited transferability for complex phosphorus chem. at the level of accuracy required for detailed mechanistic investigations. Therefore, fundamental improvements in the DFTB3 methodol. are needed for a reliable method that describes phosphorus chem. without ad hoc parameters. Nevertheless, DFTB3/3OB is expected to be a competitive QM method in QM/MM calcns. for studying phosphorus/sulfur chem. in condensed phase systems, esp. as a low-level method that drives the sampling in a dual-level QM/MM framework. - 29Laio, A.; Parrinello, M. Escaping free-energy minima. Proc. Natl. Acad. Sci. U. S. A. 2002, 99 (20), 12562– 12566, DOI: 10.1073/pnas.202427399[Crossref], [PubMed], [CAS], Google Scholar29https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD38XnvFGiurc%253D&md5=48d5bc7436f3ef9d78369671e70fa608Escaping free-energy minimaLaio, Alessandro; Parrinello, MicheleProceedings of the National Academy of Sciences of the United States of America (2002), 99 (20), 12562-12566CODEN: PNASA6; ISSN:0027-8424. (National Academy of Sciences)We introduce a powerful method for exploring the properties of the multidimensional free energy surfaces (FESs) of complex many-body systems by means of coarse-grained non-Markovian dynamics in the space defined by a few collective coordinates. A characteristic feature of these dynamics is the presence of a history-dependent potential term that, in time, fills the min. in the FES, allowing the efficient exploration and accurate detn. of the FES as a function of the collective coordinates. We demonstrate the usefulness of this approach in the case of the dissocn. of a NaCl mol. in water and in the study of the conformational changes of a dialanine in soln.
- 30Grimme, S. Exploration of Chemical Compound, Conformer, and Reaction Space with Meta-Dynamics Simulations Based on Tight-Binding Quantum Chemical Calculations. J. Chem. Theory Comput. 2019, 15 (5), 2847– 2862, DOI: 10.1021/acs.jctc.9b00143[ACS Full Text
], [CAS], Google Scholar
30https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXms1ahs7Y%253D&md5=ec5d26600f13710436a96c608c2b743dExploration of Chemical Compound, Conformer, and Reaction Space with Meta-Dynamics Simulations Based on Tight-Binding Quantum Chemical CalculationsGrimme, StefanJournal of Chemical Theory and Computation (2019), 15 (5), 2847-2862CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)The semiempirical tight-binding based quantum chem. method GFN2-xTB is used in the framework of meta-dynamics (MTD) to globally explore chem. compd., conformer, and reaction space. The biasing potential given as a sum of Gaussian functions is expressed with the root-mean-square-deviation (RMSD) in Cartesian space as a metric for the collective variables. This choice makes the approach robust and generally applicable to three common problems (i.e., conformer search, chem. reaction space exploration in a virtual nanoreactor, and for guessing reaction paths). Because of the inherent locality of the at. RMSD, functional group or fragment selective treatments are possible facilitating the investigation of catalytic processes where, for example, only the substrate is thermally activated. Due to the approx. character of the GFN2-xTB method, the resulting structure ensembles require further refinement with more sophisticated, for example, d. functional or wave function theory methods. However, the approach is extremely efficient running routinely on common laptop computers in minutes to hours of computation time even for realistically sized mols. with a few hundred atoms. Furthermore, the underlying potential energy surface for mols. contg. almost all elements (Z = 1-86) is globally consistent including the covalent dissocn. process and electronically complicated situations in, for example, transition metal systems. As examples, thermal decompn., ethyne oligomerization, the oxidn. of hydrocarbons (by oxygen and a P 450 enzyme model), a Miller-Urey model system, a thermally forbidden dimerization, and a multistep intramol. cyclization reaction are shown. For typical conformational search problems of org. drug mols., the new MTD(RMSD) algorithm yields lower energy structures and more complete conformer ensembles at reduced computational effort compared with its already well performing predecessor. - 31Kaczor, A.; Reva, I. D.; Proniewicz, L. M.; Fausto, R. Importance of Entropy in the Conformational Equilibrium of Phenylalanine: A Matrix-Isolation Infrared Spectroscopy and Density Functional Theory Study. J. Phys. Chem. A 2006, 110 (7), 2360– 2370, DOI: 10.1021/jp0550715[ACS Full Text
], [CAS], Google Scholar
31https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD28Xnslegug%253D%253D&md5=f404ca3987ca47b83de2b44adbeedde3Importance of Entropy in the Conformational Equilibrium of Phenylalanine: A Matrix-Isolation Infrared Spectroscopy and Density Functional Theory StudyKaczor, A.; Reva, I. D.; Proniewicz, L. M.; Fausto, R.Journal of Physical Chemistry A (2006), 110 (7), 2360-2370CODEN: JPCAFH; ISSN:1089-5639. (American Chemical Society)The conformational behavior and IR spectrum of L-phenylalanine were studied by matrix-isolation IR spectroscopy and DFT [B3LYP/6-311++G(d,p)] calcns. The fourteen most stable structures were predicted to differ in energy by less than 10 kJ mol-1, eight of them with abundances higher than 5% at the temp. of evapn. of the compd. (423 K). Exptl. results suggest that six conformers contribute to the spectrum of the isolated compd., whereas two conformers (IIb3 and IIIb3) relax in matrix to a more stable form (IIb2) due to low energy barriers for conformational isomerization (conformational cooling). The two lowest-energy conformers (Ib1, Ia) differ only in the arrangement of the amino acid group relative to the Ph ring; they exhibit a relatively strong stabilizing intramol. hydrogen bond of the O-H···N type and the carboxylic group in the trans configuration (O:C-O-H dihedral angle ca. 180°). Type II conformers have a weaker H-bond of the N-H···O=C type, but they bear the more favorable cis arrangement of the carboxylic group. Being considerably more flexible, type II conformers are stabilized by entropy and the relative abundances of two conformers of this type (IIb2 and IIc1) are shown to significantly increase with temp. due to entropic stabilization. At 423 K, these conformers are found to be the first and third most abundant species present in the conformational equil., with relative populations of ca. 15% each, whereas their populations could be expected to be only ca. 5% if entropy effects were not taken into consideration. Indeed, phenylalanine can be considered a notable example of a mol. where entropy plays an essential role in detg. the relative abundance of the possible low-energy conformational states and then, the thermodn. of the compd., even at moderate temps. Upon UV irradn. (λ > 235 nm) of the matrix-isolated compd., unimol. photodecompn. of phenylalanine is obsd. with prodn. of CO2 and phenethylamine. - 32Ess, D. H.; Wheeler, S. E.; Iafe, R. G.; Xu, L.; Çelebi-Ölçüm, N.; Houk, K. N. Bifurcations on Potential Energy Surfaces of Organic Reactions. Angew. Chem., Int. Ed. 2008, 47 (40), 7592– 7601, DOI: 10.1002/anie.200800918[Crossref], [CAS], Google Scholar32https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD1cXht1GltLzJ&md5=03ccbec3508f982d3f39941887485ef3Bifurcations on potential energy surfaces of organic reactionsEss, Daniel H.; Wheeler, Steven E.; Iafe, Robert G.; Xu, Lai; Celebi-Olcum, Nihan; Houk, Kendall N.Angewandte Chemie, International Edition (2008), 47 (40), 7592-7601CODEN: ACIEF5; ISSN:1433-7851. (Wiley-VCH Verlag GmbH & Co. KGaA)A review. A single transition state may lead to multiple intermediates or products if there is a post-transition-state reaction pathway bifurcation. These bifurcations arise when there are sequential transition states with no intervening energy min. For such systems, the shape of the potential energy surface and dynamic effects, rather than transition-state energetics, control selectivity. This Mini review covers recent investigations of org. reactions exhibiting reaction pathway bifurcations. Such phenomena are surprisingly general and affect exptl. observables such as kinetic isotope effects and product distributions.
- 33Rehbein, J.; Carpenter, B. K. Do We Fully Understand What Controls Chemical Selectivity?. Phys. Chem. Chem. Phys. 2011, 13 (47), 20906, DOI: 10.1039/c1cp22565k[Crossref], [PubMed], [CAS], Google Scholar33https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3MXhsFaiu77L&md5=93dfbfa2f6514a4c343503142b536d6aDo we fully understand what controls chemical selectivity?Rehbein, Julia; Carpenter, Barry K.Physical Chemistry Chemical Physics (2011), 13 (47), 20906-20922CODEN: PPCPFQ; ISSN:1463-9076. (Royal Society of Chemistry)Reaction rates and product selectivity of kinetically controlled reactions are not always sufficiently described by std. RRKM or TST theory. Reactions taking place on potential energy surfaces featuring a valley ridge inflection point belong to this class of reactions. Though various research groups could show that reaction path bifurcations are far from being an exception in org. reactions the underlying principles that govern product distributions of those bifurcating reaction pathways are yet not fully understood. This Perspective has the intention to provide an overview of how far our understanding and the development of the theor. foundation have progressed.
- 34Schreiner, P. R.; Reisenauer, H. P.; Ley, D.; Gerbig, D.; Wu, C.-H.; Allen, W. D. Methylhydroxycarbene: Tunneling Control of a Chemical Reaction. Science 2011, 332 (6035), 1300– 1303, DOI: 10.1126/science.1203761[Crossref], [PubMed], [CAS], Google Scholar34https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3MXntVGltL8%253D&md5=777ec16ef3e1128376c7b12cd2872c85Methylhydroxycarbene: Tunneling Control of a Chemical ReactionSchreiner, Peter R.; Reisenauer, Hans Peter; Ley, David; Gerbig, Dennis; Wu, Chia-Hua; Allen, Wesley D.Science (Washington, DC, United States) (2011), 332 (6035), 1300-1303CODEN: SCIEAS; ISSN:0036-8075. (American Association for the Advancement of Science)Chem. reactivity is conventionally understood in broad terms of kinetic vs. thermodn. control, wherein the decisive factor is the lowest activation barrier among the various reaction paths or the lowest free energy of the final products, resp. We demonstrate that quantum-mech. tunneling can supersede traditional kinetic control and direct a reaction exclusively to a product whose reaction path has a higher barrier. Specifically, we prepd. methylhydroxycarbene (H3C-C-OH) via vacuum pyrolysis of pyruvic acid at about 1200 K (K), followed by argon matrix trapping at 11 K. The previously elusive carbene, characterized by UV and IR spectroscopy as well as exacting quantum-mech. computations, undergoes a facile [1,2]hydrogen shift to acetaldehyde via tunneling under a barrier of 28.0 kcal per mol (kcal mol-1), with a half-life of around 1 h. The analogous isomerization to vinyl alc. has a substantially lower barrier of 22.6 kcal mol-1 but is precluded at low temp. by the greater width of the potential energy profile for tunneling.
- 35Plata, R. E.; Singleton, D. A. A Case Study of the Mechanism of Alcohol-Mediated Morita Baylis-Hillman Reactions. The Importance of Experimental Observations. J. Am. Chem. Soc. 2015, 137 (11), 3811– 3826, DOI: 10.1021/ja5111392[ACS Full Text
], [CAS], Google Scholar
35https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2MXjsVWksb8%253D&md5=dcd1973465edea3142926862a225beaeA Case Study of the Mechanism of Alcohol-Mediated Morita Baylis-Hillman Reactions. The Importance of Experimental ObservationsPlata, R. Erik; Singleton, Daniel A.Journal of the American Chemical Society (2015), 137 (11), 3811-3826CODEN: JACSAT; ISSN:0002-7863. (American Chemical Society)The mechanism of the Morita Baylis-Hillman reaction has been heavily studied in the literature, and a long series of computational studies have defined complete theor. energy profiles in these reactions. We employ here a combination of mechanistic probes, including the observation of intermediates, the independent generation and partitioning of intermediates, thermodn. and kinetic measurements on the main reaction and side reactions, isotopic incorporation from solvent, and kinetic isotope effects, to define the mechanism and an exptl. mechanistic free-energy profile for a prototypical Morita Baylis-Hillman reaction in methanol. The results are then used to critically evaluate the ability of computations to predict the mechanism. The most notable prediction of the many computational studies, that of a proton-shuttle pathway, is refuted in favor of a simple but computationally intractable acid-base mechanism. Computational predictions vary vastly, and it is not clear that any significant accurate information that was not already apparent from expt. could have been garnered from computations. With care, entropy calcns. are only a minor contributor to the larger computational error, while literature entropy-correction processes lead to absurd free-energy predictions. The computations aid in interpreting observations but fail utterly as a replacement for expt. - 36Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning; Springer Series in Statistics; Springer New York: New York, NY, 2009.
- 37Fukunishi, H.; Watanabe, O.; Takada, S. On the Hamiltonian Replica Exchange Method for Efficient Sampling of Biomolecular Systems: Application to Protein Structure Prediction. J. Chem. Phys. 2002, 116 (20), 9058– 9067, DOI: 10.1063/1.1472510[Crossref], [CAS], Google Scholar37https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD38XjsFKmsLo%253D&md5=7ac571a5afdd63b0b4b29cfdec06f53bOn the Hamiltonian replica exchange method for efficient sampling of biomolecular systems: Application to protein structure predictionFukunishi, Hiroaki; Watanabe, Osamu; Takada, ShojiJournal of Chemical Physics (2002), 116 (20), 9058-9067CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)Motivated by the protein structure prediction problem, we develop two variants of the Hamiltonian replica exchange methods (REMs) for efficient configuration sampling, (1) the scaled hydrophobicity REM and (2) the phantom chain REM, and compare their performance with the ordinary REM. We first point out that the ordinary REM has a shortage for the application to large systems such as biomols. and that the Hamiltonian REM, an alternative formulation of the REM, can give a remedy for it. We then propose two examples of the Hamiltonian REM that are suitable for a coarse-grained protein model. (1) The scaled hydrophobicity REM preps. replicas that are characterized by various strengths of hydrophobic interaction. The strongest interaction that mimics aq. soln. environment makes proteins folding, while weakened hydrophobicity unfolds proteins as in org. solvent. Exchange between these environments enables proteins to escape from misfolded traps and accelerate conformational search. This resembles the roles of mol. chaperone that assist proteins to fold in vivo. (2) The phantom chain REM uses replicas that allow various degrees of at. overlaps. By allowing at. overlap in some of replicas, the peptide chain can cross over itself, which can accelerate conformation sampling. Using a coarse-gained model we developed, we compute equil. probability distributions for poly-alanine 16-mer and for a small protein by these REMs and compare the accuracy of the results. We see that the scaled hydrophobicity REM is the most efficient method among the three REMs studied.
- 38Okur, A.; Roe, D. R.; Cui, G.; Hornak, V.; Simmerling, C. Improving Convergence of Replica Exchange Simulations through Coupling to a High-Temperature Structure Reservoir. J. Chem. Theory Comput. 2007, 3 (2), 557– 568, DOI: 10.1021/ct600263e[ACS Full Text
], [CAS], Google Scholar
38https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD2sXjtVeitQ%253D%253D&md5=94557a5a1fc0d6750f988638d3ced8aeImproving Convergence of Replica-Exchange Simulations through Coupling to a High-Temperature Structure ReservoirOkur, Asim; Roe, Daniel R.; Cui, Guanglei; Hornak, Viktor; Simmerling, CarlosJournal of Chemical Theory and Computation (2007), 3 (2), 557-568CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)Parallel tempering or replica-exchange mol. dynamics (REMD) significantly increases the efficiency of conformational sampling for complex mol. systems. However, obtaining converged data with REMD remains challenging, esp. for large systems with complex topologies. We propose a new variant to REMD where the replicas are also permitted to exchange with an ensemble of structures that have been generated in advance using high-temp. MD simulations, similar in spirit to J-walking methods. We tested this approach on two non-trivial model systems, a β-hairpin and a 3-stranded β-sheet and compared the results to those obtained from very long ( > 100 ns) std. REMD simulations. The resulting ensembles were indistinguishable, including relative populations of different conformations on the unfolded state. The use of the reservoir is shown to significantly reduce the time required for convergence. - 39Brémond, É.; Golubev, N.; Steinmann, S. N.; Corminboeuf, C. How Important Is Self-Consistency for the dDsC Density Dependent Dispersion Correction?. J. Chem. Phys. 2014, 140, 18A516, DOI: 10.1063/1.4867195
- 40Petraglia, R.; Steinmann, S. N.; Corminboeuf, C. A Fast Charge-Dependent Atom-Pairwise Dispersion Correction for DFTB3. Int. J. Quantum Chem. 2015, 115 (18), 1265– 1272, DOI: 10.1002/qua.24887[Crossref], [CAS], Google Scholar40https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2MXks12gtbo%253D&md5=766623367f245520ce3076f429544665A fast charge-Dependent atom-pairwise dispersion correction for DFTB3Petraglia, Riccardo; Steinmann, Stephan N.; Corminboeuf, ClemenceInternational Journal of Quantum Chemistry (2015), 115 (18), 1265-1272CODEN: IJQCB2; ISSN:0020-7608. (John Wiley & Sons, Inc.)Org. electronic materials remarkably illustrate the importance of the "weak" dispersion interactions that are neglected in the most cost-efficient electronic structure approaches. This work introduces a fast atom-pairwise dispersion correction, dDMC that is compatible with the most recent variant of self-consistent charge d. functional tight binding (SCC-DFTB). The emphasis is placed on improving the description of π-π stacked motifs featuring sulfur-contg. mols. that are known to be esp. challenging for DFTB. Our scheme relies upon the use of Mulliken charges using minimal basis set that are readily available from the DFTB computations at no addnl. cost. The performance and efficiency of the dDMC correction are validated on examples targeting energies, geometries, and mol. dynamic trajectories.
- 41Mashraqui, S. H.; Sangvikar, Y. S.; Meetsma, A. Synthesis and Structures of Thieno[2,3-b]Thiophene Incorporated [3.3]Dithiacyclophanes. Enhanced First Hyperpolarizability in an Unsymmetrically Polarized Cyclophane. Tetrahedron Lett. 2006, 47 (31), 5599– 5602, DOI: 10.1016/j.tetlet.2006.05.098[Crossref], [CAS], Google Scholar41https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD28XmsVyjs7k%253D&md5=db81a2a4f340553bb169377f90323e7bSynthesis and structures of thieno[2,3-b]thiophene incorporated [3.3]dithiacyclophanes. Enhanced first hyperpolarizability in an unsymmetrically polarized cyclophaneMashraqui, Sabir H.; Sangvikar, Yogesh S.; Meetsma, AukeTetrahedron Letters (2006), 47 (31), 5599-5602CODEN: TELEAY; ISSN:0040-4039. (Elsevier B.V.)Dithiacyclophanes incorporating thieno[2,3-b]thiophene have been synthesized, in order to investigate the nonlinear optical properties of donor-acceptor cyclophane I. I displayed significantly higher first hyperpolarizability β (21.6 × 10-30 esu) compared to model II (9.58 × 10-30esu). Relatively higher β in I presumably arises from an extra electron redistribution arising from through-space charge transfer, a feature lacking in II. Moreover, the thermal decompn. temp. of I (300 °C) is higher than that reported for the NLO prototype DANS (295 °C).
- 42Dijkstra, G. D. H.; Kellogg, R. M.; Wynberg, H.; Svendsen, J. S.; Marko, I.; Sharpless, K. B. Conformational Study of Cinchona Alkaloids. A Combined NMR, Molecular Mechanics and x-Ray Approach. J. Am. Chem. Soc. 1989, 111 (21), 8069, DOI: 10.1021/ja00203a001[ACS Full Text
], [CAS], Google Scholar
42https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADyaL1MXlslCnu7g%253D&md5=700cb5ab06c4760120e47704e698111aConformational study of cinchona alkaloids. A combined NMR, molecular mechanics and x-ray approachDijkstra, Gerard D. H.; Kellogg, Richard M.; Wynberg, Hans; Svendsen, John S.; Marko, Istvan; Sharpless, K. BarryJournal of the American Chemical Society (1989), 111 (21), 8069-76CODEN: JACSAT; ISSN:0002-7863.A conformational study of cinchona alkaloids is presented. The conformations of the alkaloids in soln. have been detd. by NOE difference and NOESY NMR technique. The necessary chem. shift assignments of the cinchona alkaloids have been elucidated by combination of COSY and NOE NMR expts. Min. energy conformations of the alkaloids have been calcd. by mol. mechanics. The first known x-ray structure of a cinchona alkaloid with a closed conformation is presented. Finally, the conformational effects of protonation and complexation on the tertiary nitrogen of the quinuclidine ring have been investigated. - 43Zhou, J.; Wakchaure, V.; Kraft, P.; List, B. Primary-Amine-Catalyzed Enantioselective Intramolecular Aldolizations. Angew. Chem., Int. Ed. 2008, 47 (40), 7656, DOI: 10.1002/anie.200802497[Crossref], [CAS], Google Scholar43https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD1cXht1GltL3I&md5=b400840d85d1f3ddc2f7858dd87f5779Primary-amine-catalyzed enantioselective intramolecular aldolizationsZhou, Jian; Wakchaure, Vijay; Kraft, Philip; List, BenjaminAngewandte Chemie, International Edition (2008), 47 (40), 7656-7658CODEN: ACIEF5; ISSN:1433-7851. (Wiley-VCH Verlag GmbH & Co. KGaA)Aldol cyclodehydration of 4-substituted-2,6-heptanediones leads to enantiomerically enriched 5-substituted-3-methyl-2-cyclohexene-1-ones, which serve as perfume ingredients and valuable synthetic building blocks. Primary amines derived from cinchona alkaloids in combination with acetic acid are efficient catalysts for this transformation, which deliver both enantiomers of the celery ketone.
- 44Ramakrishnan, R.; Dral, P. O.; Rupp, M.; von Lilienfeld, O. A. Big Data Meets Quantum Chemistry Approximations: The Δ-Machine Learning Approach. J. Chem. Theory Comput. 2015, 11 (5), 2087– 2096, DOI: 10.1021/acs.jctc.5b00099[ACS Full Text
], [CAS], Google Scholar
44https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2MXmtlams7Y%253D&md5=a59b33f51a9dd6dbad95290f2642c306Big Data Meets Quantum Chemistry Approximations: The Δ-Machine Learning ApproachRamakrishnan, Raghunathan; Dral, Pavlo O.; Rupp, Matthias; von Lilienfeld, O. AnatoleJournal of Chemical Theory and Computation (2015), 11 (5), 2087-2096CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)Chem. accurate and comprehensive studies of the virtual space of all possible mols. are severely limited by the computational cost of quantum chem. We introduce a composite strategy that adds machine learning corrections to computationally inexpensive approx. legacy quantum methods. After training, highly accurate predictions of enthalpies, free energies, entropies, and electron correlation energies are possible, for significantly larger mol. sets than used for training. For thermochem. properties of up to 16k isomers of C7H10O2 we present numerical evidence that chem. accuracy can be reached. We also predict electron correlation energy in post Hartree-Fock methods, at the computational cost of Hartree-Fock, and we establish a qual. relationship between mol. entropy and electron correlation. The transferability of our approach is demonstrated, using semiempirical quantum chem. and machine learning models trained on 1 and 10% of 134k org. mols., to reproduce enthalpies of all remaining mols. at d. functional theory level of accuracy. - 45Fabregat, R. Modular Replica Exchange Simulatior. http://doi.org/10.5281/zenodo.3630553, 2020.Google ScholarThere is no corresponding record for this reference.
- 46Elstner, M.; Hobza, P.; Frauenheim, T.; Suhai, S.; Kaxiras, E. Hydrogen Bonding and Stacking Interactions of Nucleic Acid Base Pairs: A Density-Functional-Theory Based Treatment. J. Chem. Phys. 2001, 114 (12), 5149– 5155, DOI: 10.1063/1.1329889[Crossref], [CAS], Google Scholar46https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD3MXhvV2ktbk%253D&md5=6aa6bf45783f7706d44fa514092a2bccHydrogen bonding and stacking interactions of nucleic acid base pairs: A density-functional-theory based treatmentElstner, Marcus; Hobza, Pavel; Frauenheim, Thomas; Suhai, Sandor; Kaxiras, EfthimiosJournal of Chemical Physics (2001), 114 (12), 5149-5155CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)We extend an approx. d. functional theory (DFT) method for the description of long-range dispersive interactions which are normally neglected by construction, irresp. of the correlation function applied. An empirical formula, consisting of an R-6 term is introduced, which is appropriately damped for short distances; the corresponding C6 coeff., which is calcd. from exptl. at. polarizabilities, can be consistently added to the total energy expression of the method. We apply this approx. DFT plus dispersion energy method to describe the hydrogen bonding and stacking interactions of nucleic acid base pairs. Comparison to MP2/6-31G*(0.25) results shows that the method is capable of reproducing hydrogen bonding as well as the vertical and twist dependence of the interaction energy very accurately.
- 47Aradi, B.; Hourahine, B.; Frauenheim, T. DFTB+, a Sparse Matrix-Based Implementation of the DFTB Method. J. Phys. Chem. A 2007, 111 (26), 5678– 5684, DOI: 10.1021/jp070186p[ACS Full Text
], [CAS], Google Scholar
47https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD2sXmsVaju7c%253D&md5=f75a28788c2a83fdb00bb395178548ddDFTB+, a Sparse Matrix-Based Implementation of the DFTB MethodAradi, B.; Hourahine, B.; Frauenheim, Th.Journal of Physical Chemistry A (2007), 111 (26), 5678-5684CODEN: JPCAFH; ISSN:1089-5639. (American Chemical Society)A new Fortran 95 implementation of the DFTB (d. functional-based tight binding) method was developed, where the sparsity of the DFTB system of equations was exploited. Conventional dense algebra was used only to evaluate the eigenproblems of the system and long-range Coulombic terms, but drop-in O(N) or O(N2) modules were planned to replace the small code sections that these entail. The developed sparse storage structure is discussed in detail, and a short overview of other features of the new code is given. - 48Riplinger, C.; Neese, F. An Efficient and near Linear Scaling Pair Natural Orbital Based Local Coupled Cluster Method. J. Chem. Phys. 2013, 138 (3), 034106, DOI: 10.1063/1.4773581[Crossref], [PubMed], [CAS], Google Scholar48https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3sXpslOqtw%253D%253D&md5=4327115b95524107245acb44ff4aaa7bAn efficient and near linear scaling pair natural orbital based local coupled cluster methodRiplinger, Christoph; Neese, FrankJournal of Chemical Physics (2013), 138 (3), 034106/1-034106/18CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)In previous publications, it was shown that an efficient local coupled cluster method with single- and double excitations can be based on the concept of pair natural orbitals (PNOs) . The resulting local pair natural orbital-coupled-cluster single double (LPNO-CCSD) method has since been proven to be highly reliable and efficient. For large mols., the no. of amplitudes to be detd. is reduced by a factor of 105-106 relative to a canonical CCSD calcn. on the same system with the same basis set. In the original method, the PNOs were expanded in the set of canonical virtual orbitals and single excitations were not truncated. This led to a no. of fifth order scaling steps that eventually rendered the method computationally expensive for large mols. (e.g., >100 atoms). In the present work, these limitations are overcome by a complete redesign of the LPNO-CCSD method. The new method is based on the combination of the concepts of PNOs and projected AOs (PAOs). Thus, each PNO is expanded in a set of PAOs that in turn belong to a given electron pair specific domain. In this way, it is possible to fully exploit locality while maintaining the extremely high compactness of the original LPNO-CCSD wavefunction. No terms are dropped from the CCSD equations and domains are chosen conservatively. The correlation energy loss due to the domains remains below <0.05%, which implies typically 15-20 but occasionally up to 30 atoms per domain on av. The new method has been given the acronym DLPNO-CCSD ("domain based LPNO-CCSD"). The method is nearly linear scaling with respect to system size. The original LPNO-CCSD method had three adjustable truncation thresholds that were chosen conservatively and do not need to be changed for actual applications. In the present treatment, no addnl. truncation parameters have been introduced. Any addnl. truncation is performed on the basis of the three original thresholds. There are no real-space cutoffs. Single excitations are truncated using singles-specific natural orbitals. Pairs are prescreened according to a multipole expansion of a pair correlation energy est. based on local orbital specific virtual orbitals (LOSVs). Like its LPNO-CCSD predecessor, the method is completely of black box character and does not require any user adjustments. It is shown here that DLPNO-CCSD is as accurate as LPNO-CCSD while leading to computational savings exceeding one order of magnitude for larger systems. The largest calcns. reported here featured >8800 basis functions and >450 atoms. In all larger test calcns. done so far, the LPNO-CCSD step took less time than the preceding Hartree-Fock calcn., provided no approxns. have been introduced in the latter. Thus, based on the present development reliable CCSD calcns. on large mols. with unprecedented efficiency and accuracy are realized. (c) 2013 American Institute of Physics.
- 49Neese, F.; Valeev, E. F. Revisiting the Atomic Natural Orbital Approach for Basis Sets: Robust Systematic Basis Sets for Explicitly Correlated and Conventional Correlated ab initio Methods?. J. Chem. Theory Comput. 2011, 7 (1), 33– 43, DOI: 10.1021/ct100396y[ACS Full Text
], [CAS], Google Scholar
49https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3cXhsFSkurjE&md5=65cebb90f419ae18f35f28de2e1169e7Revisiting the Atomic Natural Orbital Approach for Basis Sets: Robust Systematic Basis Sets for Explicitly Correlated and Conventional Correlated ab initio Methods?Neese, Frank; Valeev, Edward F.Journal of Chemical Theory and Computation (2011), 7 (1), 33-43CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)The performance of several families of basis sets for correlated wave function calcns. on mols. is studied. The widely used correlation-consistent basis set family cc-pVXZ (n = D, T, Q, 5) is compared to a systematic series of at. natural orbital basis sets (ano-pVXZ). These basis sets are built from the cc-pV6Z primitives in at. multireference av. coupled pair functional (MR-ACPF) calcns. Segmented basis sets optimized for SCF calcns. (def2-SVP, def2-TZVPP, and def2-QZVPP as well as "pc-n", n = 1, 2, 3) were also tested. Ref. Hartree-Fock energies are detd. with the uncontracted aug-cc-pV6Z basis set for a set of 21 small mols. built from H, B, C, N, O, and F. Ref. coupled cluster CCSD(T) correlation energies were detd. from extrapolation at the cc-pV5Z/cc-pV6Z level. It is found that the ano-pVXZ basis sets outperform the other basis sets. The error in the SCF energies compared to cc-pVXZ basis sets is reduced by about a factor of 3 at each cardinal no. In addn., the ano-pVXZ consistently recovers more correlation energy than their competitors at each cardinal no. The ability of the four families of basis sets to extrapolate SCF and correlation energies to the basis set limit has been investigated. A conclusion by Truhlar is confirmed that the optimum exponent for correlation energy extrapolations at the DZ/TZ level is ∼2.4. All TZ/QZ basis set pairs lead to an optimum exponent close to the expected value of 3. The SCF energy extrapolation proposed by Petersson and co-workers is found to be effective. At the DZ/TZ level, errors in total energies of less than 2 mEh are found for the test set, while at the TZ/QZ level one obtains the total energies within ∼0.3 mEh of the basis set limit. For extrapolation, the "cc" and "ano" bases are found to be similarly successful. Extrapolation results were compared to explicitly correlated calcns. with dedicated basis sets (cc-pVXZ-F12) as well as the ano-pVXZ bases. It is found that the ano-pVXZ + basis sets perform as well as the cc-pVXZ-F12 family (both are of comparable size); addnl. improvement should be possible by reoptimizing the ANO basis sets for explicitly correlated calcns. The error of the extrapolated energies is about 2-3 times smaller than what was found in the explicitly correlated calcns. However, the error in the explicitly correlated calcns. is more systematic, and hence the same conclusion may not hold for the computation of energy differences. - 50Neese, F. The ORCA Program System. Wiley Interdiscip. Rev.: Comput. Mol. Sci. 2012, 2 (1), 73– 78, DOI: 10.1002/wcms.81[Crossref], [CAS], Google Scholar50https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC38XhvFGls7s%253D&md5=a753e33a6f9a326553295596f5c754e5The ORCA program systemNeese, FrankWiley Interdisciplinary Reviews: Computational Molecular Science (2012), 2 (1), 73-78CODEN: WIRCAH; ISSN:1759-0884. (Wiley-Blackwell)A review. ORCA is a general-purpose quantum chem. program package that features virtually all modern electronic structure methods (d. functional theory, many-body perturbation and coupled cluster theories, and multireference and semiempirical methods). It is designed with the aim of generality, extendibility, efficiency, and user friendliness. Its main field of application is larger mols., transition metal complexes, and their spectroscopic properties. ORCA uses std. Gaussian basis functions and is fully parallelized. The article provides an overview of its current possibilities and documents its efficiency.
- 51Dunning, T. H. Gaussian Basis Sets for Use in Correlated Molecular Calculations. I. The Atoms Boron through Neon and Hydrogen. J. Chem. Phys. 1989, 90 (2), 1007– 1023, DOI: 10.1063/1.456153[Crossref], [CAS], Google Scholar51https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADyaL1MXksVGmtrk%253D&md5=c6cd67a3748dc61692a9cb622d2694a0Gaussian basis sets for use in correlated molecular calculations. I. The atoms boron through neon and hydrogenDunning, Thom H., Jr.Journal of Chemical Physics (1989), 90 (2), 1007-23CODEN: JCPSA6; ISSN:0021-9606.Guided by the calcns. on oxygen in the literature, basis sets for use in correlated at. and mol. calcns. were developed for all of the first row atoms from boron through neon, and for hydrogen. As in the oxygen atom calcns., the incremental energy lowerings, due to the addn. of correlating functions, fall into distinct groups. This leads to the concept of correlation-consistent basis sets, i.e., sets which include all functions in a given group as well as all functions in any higher groups. Correlation-consistent sets are given for all of the atoms considered. The most accurate sets detd. in this way, [5s4p3d2f1g], consistently yield 99% of the correlation energy obtained with the corresponding at.-natural-orbital sets, even though the latter contains 50% more primitive functions and twice as many primitive polarization functions. It is estd. that this set yields 94-97% of the total (HF + 1 + 2) correlation energy for the atoms neon through boron.
- 52Kendall, R. A.; Dunning, T. H.; Harrison, R. J. Electron Affinities of the First-Row Atoms Revisited. Systematic Basis Sets and Wave Functions. J. Chem. Phys. 1992, 96 (9), 6796– 6806, DOI: 10.1063/1.462569[Crossref], [CAS], Google Scholar52https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADyaK38XktFClurw%253D&md5=948a06eee10604a8fa37eae2b2ada4beElectron affinities of the first-row atoms revisited. Systematic basis sets and wave functionsKendall, Rick A.; Dunning, Thom H., Jr.; Harrison, Robert J.Journal of Chemical Physics (1992), 96 (9), 6796-806CODEN: JCPSA6; ISSN:0021-9606.The authors describe a reliable procedure for calcg. the electron affinity of an atom and present results for H, B, C, O, and F (H is included for completeness). This procedure involves the use of the recently proposed correlation-consistent basis sets augmented with functions to describe the more diffuse character of the at. anion coupled with a straightforward, uniform expansion of the ref. space for multireference singles and doubles configuration-interaction (MRSD-CI) calcns. A comparison is given with previous results and with corresponding full CI calcns. The most accurate EAs obtained from the MRSD-CI calcns. are (with exptl. values in parentheses): H 0.740 eV (0.754), B 0.258 (0.277), C 1.245 (1.263), O 1.384 (1.461), and F 3.337 (3.401). The EAs obtained from the MR-SDCI calcns. differ by less than 0.03 eV from those predicted by the full CI calcns.
- 53Adamo, C.; Barone, V. Toward Reliable Density Functional Methods without Adjustable Parameters: The PBE0Model. J. Chem. Phys. 1999, 110 (13), 6158– 6170, DOI: 10.1063/1.478522[Crossref], [CAS], Google Scholar53https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADyaK1MXitVCmt7Y%253D&md5=cad4185c69f9232753497f5203d6dc9fToward reliable density functional methods without adjustable parameters: the PBE0 modelAdamo, Carlo; Barone, VincenzoJournal of Chemical Physics (1999), 110 (13), 6158-6170CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)We present an anal. of the performances of a parameter free d. functional model (PBE0) obtained combining the so called PBE generalized gradient functional with a predefined amt. of exact exchange. The results obtained for structural, thermodn., kinetic and spectroscopic (magnetic, IR and electronic) properties are satisfactory and not far from those delivered by the most reliable functionals including heavy parameterization. The way in which the functional is derived and the lack of empirical parameters fitted to specific properties make the PBE0 model a widely applicable method for both quantum chem. and condensed matter physics.
- 54Grimme, S.; Antony, J.; Ehrlich, S.; Krieg, H. A Consistent and Accurate Ab Initio Parametrization of Density Functional Dispersion Correction (DFT-D) for the 94 Elements H-Pu. J. Chem. Phys. 2010, 132 (15), 154104, DOI: 10.1063/1.3382344[Crossref], [PubMed], [CAS], Google Scholar54https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3cXkvVyks7o%253D&md5=2bca89d904579d5565537a0820dc2ae8A consistent and accurate ab initio parametrization of density functional dispersion correction (DFT-D) for the 94 elements H-PuGrimme, Stefan; Antony, Jens; Ehrlich, Stephan; Krieg, HelgeJournal of Chemical Physics (2010), 132 (15), 154104/1-154104/19CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)The method of dispersion correction as an add-on to std. Kohn-Sham d. functional theory (DFT-D) has been refined regarding higher accuracy, broader range of applicability, and less empiricism. The main new ingredients are atom-pairwise specific dispersion coeffs. and cutoff radii that are both computed from first principles. The coeffs. for new eighth-order dispersion terms are computed using established recursion relations. System (geometry) dependent information is used for the first time in a DFT-D type approach by employing the new concept of fractional coordination nos. (CN). They are used to interpolate between dispersion coeffs. of atoms in different chem. environments. The method only requires adjustment of two global parameters for each d. functional, is asymptotically exact for a gas of weakly interacting neutral atoms, and easily allows the computation of at. forces. Three-body nonadditivity terms are considered. The method has been assessed on std. benchmark sets for inter- and intramol. noncovalent interactions with a particular emphasis on a consistent description of light and heavy element systems. The mean abs. deviations for the S22 benchmark set of noncovalent interactions for 11 std. d. functionals decrease by 15%-40% compared to the previous (already accurate) DFT-D version. Spectacular improvements are found for a tripeptide-folding model and all tested metallic systems. The rectification of the long-range behavior and the use of more accurate C6 coeffs. also lead to a much better description of large (infinite) systems as shown for graphene sheets and the adsorption of benzene on an Ag(111) surface. For graphene it is found that the inclusion of three-body terms substantially (by about 10%) weakens the interlayer binding. We propose the revised DFT-D method as a general tool for the computation of the dispersion energy in mols. and solids of any kind with DFT and related (low-cost) electronic structure methods for large systems. (c) 2010 American Institute of Physics.
- 55Ufimtsev, I. S.; Martinez, T. J. Quantum Chemistry on Graphical Processing Units. 3. Analytical Energy Gradients, Geometry Optimization, and First Principles Molecular Dynamics. J. Chem. Theory Comput. 2009, 5 (10), 2619– 2628, DOI: 10.1021/ct9003004[ACS Full Text
], [CAS], Google Scholar
55https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD1MXhtVelurvJ&md5=f4e320ac3479e4b566a6b8bdb9e5add8Quantum Chemistry on Graphical Processing Units. 3. Analytical Energy Gradients, Geometry Optimization, and First Principles Molecular DynamicsUfimtsev, Ivan S.; Martinez, Todd J.Journal of Chemical Theory and Computation (2009), 5 (10), 2619-2628CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)We demonstrate that a video gaming machine contg. two consumer graphical cards can outpace a state-of-the-art quad-core processor workstation by a factor of more than 180× in Hartree-Fock energy + gradient calcns. Such performance makes it possible to run large scale Hartree-Fock and D. Functional Theory calcns., which typically require hundreds of traditional processor cores, on a single workstation. Benchmark Born-Oppenheimer mol. dynamics simulations are performed on two mol. systems using the 3-21G basis set - a hydronium ion solvated by 30 waters (94 atoms, 405 basis functions) and an aspartic acid mol. solvated by 147 waters (457 atoms, 2014 basis functions). Our GPU implementation can perform 27 ps/day and 0.7 ps/day of ab initio mol. dynamics simulation on a single desktop computer for these systems. - 56Titov, A. V.; Ufimtsev, I. S.; Luehr, N.; Martinez, T. J. Generating Efficient Quantum Chemistry Codes for Novel Architectures. J. Chem. Theory Comput. 2013, 9 (1), 213– 221, DOI: 10.1021/ct300321a[ACS Full Text
], [CAS], Google Scholar
56https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC38XhsFygtb7E&md5=8f7e43b6920dea4df1743eef3094401dGenerating Efficient Quantum Chemistry Codes for Novel ArchitecturesTitov, Alexey V.; Ufimtsev, Ivan S.; Luehr, Nathan; Martinez, Todd J.Journal of Chemical Theory and Computation (2013), 9 (1), 213-221CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)We describe an extension of our graphics processing unit (GPU) electronic structure program TeraChem to include atom-centered Gaussian basis sets with d angular momentum functions. This was made possible by a "meta-programming" strategy that leverages computer algebra systems for the derivation of equations and their transformation to correct code. We generate a multitude of code fragments that are formally math. equiv., but differ in their memory and floating-point operation footprints. We then select between different code fragments using empirical testing to find the highest performing code variant. This leads to an optimal balance of floating-point operations and memory bandwidth for a given target architecture without laborious manual tuning. We show that this approach is capable of similar performance compared to our hand-tuned GPU kernels for basis sets with s and p angular momenta. We also demonstrate that mixed precision schemes (using both single and double precision) remain stable and accurate for mols. with d functions. We provide benchmarks of the execution time of entire SCF calcns. using our GPU code and compare to mature CPU based codes, showing the benefits of the GPU architecture for electronic structure theory with appropriately redesigned algorithms. We suggest that the meta-programming and empirical performance optimization approach may be important in future computational chem. applications, esp. in the face of quickly evolving computer architectures. - 57Pearlman, D. A.; Case, D. A.; Caldwell, J. W.; Ross, W. S.; Cheatham, T. E.; DeBolt, S.; Ferguson, D.; Seibel, G.; Kollman, P. AMBER, a Package of Computer Programs for Applying Molecular Mechanics, Normal Mode Analysis, Molecular Dynamics and Free Energy Calculations to Simulate the Structural and Energetic Properties of Molecules. Comput. Phys. Commun. 1995, 91 (1), 1– 41, DOI: 10.1016/0010-4655(95)00041-D[Crossref], [CAS], Google Scholar57https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADyaK2MXps1Wrtrw%253D&md5=8dc71939a46bbc17d5da08782d4e6ec8"AMBER", a package of computer programs for applying molecular mechanics, normal mode analysis, molecular dynamics and free energy calculations to stimulate the structural and energetic properties of moleculesPearlman, David A.; Case, David A.; Caldwell, James W.; Ross, Wilson S.; Cheatham, Thomas E. III; DeBolt, Steve; Ferguson, David; Seibel, George; Kollman, PeterComputer Physics Communications (1995), 91 (1-3), 1-42CODEN: CPHCBZ; ISSN:0010-4655. (Elsevier)We describe the development, current features, and some directions for future development of the AMBER package of computer programs. This package has evolved from a program that was constructed to do Assisted Model Building and Energy Refinement to a group of programs embodying a no. of the powerful tools of modern computational chem.-mol. dynamics and free energy calcns.
- 58Marenich, A. V.; Cramer, C. J.; Truhlar, D. G. Universal Solvation Model Based on Solute Electron Density and on a Continuum Model of the Solvent Defined by the Bulk Dielectric Constant and Atomic Surface Tensions. J. Phys. Chem. B 2009, 113 (18), 6378– 6396, DOI: 10.1021/jp810292n[ACS Full Text
], [CAS], Google Scholar
58https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD1MXksV2is74%253D&md5=54931a64c70d28445ee53876a8b1a4b9Universal Solvation Model Based on Solute Electron Density and on a Continuum Model of the Solvent Defined by the Bulk Dielectric Constant and Atomic Surface TensionsMarenich, Aleksandr V.; Cramer, Christopher J.; Truhlar, Donald G.Journal of Physical Chemistry B (2009), 113 (18), 6378-6396CODEN: JPCBFK; ISSN:1520-6106. (American Chemical Society)We present a new continuum solvation model based on the quantum mech. charge d. of a solute mol. interacting with a continuum description of the solvent. The model is called SMD, where the "D" stands for "d." to denote that the full solute electron d. is used without defining partial at. charges. "Continuum" denotes that the solvent is not represented explicitly but rather as a dielec. medium with surface tension at the solute-solvent boundary. SMD is a universal solvation model, where "universal" denotes its applicability to any charged or uncharged solute in any solvent or liq. medium for which a few key descriptors are known (in particular, dielec. const., refractive index, bulk surface tension, and acidity and basicity parameters). The model separates the observable solvation free energy into two main components. The first component is the bulk electrostatic contribution arising from a self-consistent reaction field treatment that involves the soln. of the nonhomogeneous Poisson equation for electrostatics in terms of the integral-equation-formalism polarizable continuum model (IEF-PCM). The cavities for the bulk electrostatic calcn. are defined by superpositions of nuclear-centered spheres. The second component is called the cavity-dispersion-solvent-structure term and is the contribution arising from short-range interactions between the solute and solvent mols. in the first solvation shell. This contribution is a sum of terms that are proportional (with geometry-dependent proportionality consts. called at. surface tensions) to the solvent-accessible surface areas of the individual atoms of the solute. The SMD model has been parametrized with a training set of 2821 solvation data including 112 aq. ionic solvation free energies, 220 solvation free energies for 166 ions in acetonitrile, methanol, and DMSO, 2346 solvation free energies for 318 neutral solutes in 91 solvents (90 nonaq. org. solvents and water), and 143 transfer free energies for 93 neutral solutes between water and 15 org. solvents. The elements present in the solutes are H, C, N, O, F, Si, P, S, Cl, and Br. The SMD model employs a single set of parameters (intrinsic at. Coulomb radii and at. surface tension coeffs.) optimized over six electronic structure methods: M05-2X/MIDI!6D, M05-2X/6-31G*, M05-2X/6-31+G**, M05-2X/cc-pVTZ, B3LYP/6-31G*, and HF/6-31G*. Although the SMD model has been parametrized using the IEF-PCM protocol for bulk electrostatics, it may also be employed with other algorithms for solving the nonhomogeneous Poisson equation for continuum solvation calcns. in which the solute is represented by its electron d. in real space. This includes, for example, the conductor-like screening algorithm. With the 6-31G* basis set, the SMD model achieves mean unsigned errors of 0.6-1.0 kcal/mol in the solvation free energies of tested neutrals and mean unsigned errors of 4 kcal/mol on av. for ions with either Gaussian03 or GAMESS. - 59Huang, B.; von Lilienfeld, O. A. The “DNA” of Chemistry: Scalable Quantum Machine Learning with “Amons”. 2017 arXiv:1707.04146. https://arxiv.org/abs/1707.04146 (accessed March 23, 2020)Google ScholarThere is no corresponding record for this reference.
- 60Christensen Faber, H. A Python Toolkit for Quantum Machine Learning. http://doi.org/10.5281/zenodo.817332 2017.Google ScholarThere is no corresponding record for this reference.
- 61Rupp, M.; Tkatchenko, A.; Müller, K. R.; Lilienfeld, V.; Anatole, O. Fast and Accurate Modeling of Molecular Atomization Energies with Machine Learning. Phys. Rev. Lett. 2012, 108 (5), 058301, DOI: 10.1103/PhysRevLett.108.058301[Crossref], [PubMed], [CAS], Google Scholar61https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC38XisFant70%253D&md5=059b8ed186638101842c8df69173679fFast and accurate modeling of molecular atomization energies with machine learningRupp, Matthias; Tkatchenko, Alexandre; Mueller, Klaus-Robert; von Lilienfeld, O. AnatolePhysical Review Letters (2012), 108 (5), 058301/1-058301/5CODEN: PRLTAO; ISSN:0031-9007. (American Physical Society)The authors introduce a machine learning model to predict atomization energies of a diverse set of org. mols., based on nuclear charges and at. positions only. The problem of solving the mol. Schrodinger equation is mapped onto a nonlinear statistical regression problem of reduced complexity. Regression models are trained on and compared to atomization energies computed with hybrid d.-functional theory. Cross validation over more than seven thousand org. mols. yields a mean abs. error of ∼10 kcal/mol. Applicability is demonstrated for the prediction of mol. atomization potential energy curves.
- 62Hansen, K.; Biegler, F.; Ramakrishnan, R.; Pronobis, W.; Von Lilienfeld, O. A.; Müller, K. R.; Tkatchenko, A. Machine Learning Predictions of Molecular Properties: Accurate Many-Body Potentials and Nonlocality in Chemical Space. J. Phys. Chem. Lett. 2015, 6 (12), 2326– 2331, DOI: 10.1021/acs.jpclett.5b00831[ACS Full Text
], [CAS], Google Scholar
62https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2MXpsFOmsrg%253D&md5=402cfd080b9ab0aae17932dffda375cfMachine Learning Predictions of Molecular Properties: Accurate Many-Body Potentials and Nonlocality in Chemical SpaceHansen, Katja; Biegler, Franziska; Ramakrishnan, Raghunathan; Pronobis, Wiktor; von Lilienfeld, O. Anatole; Mueller, Klaus-Robert; Tkatchenko, AlexandreJournal of Physical Chemistry Letters (2015), 6 (12), 2326-2331CODEN: JPCLCD; ISSN:1948-7185. (American Chemical Society)Simultaneously accurate and efficient prediction of mol. properties throughout chem. compd. space is a crit. ingredient toward rational compd. design in chem. and pharmaceutical industries. Aiming toward this goal, we develop and apply a systematic hierarchy of efficient empirical methods to est. atomization and total energies of mols. These methods range from a simple sum over atoms, to addn. of bond energies, to pairwise interat. force fields, reaching to the more sophisticated machine learning approaches that are capable of describing collective interactions between many atoms or bonds. In the case of equil. mol. geometries, even simple pairwise force fields demonstrate prediction accuracy comparable to benchmark energies calcd. using d. functional theory with hybrid exchange-correlation functionals; however, accounting for the collective many-body interactions proves to be essential for approaching the "holy grail" of chem. accuracy of 1 kcal/mol for both equil. and out-of-equil. geometries. This remarkable accuracy is achieved by a vectorized representation of mols. (so-called Bag of Bonds model) that exhibits strong nonlocality in chem. space. In addn., the same representation allows us to predict accurate electronic properties of mols., such as their polarizability and mol. frontier orbital energies. - 63Nelder, J. A.; Mead, R. A Simplex Method for Function Minimization. Comp. J. 1965, 7 (4), 308– 313, DOI: 10.1093/comjnl/7.4.308
- 64Fukunishi, H.; Watanabe, O.; Takada, S. On the Hamiltonian Replica Exchange Method for Efficient Sampling of Biomolecular Systems: Application to Protein Structure Prediction. J. Chem. Phys. 2002, 116 (20), 9058– 9067, DOI: 10.1063/1.1472510[Crossref], [CAS], Google Scholar64https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD38XjsFKmsLo%253D&md5=7ac571a5afdd63b0b4b29cfdec06f53bOn the Hamiltonian replica exchange method for efficient sampling of biomolecular systems: Application to protein structure predictionFukunishi, Hiroaki; Watanabe, Osamu; Takada, ShojiJournal of Chemical Physics (2002), 116 (20), 9058-9067CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)Motivated by the protein structure prediction problem, we develop two variants of the Hamiltonian replica exchange methods (REMs) for efficient configuration sampling, (1) the scaled hydrophobicity REM and (2) the phantom chain REM, and compare their performance with the ordinary REM. We first point out that the ordinary REM has a shortage for the application to large systems such as biomols. and that the Hamiltonian REM, an alternative formulation of the REM, can give a remedy for it. We then propose two examples of the Hamiltonian REM that are suitable for a coarse-grained protein model. (1) The scaled hydrophobicity REM preps. replicas that are characterized by various strengths of hydrophobic interaction. The strongest interaction that mimics aq. soln. environment makes proteins folding, while weakened hydrophobicity unfolds proteins as in org. solvent. Exchange between these environments enables proteins to escape from misfolded traps and accelerate conformational search. This resembles the roles of mol. chaperone that assist proteins to fold in vivo. (2) The phantom chain REM uses replicas that allow various degrees of at. overlaps. By allowing at. overlap in some of replicas, the peptide chain can cross over itself, which can accelerate conformation sampling. Using a coarse-gained model we developed, we compute equil. probability distributions for poly-alanine 16-mer and for a small protein by these REMs and compare the accuracy of the results. We see that the scaled hydrophobicity REM is the most efficient method among the three REMs studied.
- 65Swendsen, R. H.; Wang, J.-S. Replica Monte Carlo Simulation of Spin-Glasses. Phys. Rev. Lett. 1986, 57 (21), 2607, DOI: 10.1103/PhysRevLett.57.2607[Crossref], [PubMed], [CAS], Google Scholar65https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A280%3ADC%252BC2sfotFaisQ%253D%253D&md5=6f748450b1063d67006b4e0ff4273fddReplica Monte Carlo simulation of spin glassesSwendsen; WangPhysical review letters (1986), 57 (21), 2607-2609 ISSN:.There is no expanded citation for this reference.
- 66Affentranger, R.; Tavernelli, I.; Di Iorio, E. E. A Novel Hamiltonian Replica Exchange MD Protocol to Enhance Protein Conformational Space Sampling. J. Chem. Theory Comput. 2006, 2 (2), 217– 228, DOI: 10.1021/ct050250b[ACS Full Text
], [CAS], Google Scholar
66https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD28XjsFCntw%253D%253D&md5=973b0c5a6cc780bf5edf4cf16c0d63f3A Novel Hamiltonian Replica Exchange MD Protocol to Enhance Protein Conformational Space SamplingAffentranger, Roman; Tavernelli, Ivano; Di Iorio, Ernesto E.Journal of Chemical Theory and Computation (2006), 2 (2), 217-228CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)Limited searching in the conformational space is one of the major obstacles for investigating protein dynamics by numerical approaches. For this reason, classical all-atom mol. dynamics (MD) simulations of proteins tend to be confined to local energy min., particularly when the bulk solvent is treated explicitly. To overcome this problem, the authors have developed a novel replica exchange protocol that uses modified force-field parameters to treat interparticle nonbonded potentials within the protein and between protein and solvent atoms, leaving unperturbed those relative to solvent-solvent interactions. The authors have tested the new protocol on the 18-residue-long tip of the P domain of calreticulin in an explicit solvent. With only eight replicas, the authors have been able to considerably enhance the conformational space sampled during a 100 ns simulation, compared to as many parallel classical mol. dynamics simulations of the same length or to a single one lasting 450 ns. A direct comparison between the various simulations has been possible thanks to the implementation of the weighted histogram anal. method, by which conformations simulated with modified force-field parameters can be assigned different wts. Interatom, inter-residue distances in the structural ensembles obtained with the authors' novel replica exchange approach and by classical MD simulations compare equally well with those derived from NMR data. Rare events, such as unfolding and refolding, occur with reasonable statistical frequency. Visiting of conformations characterized by very small Boltzmann wts. is also possible. Despite their low probability, such regions of the conformational space may play an important role in the search for local potential-energy min. and in dynamically controlled functions. - 67Wang, L.; Friesner, R. A.; Berne, B. J. Replica Exchange with Solute Scaling: A More Efficient Version of Replica Exchange with Solute Tempering (REST2). J. Phys. Chem. B 2011, 115 (30), 9431– 9438, DOI: 10.1021/jp204407d[ACS Full Text
], [CAS], Google Scholar
67https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3MXosFymurs%253D&md5=8acb0d670bcb70eacfc30e32d5dfb6ddReplica Exchange with Solute Scaling: A More Efficient Version of Replica Exchange with Solute Tempering (REST2)Wang, Lingle; Friesner, Richard A.; Berne, B. J.Journal of Physical Chemistry B (2011), 115 (30), 9431-9438CODEN: JPCBFK; ISSN:1520-5207. (American Chemical Society)A small change in the Hamiltonian scaling in Replica Exchange with Solute Tempering (REST) is found to improve its sampling efficiency greatly, esp. for the sampling of aq. protein solns. in which there are large-scale solute conformation changes. Like the original REST (REST1), the new version (which the authors call REST2) also bypasses the poor scaling with system size of the std. Temp. Replica Exchange Method (TREM), reducing the no. of replicas (parallel processes) from what must be used in TREM. This redn. is accomplished by deforming the Hamiltonian function for each replica in such a way that the acceptance probability for the exchange of replica configurations does not depend on the no. of explicit water mols. in the system. For proof of concept, REST2 is compared with TREM and with REST1 for the folding of the trpcage and β-hairpin in water. The comparisons confirm that REST2 greatly reduces the no. of CPUs required by regular replica exchange and greatly increases the sampling efficiency over REST1. This method reduces the CPU time required for calcg. thermodn. avs. and for the ab initio folding of proteins in explicit water. - 68Ceriotti, M.; More, J.; Manolopoulos, D. E. I-PI: A Python Interface for Ab Initio Path Integral Molecular Dynamics Simulations. Comput. Phys. Commun. 2014, 185 (3), 1019– 1026, DOI: 10.1016/j.cpc.2013.10.027[Crossref], [CAS], Google Scholar68https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3sXhvVSltL3I&md5=a7a21c0d8665ac7d2a4a01b0cc280057i-PI: A Python interface for ab initio path integral molecular dynamics simulationsCeriotti, Michele; More, Joshua; Manolopoulos, David E.Computer Physics Communications (2014), 185 (3), 1019-1026CODEN: CPHCBZ; ISSN:0010-4655. (Elsevier B.V.)Recent developments in path integral methodol. have significantly reduced the computational expense of including quantum mech. effects in the nuclear motion in ab initio mol. dynamics simulations. However, the implementation of these developments requires a considerable programming effort, which has hindered their adoption. Here we describe i-PI, an interface written in Python that has been designed to minimise the effort required to bring state-of-the-art path integral techniques to an electronic structure program. While it is best suited to first principles calcns. and path integral mol. dynamics, i-PI can also be used to perform classical mol. dynamics simulations, and can just as easily be interfaced with an empirical forcefield code. To give just one example of the many potential applications of the interface, we use it in conjunction with the CP2K electronic structure package to showcase the importance of nuclear quantum effects in high-pressure water.
- 69Kapil, V.; Rossi, M.; Marsalek, O.; Petraglia, R.; Litman, Y.; Spura, T.; Cheng, B.; Cuzzocrea, A.; Meißner, R. H.; Wilkins, D. M.; Juda, P.; Bienvenue, S. P.; Fang, W.; Kessler, J.; Poltavsky, I.; Vandenbrande, S.; Wieme, J.; Corminboeuf, C.; Kühne, T. D.; Manolopoulos, D. E.; Markland, T. E.; Richardson, J. O.; Tkatchenko, A.; Tribello, G. A.; Van Speybroeck, V. V.; Ceriotti, M. i-PI 2.0: A universal force engine for advanced molecular simulations. Comput. Phys. Commun. 2019, 236, 214– 223, DOI: 10.1016/j.cpc.2018.09.020[Crossref], [CAS], Google Scholar69https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXhvFWqtL3I&md5=02da73b3d98059a9a849ca4dc8266d99i-PI 2.0: A universal force engine for advanced molecular simulationsKapil, Venkat; Rossi, Mariana; Marsalek, Ondrej; Petraglia, Riccardo; Litman, Yair; Spura, Thomas; Cheng, Bingqing; Cuzzocrea, Alice; Meissner, Robert H.; Wilkins, David M.; Helfrecht, Benjamin A.; Juda, Przemyslaw; Bienvenue, Sebastien P.; Fang, Wei; Kessler, Jan; Poltavsky, Igor; Vandenbrande, Steven; Wieme, Jelle; Corminboeuf, Clemence; Kuhne, Thomas D.; Manolopoulos, David E.; Markland, Thomas E.; Richardson, Jeremy O.; Tkatchenko, Alexandre; Tribello, Gareth A.; Van Speybroeck, Veronique; Ceriotti, MicheleComputer Physics Communications (2019), 236 (), 214-223CODEN: CPHCBZ; ISSN:0010-4655. (Elsevier B.V.)Progress in the at.-scale modeling of matter over the past decade has been tremendous. This progress has been brought about by improvements in methods for evaluating interat. forces that work by either solving the electronic structure problem explicitly, or by computing accurate approxns. of the soln. and by the development of techniques that use the Born-Oppenheimer (BO) forces to move the atoms on the BO potential energy surface. As a consequence of these developments it is now possible to identify stable or metastable states, to sample configurations consistent with the appropriate thermodn. ensemble, and to est. the kinetics of reactions and phase transitions. All too often, however, progress is slowed down by the bottleneck assocd. with implementing new optimization algorithms and/or sampling techniques into the many existing electronic-structure and empirical-potential codes. To address this problem, we are thus releasing a new version of the i-PI software. This piece of software is an easily extensible framework for implementing advanced atomistic simulation techniques using interat. potentials and forces calcd. by an external driver code. While the original version of the code (Ceriotti et al., 2014) was developed with a focus on path integral mol. dynamics techniques, this second release of i-PI not only includes several new advanced path integral methods, but also offers other classes of algorithms. In other words, i-PI is moving towards becoming a universal force engine that is both modular and tightly coupled to the driver codes that evaluate the potential energy surface and its derivs.
- 70Frenkel, D.; Smit, B. Understanding Molecular Simulation: From Algorithms to Applications, 2nd ed.; Elsevier: London, UK, 2001.Google ScholarThere is no corresponding record for this reference.
- 71Kunsch, H. R. The Jackknife and the Bootstrap for General Stationary Observations. Ann. Stat. 1989, 17 (3), 1217– 1241, DOI: 10.1214/aos/1176347265
- 72Rathore, N.; Chopra, M.; de Pablo, J. J. Optimal Allocation of Replicas in Parallel Tempering Simulations. J. Chem. Phys. 2005, 122 (2), 024111, DOI: 10.1063/1.1831273[Crossref], [PubMed], [CAS], Google Scholar72https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD2MXislahug%253D%253D&md5=dea2c2021aaebda0c54b603858650bd4Optimal allocation of replicas in parallel tempering simulationsRathore, Nitin; Chopra, Manan; de Pablo, Juan J.Journal of Chemical Physics (2005), 122 (2), 024111/1-024111/8CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)We have studied the efficiency of parallel tempering simulations for a variety of systems including a coarse-grained protein, an atomistic model polypeptide, and the Lennard-Jones fluid. A scheme is proposed for the optimal allocation of temps. in these simulations. The method is compared to the existing empirical approaches used for this purpose. Accuracy assocd. with the computed thermodn. quantities such as sp. heat is also computed and their dependence on the trial-exchange acceptance rate is reported.
- 73Jones, E.; Pearson, P.; Travis, O. SciPy: Open Source Scientific Tools for Python. http://www.scipy.org, 2001.Google ScholarThere is no corresponding record for this reference.
- 74Vannay, L.; Meyer, B.; Petraglia, R.; Sforazzini, G.; Ceriotti, M.; Corminboeuf, C. Analyzing Fluxional Molecules Using DORI. J. Chem. Theory Comput. 2018, 14 (5), 2370– 2379, DOI: 10.1021/acs.jctc.7b01176[ACS Full Text
], [CAS], Google Scholar
74https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXlvVKlsLw%253D&md5=3fd2d742bc7bce9cf3ef9eefbd58b4dcAnalyzing Fluxional Molecules Using DORIVannay, Laurent; Meyer, Benjamin; Petraglia, Riccardo; Sforazzini, Giuseppe; Ceriotti, Michele; Corminboeuf, ClemenceJournal of Chemical Theory and Computation (2018), 14 (5), 2370-2379CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)The D. Overlap Region Indicator (DORI) is a d.-based scalar field that reveals covalent bonding patterns and non-covalent interactions in the same value range. This work goes beyond the traditional static quantum chem. use of scalar fields and illustrates the suitability of DORI for analyzing geometrical and electronic signatures in highly fluxional mol. systems. Examples include a dithiocyclophane, which possesses multiple local min. with differing extents of π-stacking interactions and a temp. dependent rotation of a mol. rotor, where the descriptor is employed to capture fingerprints of CH-π and π-π interactions. Finally, DORI serves to examine the fluctuating π-conjugation pathway of a photochromic torsional switch (PTS). Attention is also placed on post-processing the large amt. of generated data and juxtaposing DORI with a data-driven low-dimensional representation of the structural landscape. - 75Kapil, V.; Engel, E.; Rossi, M.; Ceriotti, M. Assessment of Approximate Methods for Anharmonic Free Energies. J. Chem. Theory Comput. 2019, 15 (11), 5845– 5857, DOI: 10.1021/acs.jctc.9b00596[ACS Full Text
], [CAS], Google Scholar
75https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXhsl2htL3N&md5=57a4331cecec93c76be026ec5b04b7d9Assessment of Approximate Methods for Anharmonic Free EnergiesKapil, Venkat; Engel, Edgar; Rossi, Mariana; Ceriotti, MicheleJournal of Chemical Theory and Computation (2019), 15 (11), 5845-5857CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)Quant. evaluations of the thermodn. properties of materials - most notably their stability, as measured by the free energy - must take into account the role of thermal and zero-point energy fluctuations. While these effects can easily be estd. within a harmonic approxn., corrections arising from the anharmonic nature of the interat. potential are often crucial and require computationally costly path integral simulations. Consequently, different approx. frameworks for computing affordable ests. of the anharmonic free energies have been developed over the years. Understanding which of the approxns. involved are justified for a given system, and therefore choosing the most suitable method, is complicated by the lack of comparative benchmarks. To facilitate this choice we assess the accuracy and efficiency of some of the most commonly used approx. methods - the independent mode framework, the vibrational SCF and self-consistent phonons - by comparing the anharmonic correction to the Helmholtz free energy against ref. path integral calcns. These benchmarks are performed for a diverse set of systems, ranging from simple quasi-harmonic solids to flexible mol. crystals with freely-rotating units. Our results suggest that for simple solids such as allotropes of carbon these methods yield results that are in excellent agreement with the ref. calcns., at a considerably lower computational cost. For more complex mol. systems such as polymorphs of ice and paracetamol the methods do not consistently provide a reliable approxn. of the anharmonic correction. Despite substantial cancellation of errors when comparing the stability of different phases, we do not observe a systematic improvement over the harmonic approxn. even for relative free-energies. Our results suggest that efforts towards obtaining computationally-feasible anharmonic free-energies for flexible mol. solids should therefore be directed towards reducing the expense of path integral methods. - 76Bürgi, T.; Baiker, A. Conformational Behavior of Cinchonidine in Different Solvents: A Combined NMR and Ab Initio Investigation. J. Am. Chem. Soc. 1998, 120 (49), 12920– 12926, DOI: 10.1021/ja982466b[ACS Full Text
], [CAS], Google Scholar
76https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADyaK1cXnsFKmu78%253D&md5=6cac8e8ea2a3adf51ac14dcf7e63b3bdConformational Behavior of Cinchonidine in Different Solvents: A Combined NMR and ab Initio InvestigationBuergi, Thomas; Baiker, AlfonsJournal of the American Chemical Society (1998), 120 (49), 12920-12926CODEN: JACSAT; ISSN:0002-7863. (American Chemical Society)The conformation of cinchonidine in soln. has been investigated by NMR techniques as well as theor. Three conformers of cinchonidine are shown to be substantially populated at room temp., Closed(1), Closed(2), and Open(3), with the latter being the most stable in apolar solvents. The stability of the closed conformers relative to that of Open(3), however, increases with solvent polarity. In polar solvents the three conformers have similar energy. The relationship between relative energies and the dielec. const. of the solvent is not linear but resembles the form of an Onsager function. Ab initio and d. functional reaction field calcns. using cavity shapes detd. by an isodensity surface are in good agreement with expt. for solvents which do not show strong specific interaction with cinchonidine. The role of the conformational behavior of cinchonidine is illustrated using the example of the platinum-catalyzed enantioselective hydrogenation of ketopantolactone in different solvents. - 77Huang, M.; Dissanayake, T.; Kuechler, E.; Radak, B. K.; Lee, T. S.; Giese, T. J.; York, D. M. A Multidimensional B-Spline Correction for Accurate Modeling Sugar Puckering in QM/MM Simulations. J. Chem. Theory Comput. 2017, 13 (9), 3975– 3984, DOI: 10.1021/acs.jctc.7b00161[ACS Full Text
], [CAS], Google Scholar
77https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2sXht1Oht7%252FF&md5=b978922b7b7dbb942a0f5d3c90a9bfdeA Multidimensional B-Spline Correction for Accurate Modeling Sugar Puckering in QM/MM SimulationsHuang, Ming; Dissanayake, Thakshila; Kuechler, Erich; Radak, Brian K.; Lee, Tai-Sung; Giese, Timothy J.; York, Darrin M.Journal of Chemical Theory and Computation (2017), 13 (9), 3975-3984CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)The computational efficiency of approx. quantum mech. methods allows their use for the construction of multidimensional reaction free energy profiles. It has recently been demonstrated that quantum models based on the NDDO approxn. have difficulty modeling deoxyribose and ribose sugar ring puckers and thus limit their predictive value in the study of RNA and DNA systems. A method has been introduced in our previous work to improve the description of the sugar puckering conformational landscape that uses a multidimensional B-spline correction map (BMAP correction) for systems involving intrinsically coupled torsion angles. This method greatly improved the adiabatic potential energy surface profiles of DNA and RNA sugar rings relative to high-level ab initio methods even for highly problematic NDDO-based models. In the present work, a BMAP correction is developed, implemented, and tested in mol. dynamics simulations using the AM1/d-PhoT semiempirical Hamiltonian for biol. phosphoryl transfer reactions. Results are presented for gas-phase adiabatic potential energy surfaces of RNA transesterification model reactions and condensed-phase QM/MM free energy surfaces for nonenzymic and RNase A-catalyzed transesterification reactions. The results show that the BMAP correction is stable, efficient, and leads to improvement in both the potential energy and free energy profiles for the reactions studied, as compared with ab initio and exptl. ref. data. Exploration of the effect of the size of the quantum mech. region indicates the best agreement with exptl. reaction barriers occurs when the full CpA dinucleotide substrate is treated quantum mech. with the sugar pucker correction.
Cited By
This article is cited by 10 publications.
- Shawn C.C. Hsueh, Adekunle Aina, Steven S. Plotkin. Ensemble Generation for Linear and Cyclic Peptides Using a Reservoir Replica Exchange Molecular Dynamics Implementation in GROMACS. The Journal of Physical Chemistry B 2022, 126 (49) , 10384-10399. https://doi.org/10.1021/acs.jpcb.2c05470
- Simone Gallarati, Raimon Fabregat, Veronika Juraskova, Theo Jaffrelot Inizan, Clemence Corminboeuf. How Robust Is the Reversible Steric Shielding Strategy for Photoswitchable Organocatalysts?. The Journal of Organic Chemistry 2022, 87 (14) , 8849-8857. https://doi.org/10.1021/acs.joc.1c02991
- Raimon Fabregat, Alberto Fabrizio, Edgar A. Engel, Benjamin Meyer, Veronika Juraskova, Michele Ceriotti, Clemence Corminboeuf. Local Kernel Regression and Neural Network Approaches to the Conformational Landscapes of Oligopeptides. Journal of Chemical Theory and Computation 2022, 18 (3) , 1467-1479. https://doi.org/10.1021/acs.jctc.1c00813
- J. Terence Blaskovits, Kun-Han Lin, Raimon Fabregat, Iwona Swiderska, Hélène Wu, Clémence Corminboeuf. Is a Single Conformer Sufficient to Describe the Reorganization Energy of Amorphous Organic Transport Materials?. The Journal of Physical Chemistry C 2021, 125 (31) , 17355-17362. https://doi.org/10.1021/acs.jpcc.1c04067
- Koushik Kasavajhala, Kenneth Lam, Carlos Simmerling. Exploring Protocols to Build Reservoirs to Accelerate Temperature Replica Exchange MD Simulations. Journal of Chemical Theory and Computation 2020, 16 (12) , 7776-7799. https://doi.org/10.1021/acs.jctc.0c00513
- Wook Shin, Zhongyue J. Yang. Computational Strategies for Entropy Modeling in Chemical Processes. Chemistry – An Asian Journal 2023, 18 (9) https://doi.org/10.1002/asia.202300117
- David Kuntz, Angela K. Wilson. Machine learning, artificial intelligence, and chemistry: How smart algorithms are reshaping simulation and the laboratory. Pure and Applied Chemistry 2022, 94 (8) , 1019-1054. https://doi.org/10.1515/pac-2022-0202
- Veronika Jurásková, Frederic Célerse, Ruben Laplaza, Clemence Corminboeuf. Assessing the persistence of chalcogen bonds in solution with neural network potentials. The Journal of Chemical Physics 2022, 156 (15) https://doi.org/10.1063/5.0085153
- W. S. Jesus, F. V. Prudente, J. M. C. Marques, F. B. Pereira. Modeling microsolvation clusters with electronic-structure calculations guided by analytical potentials and predictive machine learning techniques. Physical Chemistry Chemical Physics 2021, 23 (2) , 1738-1749. https://doi.org/10.1039/D0CP05200K
- Alberto Fabrizio, Ksenia R. Briling, David D. Girardier, Clemence Corminboeuf. Learning on-top: Regressing the on-top pair density for real-space visualization of electron correlation. The Journal of Chemical Physics 2020, 153 (20) https://doi.org/10.1063/5.0033326
Abstract
Figure 1
Figure 1. (a) Dithiacyclophane and the collective variables used to characterize its global structure: the distance between the center of masses of each cyclic bulk and the angles between the average planes going through them. (b) Cinchona alkaloid organocatalyst and the two dihedral angles used to characterize its global structure.
Figure 2
Figure 2. Mind-map and workflow illustrating the proposed methodology.
Figure 3
Figure 3. Schematic depiction of Hamiltonian reservoir Replica Exchange.
Figure 4
Figure 4. (a) Free energy landscape (DFTB-SK/3OB level) of dithiacyclophane at 300 K (T-RE) projected on the 2D space generated by the collective variables visible in Figure 1a. (b) Projection of the data set made of 1500 dithiacyclophane structures extracted with farthest point sampling from the 300 K canonical ensemble of 40 000 structures and color coded on the basis of the single-point energy difference ΔE = ((DFTB-SK/3OB) – (DLPNO-CCSD(T)/CBS)). The continuous background is plotted using a Gaussian interpolation of the mean energy difference. The smooth histograms were constructed with a Gaussian Kernel Density Estimator (Gaussian KDE) using the SciPy (73) python library.
Figure 5
Figure 5. (a) Free energy landscape (DFTB-SK/3OB level) of the cinchona alkaloid organocatalyst at 300 K projected on the 2D space generated by the collective variables visible in Figure 1b. Constructed with canonical structures generated with T-RE simulations with DFTB-SK as potential energy. (b) Projection of the 1800 data set structures obtained with FPS from a canonical ensemble of 32 000 structures at 300 K canonical ensemble and color coded on the basis of the single point energy difference ΔE = ((DFTB-SK/3OB) – (DLPNO-CCSD(T)/CBS)). (c) Structures representing each of the four conformational regions (i.e., basins).
Figure 6
Figure 6. Comparison between the DFTB-SK electronic energy and the ML-DLPNO-CCSD(T)/CBS predictions (i.e., DFTB-SK + ΔML correction) for the 40 000 structures in the reservoir.
Figure 7
Figure 7. Free energy landscapes at 300 K generated with the potential: (a) DFTB-SK; (b) ML-DLPNO-CCSD(T)/CBS; (c) ML-[PBE0-D3/(6-31G)(SMD Chloroform)]; (d) PBE0-D3/(6-31G); (e) ML-PBE0-D3/(6-31G). (f) Relative free energies by integration within the local minima. (24) The free energies are all given relative to the Disarticulated state except for the solvated system, where the open state is used as a reference. The striped columns correspond to the static relative free energy using the harmonic approximation (for the solvated system the harmonic free energies were computed with the true potential, and not with the machine learning version). All the free energies maps come from resH-RE expect for the direct PBE0, which uses T-RE, as described in the methods section.
Figure 8
Figure 8. Free energy landscapes at 300 K generated with the potential: (a) DFTB-SK; (b) ML-DLPNO-CCSD(T)/CBS; (c) ML-[PBE0-D3/(6-31G)(SMD Chloroform)]. (d) Free energies upon integration within the free energy basins. The free energies are all given relative to the Disarticulated state. The stripped columns are the free energy predictions of the basins using the static free energies using the harmonic correction.
References
ARTICLE SECTIONSThis article references 77 other publications.
- 1Behler, J.; Parrinello, M. Generalized Neural-Network Representation of High-Dimensional Potential-Energy Surfaces. Phys. Rev. Lett. 2007, 98 (14), 146401, DOI: 10.1103/PhysRevLett.98.146401[Crossref], [PubMed], [CAS], Google Scholar1https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD2sXjvF2ls7w%253D&md5=579a6cbf503565205acbb86ade0ae86bGeneralized Neural-Network Representation of High-Dimensional Potential-Energy SurfacesBehler, Jorg; Parrinello, MichelePhysical Review Letters (2007), 98 (14), 146401/1-146401/4CODEN: PRLTAO; ISSN:0031-9007. (American Physical Society)The accurate description of chem. processes often requires the use of computationally demanding methods like d.-functional theory (DFT), making long simulations of large systems unfeasible. In this Letter we introduce a new kind of neural-network representation of DFT potential-energy surfaces, which provides the energy and forces as a function of all at. positions in systems of arbitrary size and is several orders of magnitude faster than DFT. The high accuracy of the method is demonstrated for bulk silicon and compared with empirical potentials and DFT. The method is general and can be applied to all types of periodic and nonperiodic systems.
- 2Behler, J.; Marto crnnák, R.; Donadio, D.; Parrinello, M. Metadynamics Simulations of the High-Pressure Phases of Silicon Employing a High-Dimensional Neural Network Potential. Phys. Rev. Lett. 2008, 100 (18), 185501, DOI: 10.1103/PhysRevLett.100.185501[Crossref], [PubMed], [CAS], Google Scholar2https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD1cXlslKnsro%253D&md5=e003c971c671233d15bbc3679dace5bbMetadynamics simulations of the high-pressure phases of silicon employing a high-dimensional neural network potentialBehler, Joerg; Martonak, Roman; Donadio, Davide; Parrinello, MichelePhysical Review Letters (2008), 100 (18), 185501/1-185501/4CODEN: PRLTAO; ISSN:0031-9007. (American Physical Society)We study in a systematic way the complex sequence of the high-pressure phases of silicon obtained upon compression by combining an accurate high-dimensional neural network representation of the d.-functional theory potential energy surface with the metadynamics scheme. Starting from the thermodynamically stable diamond structure at ambient conditions we are able to identify all structural phase transitions up to the highest-pressure fcc phase at about 100 GPa. The results are in excellent agreement with expt. The method developed promises to be of great value in the study of inorg. solids, including those having metallic phases.
- 3Khaliullin, R. Z.; Eshet, H.; Kühne, T. D.; Behler, J.; Parrinello, M. Graphite-Diamond Phase Coexistence Study Employing a Neural-Network Mapping of the Ab Initio Potential Energy Surface. Phys. Rev. B: Condens. Matter Mater. Phys. 2010, 81 (10), 100103, DOI: 10.1103/PhysRevB.81.100103[Crossref], [CAS], Google Scholar3https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3cXkt1Ojtb4%253D&md5=ea08109866c0f25f4950b2263fd68b64Graphite-diamond phase coexistence study employing a neural-network mapping of the ab initio potential energy surfaceKhaliullin, Rustam Z.; Eshet, Hagai; Kuhne, Thomas D.; Behler, Jorg; Parrinello, MichelePhysical Review B: Condensed Matter and Materials Physics (2010), 81 (10), 100103/1-100103/4CODEN: PRBMDO; ISSN:1098-0121. (American Physical Society)An interat. potential for the diamond and graphite phases of carbon has been created using a neural-network (NN) representation of the ab initio potential energy surface. The NN potential combines the accuracy of a first-principles description of both phases with the efficiency of empirical force fields and allows one to perform a mol.-dynamics study, of ab initio quality, of the thermodn. of graphite-diamond coexistence. Good agreement between the exptl. and calcd. coexistence curves is achieved if nuclear quantum effects are included in the simulation.
- 4Khaliullin, R. Z.; Eshet, H.; Kühne, T. D.; Behler, J.; Parrinello, M. Nucleation Mechanism for the Direct Graphite-to-Diamond Phase Transition. Nat. Mater. 2011, 10 (9), 693, DOI: 10.1038/nmat3078[Crossref], [PubMed], [CAS], Google Scholar4https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3MXptlaku7o%253D&md5=c1a307e718eaa060f02f14e3af4aa93eNucleation mechanism for the direct graphite-to-diamond phase transitionKhaliullin, Rustam Z.; Eshet, Hagai; Kuehne, Thomas D.; Behler, Joerg; Parrinello, MicheleNature Materials (2011), 10 (9), 693-697CODEN: NMAACR; ISSN:1476-1122. (Nature Publishing Group)Graphite and diamond have comparable free energies, yet forming diamond from graphite in the absence of a catalyst requires pressures that are significantly higher than those at equil. coexistence. At lower temps., the formation of the metastable hexagonal polymorph of diamond is favored instead of the more stable cubic diamond. These phenomena cannot be explained by the concerted mechanism suggested in previous theor. studies. Using an ab initio quality neural-network potential, we carried out a large-scale study of the graphite-to-diamond transition assuming that it occurs through nucleation. The nucleation mechanism accounts for the obsd. phenomenol. and reveals its microscopic origins. We demonstrate that the large lattice distortions that accompany the formation of diamond nuclei inhibit the phase transition at low pressure, and direct it towards the hexagonal diamond phase at higher pressure. The proposed nucleation mechanism should improve our understanding of structural transformations in a wide range of carbon-based materials.
- 5Eshet, H.; Khaliullin, R. Z.; Kühne, T. D.; Behler, J.; Parrinello, M. Ab Initio Quality Neural-Network Potential for Sodium. Phys. Rev. B: Condens. Matter Mater. Phys. 2010, 81 (18), 184107, DOI: 10.1103/PhysRevB.81.184107[Crossref], [CAS], Google Scholar5https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3cXmvVKjsL0%253D&md5=f56756ae8cfdb097f84ced73ba87baa8Ab initio quality neural-network potential for sodiumEshet, Hagai; Khaliullin, Rustam Z.; Kuhne, Thomas D.; Behler, Jorg; Parrinello, MichelePhysical Review B: Condensed Matter and Materials Physics (2010), 81 (18), 184107/1-184107/8CODEN: PRBMDO; ISSN:1098-0121. (American Physical Society)An interat. potential for high-pressure high-temp. (HPHT) cryst. and liq. phases of sodium is created using a neural-network (NN) representation of the ab initio potential-energy surface. It is demonstrated that the NN potential provides an ab initio quality description of multiple properties of liq. sodium and bcc, fcc, and cI16 crystal phases in the P-T region up to 120 GPa and 1200 K. The unique combination of computational efficiency of the NN potential and its ability to reproduce quant. exptl. properties of sodium in the wide P-T range enables mol.-dynamics simulations of physicochem. processes in HPHT sodium of unprecedented quality.
- 6Gastegger, M.; Kauffmann, C.; Behler, J.; Marquetand, P. Comparing the Accuracy of High-Dimensional Neural Network Potentials and the Systematic Molecular Fragmentation Method: A Benchmark Study for All-Trans Alkanes. J. Chem. Phys. 2016, 144 (19), 194110, DOI: 10.1063/1.4950815[Crossref], [PubMed], [CAS], Google Scholar6https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC28XotleksrY%253D&md5=a805eda60b32bf67bacfa6d3fdf40befComparing the accuracy of high-dimensional neural network potentials and the systematic molecular fragmentation method: A benchmark study for all-trans alkanesGastegger, Michael; Kauffmann, Clemens; Behler, Joerg; Marquetand, PhilippJournal of Chemical Physics (2016), 144 (19), 194110/1-194110/6CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)Many approaches, which have been developed to express the potential energy of large systems, exploit the locality of the at. interactions. A prominent example is the fragmentation methods in which the quantum chem. calcns. are carried out for overlapping small fragments of a given mol. that are then combined in a second step to yield the system's total energy. Here we compare the accuracy of the systematic mol. fragmentation approach with the performance of high-dimensional neural network (HDNN) potentials introduced by Behler and Parrinello. HDNN potentials are similar in spirit to the fragmentation approach in that the total energy is constructed as a sum of environment-dependent at. energies, which are derived indirectly from electronic structure calcns. As a benchmark set, we use all-trans alkanes contg. up to eleven carbon atoms at the coupled cluster level of theory. These mols. have been chosen because they allow to extrapolate reliable ref. energies for very long chains, enabling an assessment of the energies obtained by both methods for alkanes including up to 10 000 carbon atoms. We find that both methods predict high-quality energies with the HDNN potentials yielding smaller errors with respect to the coupled cluster ref. (c) 2016 American Institute of Physics.
- 7Gastegger, M.; Behler, J.; Marquetand, P. Machine Learning Molecular Dynamics for the Simulation of Infrared Spectra. Chem. Sci. 2017, 8 (10), 6924– 6935, DOI: 10.1039/C7SC02267K[Crossref], [PubMed], [CAS], Google Scholar7https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2sXhtlSrtL7J&md5=68f30b308f22e04fc416e62c7aa85eedMachine learning molecular dynamics for the simulation of infrared spectraGastegger, Michael; Behler, Joerg; Marquetand, PhilippChemical Science (2017), 8 (10), 6924-6935CODEN: CSHCCN; ISSN:2041-6520. (Royal Society of Chemistry)A review. Machine learning has emerged as an invaluable tool in many research areas. In the present work, we harness this power to predict highly accurate mol. IR spectra with unprecedented computational efficiency. To account for vibrational anharmonic and dynamical effects - typically neglected by conventional quantum chem. approaches - we base our machine learning strategy on ab initio mol. dynamics simulations. While these simulations are usually extremely time consuming even for small mols., we overcome these limitations by leveraging the power of a variety of machine learning techniques, not only accelerating simulations by several orders of magnitude, but also greatly extending the size of systems that can be treated. To this end, we develop a mol. dipole moment model based on environment dependent neural network charges and combine it with the neural network potential approach of Behler and Parrinello. Contrary to the prevalent big data philosophy, we are able to obtain very accurate machine learning models for the prediction of IR spectra based on only a few hundreds of electronic structure ref. points. This is made possible through the use of mol. forces during neural network potential training and the introduction of a fully automated sampling scheme. We demonstrate the power of our machine learning approach by applying it to model the IR spectra of a methanol mol., n-alkanes contg. up to 200 atoms and the protonated alanine tripeptide, which at the same time represents the first application of machine learning techniques to simulate the dynamics of a peptide. In all of these case studies we find an excellent agreement between the IR spectra predicted via machine learning models and the resp. theor. and exptl. spectra.
- 8Bartók, A. P.; Payne, M. C.; Kondor, R.; Csányi, G. Gaussian Approximation Potentials: the Accuracy of Quantum Mechanics, without the Electrons. Phys. Rev. Lett. 2010, 104 (13), 136403, DOI: 10.1103/PhysRevLett.104.136403[Crossref], [PubMed], [CAS], Google Scholar8https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3cXkt1Kqur8%253D&md5=0a468458554e85413b53816c082419f2Gaussian Approximation Potentials: The Accuracy of Quantum Mechanics, without the ElectronsBartok, Albert P.; Payne, Mike C.; Kondor, Risi; Csanyi, GaborPhysical Review Letters (2010), 104 (13), 136403/1-136403/4CODEN: PRLTAO; ISSN:0031-9007. (American Physical Society)We introduce a class of interat. potential models that can be automatically generated from data consisting of the energies and forces experienced by atoms, as derived from quantum mech. calcns. The models do not have a fixed functional form and hence are capable of modeling complex potential energy landscapes. They are systematically improvable with more data. We apply the method to bulk crystals, and test it by calcg. properties at high temps. Using the interat. potential to generate the long mol. dynamics trajectories required for such calcns. saves orders of magnitude in computational cost.
- 9Bartók, A. P.; De, S.; Poelking, C.; Bernstein, N.; Kermode, J. R.; Csányi, G.; Ceriotti, M. Machine Learning Unifies the Modeling of Materials and Molecules. Sci. Adv. 2017, 3 (12), e1701816 DOI: 10.1126/sciadv.1701816[Crossref], [PubMed], [CAS], Google Scholar9https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXisVWgsbjP&md5=e996f3746c995ed0304f33762f7da713Machine learning unifies the modeling of materials and moleculesBartok, Albert P.; De, Sandip; Poelking, Carl; Bernstein, Noam; Kermode, James R.; Csanyi, Gabor; Ceriotti, MicheleScience Advances (2017), 3 (12), e1701816/1-e1701816/8CODEN: SACDAF; ISSN:2375-2548. (American Association for the Advancement of Science)Detg. the stability of mols. and condensed phases is the cornerstone of atomistic modeling, underpinning our understanding of chem. and materials properties and transformations. We show that a machine-learning model, based on a local description of chem. environments and Bayesian statistical learning, provides a unified framework to predict at.-scale properties. It captures the quantum mech. effects governing the complex surface reconstructions of silicon, predicts the stability of different classes of mols. with chem. accuracy, and distinguishes active and inactive protein ligands with more than 99% reliability. The universality and the systematic nature of our framework provide new insight into the potential energy surface of materials and mols.
- 10Caro, M. A.; Deringer, V. L.; Koskinen, J.; Laurila, T.; Csányi, G. Growth Mechanism and Origin of High Sp3 Content in Tetrahedral Amorphous Carbon. Phys. Rev. Lett. 2018, 120, 166101, DOI: 10.1103/PhysRevLett.120.166101[Crossref], [PubMed], [CAS], Google Scholar10https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXltFSgt78%253D&md5=9b29a42ea8c0a1bccff5d98eed9713c0Growth Mechanism and Origin of High sp3 Content in Tetrahedral Amorphous CarbonCaro, Miguel A.; Deringer, Volker L.; Koskinen, Jari; Laurila, Tomi; Csanyi, GaborPhysical Review Letters (2018), 120 (16), 166101CODEN: PRLTAO; ISSN:1079-7114. (American Physical Society)We study the deposition of tetrahedral amorphous carbon (ta-C) films from mol. dynamics simulations based on a machine-learned interat. potential trained from d.-functional theory data. For the first time, the high sp3 fractions in excess of 85% obsd. exptl. are reproduced by means of computational simulation, and the deposition energy dependence of the film's characteristics is also accurately described. High confidence in the potential and direct access to the at. interactions allow us to infer the microscopic growth mechanism in this material. While the widespread view is that ta-C grows by "subplantation," we show that the so-called "peening" model is actually the dominant mechanism responsible for the high sp3 content. We show that pressure waves lead to bond rearrangement away from the impact site of the incident ion, and high sp3 fractions arise from a delicate balance of transitions between three- and fourfold coordinated carbon atoms. These results open the door for a microscopic understanding of carbon nanostructure formation with an unprecedented level of predictive power.
- 11Deringer, V. L.; Bernstein, N.; Bartók, A. P.; Cliffe, M. J.; Kerber, R. N.; Marbella, L. E.; Grey, C. P.; Elliott, S. R.; Csányi, G. Realistic Atomistic Structure of Amorphous Silicon from Machine-Learning-Driven Molecular Dynamics. J. Phys. Chem. Lett. 2018, 9 (11), 2879– 2885, DOI: 10.1021/acs.jpclett.8b00902[ACS Full Text
], [CAS], Google Scholar
11https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXptlyitrk%253D&md5=803bfce8641590478e89276ebb7fc81dRealistic Atomistic Structure of Amorphous Silicon from Machine-Learning-Driven Molecular DynamicsDeringer, Volker L.; Bernstein, Noam; Bartok, Albert P.; Cliffe, Matthew J.; Kerber, Rachel N.; Marbella, Lauren E.; Grey, Clare P.; Elliott, Stephen R.; Csanyi, GaborJournal of Physical Chemistry Letters (2018), 9 (11), 2879-2885CODEN: JPCLCD; ISSN:1948-7185. (American Chemical Society)Amorphous silicon (a-Si) is a widely studied noncryst. material, and yet the subtle details of its atomistic structure are still unclear. Here, we show that accurate structural models of a-Si can be obtained using a machine-learning-based interat. potential. Our best a-Si network is obtained by simulated cooling from the melt at a rate of 1011 K/s (i.e., on the 10 ns time scale), contains less than 2% defects, and agrees with expts. regarding excess energies, diffraction data, and 29Si NMR chem. shifts. We show that this level of quality is impossible to achieve with faster quench simulations. We then generate a 4096-atom system that correctly reproduces the magnitude of the first sharp diffraction peak (FSDP) in the structure factor, achieving the closest agreement with expts. to date. Our study demonstrates the broader impact of machine-learning potentials for elucidating structures and properties of technol. important amorphous materials. - 12Hu, D.; Xie, Y.; Li, X.; Li, L.; Lan, Z. Inclusion of Machine Learning Kernel Ridge Regression Potential Energy Surfaces in On-the-Fly Nonadiabatic Molecular Dynamics Simulation. J. Phys. Chem. Lett. 2018, 9 (11), 2725– 2732, DOI: 10.1021/acs.jpclett.8b00684[ACS Full Text
], [CAS], Google Scholar
12https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXptVequ7g%253D&md5=d7257373e2b1bc7791fdda02c8e18b47Inclusion of Machine Learning Kernel Ridge Regression Potential Energy Surfaces in On-the-Fly Nonadiabatic Molecular Dynamics SimulationHu, Deping; Xie, Yu; Li, Xusong; Li, Lingyue; Lan, ZhenggangJournal of Physical Chemistry Letters (2018), 9 (11), 2725-2732CODEN: JPCLCD; ISSN:1948-7185. (American Chemical Society)We discuss a theor. approach that employs machine learning potential energy surfaces (ML-PESs) in the nonadiabatic dynamics simulation of polyat. systems by taking 6-aminopyrimidine as a typical example. The Zhu-Nakamura theory is employed in the surface hopping dynamics, which does not require the calcn. of the nonadiabatic coupling vectors. The kernel ridge regression is used in the construction of the adiabatic PESs. In the nonadiabatic dynamics simulation, we use ML-PESs for most geometries and switch back to the electronic structure calcns. for a few geometries either near the S1/S0 conical intersections or in the out-of-confidence regions. The dynamics results based on ML-PESs are consistent with those based on CASSCF PESs. The ML-PESs are further used to achieve the highly efficient massive dynamics simulations with a large no. of trajectories. This work displays the powerful role of ML methods in the nonadiabatic dynamics simulation of polyat. systems. - 13Smith, J. S.; Isayev, O.; Roitberg, A. E. ANI-1: An Extensible Neural Network Potential with DFT Accuracy at Force Field Computational Cost. Chem. Sci. 2017, 8 (4), 3192– 3203, DOI: 10.1039/C6SC05720A[Crossref], [PubMed], [CAS], Google Scholar13https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2sXitlGnsrs%253D&md5=95b2f5106c620c6f09560966dba3559eANI-1: an extensible neural network potential with DFT accuracy at force field computational costSmith, J. S.; Isayev, O.; Roitberg, A. E.Chemical Science (2017), 8 (4), 3192-3203CODEN: CSHCCN; ISSN:2041-6520. (Royal Society of Chemistry)A review. Deep learning is revolutionizing many areas of science and technol., esp. image, text, and speech recognition. In this paper, we demonstrate how a deep neural network (NN) trained on quantum mech. (QM) DFT calcns. can learn an accurate and transferable potential for org. mols. We introduce ANAKIN-ME (Accurate NeurAl networK engINe for Mol. Energies) or ANI for short. ANI is a new method designed with the intent of developing transferable neural network potentials that utilize a highly-modified version of the Behler and Parrinello symmetry functions to build single-atom at. environment vectors (AEV) as a mol. representation. AEVs provide the ability to train neural networks to data that spans both configurational and conformational space, a feat not previously accomplished on this scale. We utilized ANI to build a potential called ANI-1, which was trained on a subset of the GDB databases with up to 8 heavy atoms in order to predict total energies for org. mols. contg. four atom types: H, C, N, and O. To obtain an accelerated but phys. relevant sampling of mol. potential surfaces, we also proposed a Normal Mode Sampling (NMS) method for generating mol. conformations. Through a series of case studies, we show that ANI-1 is chem. accurate compared to ref. DFT calcns. on much larger mol. systems (up to 54 atoms) than those included in the training data set.
- 14Smith, J. S.; Nebgen, B.; Lubbers, N.; Isayev, O.; Roitberg, A. E. Less Is More: Sampling Chemical Space with Active Learning. J. Chem. Phys. 2018, 148 (24), 241733, DOI: 10.1063/1.5023802[Crossref], [PubMed], [CAS], Google Scholar14https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXpvVOjurk%253D&md5=c6894cdd3c471d7ceed10d8b2c095a03Less is more: Sampling chemical space with active learningSmith, Justin S.; Nebgen, Ben; Lubbers, Nicholas; Isayev, Olexandr; Roitberg, Adrian E.Journal of Chemical Physics (2018), 148 (24), 241733/1-241733/10CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)The development of accurate and transferable machine learning (ML) potentials for predicting mol. energetics is a challenging task. The process of data generation to train such ML potentials is a task neither well understood nor researched in detail. In this work, we present a fully automated approach for the generation of datasets with the intent of training universal ML potentials. It is based on the concept of active learning (AL) via Query by Committee (QBC), which uses the disagreement between an ensemble of ML potentials to infer the reliability of the ensemble's prediction. QBC allows the presented AL algorithm to automatically sample regions of chem. space where the ML potential fails to accurately predict the potential energy. AL improves the overall fitness of ANAKIN-ME (ANI) deep learning potentials in rigorous test cases by mitigating human biases in deciding what new training data to use. AL also reduces the training set size to a fraction of the data required when using naive random sampling techniques. To provide validation of our AL approach, we develop the COmprehensive Machine-learning Potential (COMP6) benchmark (publicly available on GitHub) which contains a diverse set of org. mols. Active learning-based ANI potentials outperform the original random sampled ANI-1 potential with only 10% of the data, while the final active learning-based model vastly outperforms ANI-1 on the COMP6 benchmark after training to only 25% of the data. Finally, we show that our proposed AL technique develops a universal ANI potential (ANI-1x) that provides accurate energy and force predictions on the entire COMP6 benchmark. This universal ML potential achieves a level of accuracy on par with the best ML potentials for single mols. or materials, while remaining applicable to the general class of org. mols. composed of the elements CHNO. (c) 2018 American Institute of Physics.
- 15Smith, J. S.; Nebgen, B. T.; Zubatyuk, R.; Lubbers, N.; Devereux, C.; Barros, K.; Tretiak, S.; Isayev, O.; Roitberg, A. E. Approaching Coupled Cluster Accuracy with a General-Purpose Neural Network Potential through Transfer Learning. Nat. Commun. 2019, 10 (1), 2903, DOI: 10.1038/s41467-019-10827-4[Crossref], [PubMed], [CAS], Google Scholar15https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A280%3ADC%252BB3MzjtFaitg%253D%253D&md5=6476e866a59d1408cbfba321242a353dApproaching coupled cluster accuracy with a general-purpose neural network potential through transfer learningSmith Justin S; Devereux Christian; Roitberg Adrian E; Smith Justin S; Nebgen Benjamin T; Zubatyuk Roman; Lubbers Nicholas; Barros Kipton; Tretiak Sergei; Smith Justin S; Lubbers Nicholas; Nebgen Benjamin T; Tretiak Sergei; Zubatyuk Roman; Isayev OlexandrNature communications (2019), 10 (1), 2903 ISSN:.Computational modeling of chemical and biological systems at atomic resolution is a crucial tool in the chemist's toolset. The use of computer simulations requires a balance between cost and accuracy: quantum-mechanical methods provide high accuracy but are computationally expensive and scale poorly to large systems, while classical force fields are cheap and scalable, but lack transferability to new systems. Machine learning can be used to achieve the best of both approaches. Here we train a general-purpose neural network potential (ANI-1ccx) that approaches CCSD(T)/CBS accuracy on benchmarks for reaction thermochemistry, isomerization, and drug-like molecular torsions. This is achieved by training a network to DFT data then using transfer learning techniques to retrain on a dataset of gold standard QM calculations (CCSD(T)/CBS) that optimally spans chemical space. The resulting potential is broadly applicable to materials science, biology, and chemistry, and billions of times faster than CCSD(T)/CBS calculations.
- 16Chmiela, S.; Tkatchenko, A.; Sauceda, H. E.; Poltavsky, I.; Schütt, K. T.; Müller, K. R. Machine Learning of Accurate Energy-Conserving Molecular Force Fields. Sci. Adv. 2017, 3 (5), e1603015 DOI: 10.1126/sciadv.1603015[Crossref], [PubMed], [CAS], Google Scholar16https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXkvVGjsrg%253D&md5=f26ef9dd87735d5d8fdbf5cc1ab9ae7cMachine learning of accurate energy-conserving molecular force fieldsChmiela, Stefan; Tkatchenko, Alexandre; Sauceda, Huziel E.; Poltavsky, Igor; Schuett, Kristof T.; Mueller, Klaus-RobertScience Advances (2017), 3 (5), e1603015/1-e1603015/6CODEN: SACDAF; ISSN:2375-2548. (American Association for the Advancement of Science)Using conservation of energy-a fundamental property of closed classical and quantum mech. systems-we develop an efficient gradient-domain machine learning (GDML) approach to construct accurate mol. force fields using a restricted no. of samples from ab initio mol. dynamics (AIMD) trajectories. The GDML implementation is able to reproduce global potential energy surfaces of intermediate-sized mols. with an accuracy of 0.3 kcal mol-1 for energies and 1 kcal mol-1 Å-1 for at. forces using only 1000 conformational geometries for training. We demonstrate this accuracy for AIMD trajectories of mols., including benzene, toluene, naphthalene, ethanol, uracil, and aspirin. The challenge of constructing conservative force fields is accomplished in our work by learning in a Hilbert space of vector-valued functions that obey the law of energy conservation. The GDML approach enables quant. mol. dynamics simulations for mols. at a fraction of cost of explicit AIMD calcns., thereby allowing the construction of efficient force fields with the accuracy and transferability of high-level ab initio methods.
- 17Chmiela, S.; Sauceda, H. E.; Müller, K. R.; Tkatchenko, A. Towards Exact Molecular Dynamics Simulations with Machine-Learned Force Fields. Nat. Commun. 2018, 9 (1), 3887, DOI: 10.1038/s41467-018-06169-2[Crossref], [PubMed], [CAS], Google Scholar17https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A280%3ADC%252BB3czit1Oqsw%253D%253D&md5=90e14b965fe7f099602125037818aa32Towards exact molecular dynamics simulations with machine-learned force fieldsChmiela Stefan; Muller Klaus-Robert; Sauceda Huziel E; Muller Klaus-Robert; Muller Klaus-Robert; Tkatchenko AlexandreNature communications (2018), 9 (1), 3887 ISSN:.Molecular dynamics (MD) simulations employing classical force fields constitute the cornerstone of contemporary atomistic modeling in chemistry, biology, and materials science. However, the predictive power of these simulations is only as good as the underlying interatomic potential. Classical potentials often fail to faithfully capture key quantum effects in molecules and materials. Here we enable the direct construction of flexible molecular force fields from high-level ab initio calculations by incorporating spatial and temporal physical symmetries into a gradient-domain machine learning (sGDML) model in an automatic data-driven way. The developed sGDML approach faithfully reproduces global force fields at quantum-chemical CCSD(T) level of accuracy and allows converged molecular dynamics simulations with fully quantized electrons and nuclei. We present MD simulations, for flexible molecules with up to a few dozen atoms and provide insights into the dynamical behavior of these molecules. Our approach provides the key missing ingredient for achieving spectroscopic accuracy in molecular simulations.
- 18Sauceda, H. E.; Chmiela, S.; Poltavsky, I.; Müller, K. R.; Tkatchenko, A. Molecular Force Fields with Gradient-Domain Machine Learning: Construction and Application to Dynamics of Small Molecules with Coupled Cluster Forces. J. Chem. Phys. 2019, 150 (11), 114102, DOI: 10.1063/1.5078687[Crossref], [PubMed], [CAS], Google Scholar18https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXltFGgsrk%253D&md5=f7e80e4bb220c36c4e5fb7e44454657aMolecular force fields with gradient-domain machine learning: Construction and application to dynamics of small molecules with coupled cluster forcesSauceda, Huziel E.; Chmiela, Stefan; Poltavsky, Igor; Mueller, Klaus-Robert; Tkatchenko, AlexandreJournal of Chemical Physics (2019), 150 (11), 114102/1-114102/12CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)We present the construction of mol. force fields for small mols. (less than 25 atoms) using the recently developed symmetrized gradient-domain machine learning (sGDML) approach [Chmiela et al., Nat. Commun. 9, 3887 (2018) and Chmiela et al., Sci. Adv. 3, e1603015 (2017)]. This approach is able to accurately reconstruct complex high-dimensional potential-energy surfaces from just a few 100s of mol. conformations extd. from ab initio mol. dynamics trajectories. The data efficiency of the sGDML approach implies that at. forces for these conformations can be computed with high-level wavefunction-based approaches, such as the "gold std." coupled-cluster theory with single, double and perturbative triple excitations [CCSD(T)]. We demonstrate that the flexible nature of the sGDML model recovers local and non-local electronic interactions (e.g., H-bonding, proton transfer, lone pairs, changes in hybridization states, steric repulsion, and n → π* interactions) without imposing any restriction on the nature of interat. potentials. The anal. of sGDML mol. dynamics trajectories yields new qual. insights into dynamics and spectroscopy of small mols. close to spectroscopic accuracy. (c) 2019 American Institute of Physics.
- 19Schütt, K. T.; Arbabzadah, F.; Chmiela, S.; Müller, K. R.; Tkatchenko, A. Quantum-Chemical Insights from Deep Tensor Neural Networks. Nat. Commun. 2017, 8 (1), 13890, DOI: 10.1038/ncomms13890[Crossref], [PubMed], [CAS], Google Scholar19https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2sXpsVOnsg%253D%253D&md5=6007758e2a0029c5a854673a5451bc7fQuantum-chemical insights from deep tensor neural networksSchuett, Kristof T.; Arbabzadah, Farhad; Chmiela, Stefan; Mueller, Klaus R.; Tkatchenko, AlexandreNature Communications (2017), 8 (), 13890CODEN: NCAOBW; ISSN:2041-1723. (Nature Publishing Group)Learning from data has led to paradigm shifts in a multitude of disciplines, including web, text and image search, speech recognition, as well as bioinformatics. Can machine learning enable similar breakthroughs in understanding quantum many-body systems. Here we develop an efficient deep learning approach that enables spatially and chem. resolved insights into quantum-mech. observables of mol. systems. We unify concepts from many-body Hamiltonians with purpose-designed deep tensor neural networks, which leads to size-extensive and uniformly accurate (1 kcal mol-1) predictions in compositional and configurational chem. space for mols. of intermediate size. As an example of chem. relevance, the model reveals a classification of arom. rings with respect to their stability. Further applications of our model for predicting at. energies and local chem. potentials in mols., reliable isomer energies, and mols. with peculiar electronic structure demonstrate the potential of machine learning for revealing insights into complex quantum-chem. systems.
- 20Schütt, K. T.; Sauceda, H. E.; Kindermans, P.-J.; Tkatchenko, A.; Müller, K. R. SchNet-A Deep Learning Architecture for Molecules and Materials. J. Chem. Phys. 2018, 148 (24), 241722, DOI: 10.1063/1.5019779[Crossref], [PubMed], [CAS], Google Scholar20https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXms1Ggurs%253D&md5=988638d520a423f529a16b35031243aaSchNet - A deep learning architecture for molecules and materialsSchuett, K. T.; Sauceda, H. E.; Kindermans, P.-J.; Tkatchenko, A.; Mueller, K.-R.Journal of Chemical Physics (2018), 148 (24), 241722/1-241722/11CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)Deep learning has led to a paradigm shift in artificial intelligence, including web, text, and image search, speech recognition, as well as bioinformatics, with growing impact in chem. physics. Machine learning, in general, and deep learning, in particular, are ideally suitable for representing quantum-mech. interactions, enabling us to model nonlinear potential-energy surfaces or enhancing the exploration of chem. compd. space. Here, we present the deep learning architecture SchNet that is specifically designed to model atomistic systems by making use of continuous-filter convolutional layers. We demonstrate the capabilities of SchNet by accurately predicting a range of properties across chem. space for mols. and materials, where our model learns chem. plausible embeddings of atom types across the periodic table. Finally, we employ SchNet to predict potential-energy surfaces and energy-conserving force fields for mol. dynamics simulations of small mols. and perform an exemplary study on the quantum-mech. properties of C20-fullerene that would have been infeasible with regular ab initio mol. dynamics. (c) 2018 American Institute of Physics.
- 21Schütt, K. T.; Kessel, P.; Gastegger, M.; Nicoli, K. A.; Tkatchenko, A.; Müller, K. R. SchNetPack: A Deep Learning Toolbox for Atomistic Systems. J. Chem. Theory Comput. 2019, 15 (1), 448– 455, DOI: 10.1021/acs.jctc.8b00908[ACS Full Text
], [CAS], Google Scholar
21https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXitlalsLvN&md5=51b19679412f490ee9260a986371e66bSchNetPack: A Deep Learning Toolbox For Atomistic SystemsSchuett, K. T.; Kessel, P.; Gastegger, M.; Nicoli, K. A.; Tkatchenko, A.; Mueller, K.-R.Journal of Chemical Theory and Computation (2019), 15 (1), 448-455CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)SchNetPack is a toolbox for the development and application of deep neural networks that predict potential energy surfaces and other quantum-chem. properties of mols. and materials. It contains basic building blocks of atomistic neural networks, manages their training, and provides simple access to common benchmark datasets. This allows for an easy implementation and evaluation of new models. For now, SchNetPack includes implementations of (weighted) atom-centered symmetry functions and the deep tensor neural network SchNet, as well as ready-to-use scripts that allow one to train these models on mol. and material datasets. Based on the PyTorch deep learning framework, SchNetPack allows one to efficiently apply the neural networks to large datasets with millions of ref. calcns., as well as parallelize the model across multiple GPUs. Finally, SchNetPack provides an interface to the Atomic Simulation Environment in order to make trained models easily accessible to researchers that are not yet familiar with neural networks. - 22Unke, O. T.; Meuwly, M. PhysNet: A Neural Network for Predicting Energies, Forces, Dipole Moments and Partial Charges. J. Chem. Theory Comput. 2019, 15 (6), 3678– 3693, DOI: 10.1021/acs.jctc.9b00181[ACS Full Text
], [CAS], Google Scholar
22https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXosF2ms7g%253D&md5=77bea45c52d12b93267bc785d3d4b375PhysNet: A Neural Network for Predicting Energies, Forces, Dipole Moments, and Partial ChargesUnke, Oliver T.; Meuwly, MarkusJournal of Chemical Theory and Computation (2019), 15 (6), 3678-3693CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)In recent years, machine learning (ML) methods have become increasingly popular in computational chem. After being trained on appropriate ab initio ref. data, these methods allow for accurately predicting the properties of chem. systems, circumventing the need for explicitly solving the electronic Schrodinger equation. Because of their computational efficiency and scalability to large data sets, deep neural networks (DNNs) are a particularly promising ML algorithm for chem. applications. This work introduces PhysNet, a DNN architecture designed for predicting energies, forces, and dipole moments of chem. systems. PhysNet achieves state-of-the-art performance on the QM9, MD17, and ISO17 benchmarks. Further, two new data sets are generated in order to probe the performance of ML models for describing chem. reactions, long-range interactions, and condensed phase systems. It is shown that explicitly including electrostatics in energy predictions is crucial for a qual. correct description of the asymptotic regions of a potential energy surface (PES). PhysNet models trained on a systematically constructed set of small peptide fragments (at most eight heavy atoms) are able to generalize to considerably larger proteins like deca-alanine (Ala10): The optimized geometry of helical Ala10 predicted by PhysNet is virtually identical to ab initio results (RMSD = 0.21 Å). By running unbiased mol. dynamics (MD) simulations of Ala10 on the PhysNet-PES in gas phase, it is found that instead of a helical structure, Ala10 folds into a "wreath-shaped" configuration, which is more stable than the helical form by 0.46 kcal mol-1 according to the ref. ab initio calcns. - 23Brockherde, F.; Vogt, L.; Li, L.; Tuckerman, M. E.; Burke, K.; Müller, K. R. Bypassing the Kohn-Sham Equations with Machine Learning. Nat. Commun. 2017, 8 (1), 872, DOI: 10.1038/s41467-017-00839-3[Crossref], [PubMed], [CAS], Google Scholar23https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A280%3ADC%252BC1M%252Fps1GhsA%253D%253D&md5=c3e11bea0346fbb9ae06c3f67453f2a2Bypassing the Kohn-Sham equations with machine learningBrockherde Felix; Muller Klaus-Robert; Brockherde Felix; Vogt Leslie; Tuckerman Mark E; Li Li; Burke Kieron; Tuckerman Mark E; Tuckerman Mark E; Burke Kieron; Muller Klaus-Robert; Muller Klaus-RobertNature communications (2017), 8 (1), 872 ISSN:.Last year, at least 30,000 scientific papers used the Kohn-Sham scheme of density functional theory to solve electronic structure problems in a wide variety of scientific fields. Machine learning holds the promise of learning the energy functional via examples, bypassing the need to solve the Kohn-Sham equations. This should yield substantial savings in computer time, allowing larger systems and/or longer time-scales to be tackled, but attempts to machine-learn this functional have been limited by the need to find its derivative. The present work overcomes this difficulty by directly learning the density-potential and energy-density maps for test systems and various molecules. We perform the first molecular dynamics simulation with a machine-learned density functional on malonaldehyde and are able to capture the intramolecular proton transfer process. Learning density models now allows the construction of accurate density functionals for realistic molecular systems.Machine learning allows electronic structure calculations to access larger system sizes and, in dynamical simulations, longer time scales. Here, the authors perform such a simulation using a machine-learned density functional that avoids direct solution of the Kohn-Sham equations.
- 24Petraglia, R.; Nicolaï, A.; Wodrich, M. D.; Ceriotti, M.; Corminboeuf, C. Beyond Static Structures: Putting Forth REMD as a Tool to Solve Problems in Computational Organic Chemistry. J. Comput. Chem. 2016, 37 (1), 83– 92, DOI: 10.1002/jcc.24025[Crossref], [PubMed], [CAS], Google Scholar24https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2MXht1GltbjE&md5=046930a574db03f8e28453700de204aaBeyond static structures: Putting forth REMD as a tool to solve problems in computational organic chemistryPetraglia, Riccardo; Nicolai, Adrien; Wodrich, Matthew D.; Ceriotti, Michele; Corminboeuf, ClemenceJournal of Computational Chemistry (2016), 37 (1), 83-92CODEN: JCCHDD; ISSN:0192-8651. (John Wiley & Sons, Inc.)Computational studies of org. systems are frequently limited to static pictures that closely align with textbook style presentations of reaction mechanisms and isomerization processes. Of course, in reality chem. systems are dynamic entities where a multitude of mol. conformations exists on incredibly complex potential energy surfaces (PES). Here, we borrow a computational technique originally conceived to be used in the context of biol. simulations, together with empirical force fields, and apply it to org. chem. problems. Replica-exchange mol. dynamics (REMD) permits thorough exploration of the PES. We combined REMD with d. functional tight binding (DFTB), thereby establishing the level of accuracy necessary to analyze small mol. systems. Through the study of four prototypical problems: isomer identification, reaction mechanisms, temp.-dependent rotational processes, and catalysis, we reveal new insights and chem. that likely would be missed using static electronic structure computations. The REMD-DFTB methodol. at the heart of this study is powered by i-PI, which efficiently handles the interface between the DFTB and REMD codes. © 2015 Wiley Periodicals, Inc.
- 25Sugita, Y.; Okamoto, Y. Replica-Exchange Molecular Dynamics Method for Protein Folding. Chem. Phys. Lett. 1999, 314, 141– 151, DOI: 10.1016/S0009-2614(99)01123-9[Crossref], [CAS], Google Scholar25https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADyaK1MXotVKrsLc%253D&md5=0fec0ff81ca7806c1e1ac29e5f50ce19Replica-exchange molecular dynamics method for protein foldingSugita, Y.; Okamoto, Y.Chemical Physics Letters (1999), 314 (1,2), 141-151CODEN: CHPLBC; ISSN:0009-2614. (Elsevier Science B.V.)We have developed a formulation for mol. dynamics algorithm for the replica-exchange method. The effectiveness of the method for the protein-folding problem is tested with the penta-peptide Met-enkephalin. The method can overcome the multiple-min. problem by exchanging non-interacting replicas of the system at several temps. From only one simulation run, one can obtain probability distributions in canonical ensemble for a wide temp. range using multiple-histogram re-weighting techniques, which allows the calcn. of any thermodn. quantity as a function of temp. in that range.
- 26Gaus, M.; Cui, Q.; Elstner, M. DFTB3: Extension of the Self-Consistent-Charge Density-Functional Tight-Binding Method (SCC-DFTB). J. Chem. Theory Comput. 2011, 7 (4), 931– 948, DOI: 10.1021/ct100684s[ACS Full Text
], [CAS], Google Scholar
26https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3MXjtVKgu74%253D&md5=179659060fa503023375266a674d02e7DFTB3: Extension of the Self-Consistent-Charge Density-Functional Tight-Binding Method (SCC-DFTB)Gaus, Michael; Cui, Qiang; Elstner, MarcusJournal of Chemical Theory and Computation (2011), 7 (4), 931-948CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)The self-consistent-charge d.-functional tight-binding method (SCC-DFTB) is an approx. quantum chem. method derived from d. functional theory (DFT) based on a second-order expansion of the DFT total energy around a ref. d. In the present study, we combine earlier extensions and improve them consistently with, first, an improved Coulomb interaction between at. partial charges and, second, the complete third-order expansion of the DFT total energy. These modifications lead us to the next generation of the DFTB methodol. called DFTB3, which substantially improves the description of charged systems contg. elements C, H, N, O, and P, esp. regarding hydrogen binding energies and proton affinities. As a result, DFTB3 is particularly applicable to biomol. systems. Remaining challenges and possible solns. are also briefly discussed. - 27Gaus, M.; Goez, A.; Elstner, M. Parametrization and Benchmark of DFTB3 for Organic Molecules. J. Chem. Theory Comput. 2013, 9 (1), 338– 354, DOI: 10.1021/ct300849w[ACS Full Text
], [CAS], Google Scholar
27https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC38Xhs1ent77L&md5=2ddde03653e32a857d7416e755332515Parametrization and Benchmark of DFTB3 for Organic MoleculesGaus, Michael; Goez, Albrecht; Elstner, MarcusJournal of Chemical Theory and Computation (2013), 9 (1), 338-354CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)DFTB3 is a recent extension of the self-consistent-charge d.-functional tight-binding method (SCC-DFTB) and derived from a third order expansion of the d. functional theory (DFT) total energy around a given ref. d. Being applied in combination with the parametrization of its predecessor (MIO), DFTB3 improves for hydrogen binding energies, proton affinities, and hydrogen transfer barriers. In the present study, parameters esp. designed for DFTB3 are presented, and its performance is evaluated for small org. mols. focusing on thermochem., geometries, and vibrational frequencies from our own and several databases from literature. The new parameters remove significant overbinding errors, reduce errors for geometries of noncovalent interactions, and improve the overall performance. - 28Gaus, M.; Lu, X.; Elstner, M.; Cui, Q. Parameterization of DFTB3/3OB for Sulfur and Phosphorus for Chemical and Biological Applications. J. Chem. Theory Comput. 2014, 10 (4), 1518– 1537, DOI: 10.1021/ct401002w[ACS Full Text
], [CAS], Google Scholar
28https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2cXktFeitrw%253D&md5=6eda218757c17fc08cdf820c3c9452eaParameterization of DFTB3/3OB for Sulfur and Phosphorus for Chemical and Biological ApplicationsGaus, Michael; Lu, Xiya; Elstner, Marcus; Cui, QiangJournal of Chemical Theory and Computation (2014), 10 (4), 1518-1537CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)We report the parametrization of the approx. d. functional tight binding method, DFTB3, for sulfur and phosphorus. The parametrization is done in a framework consistent with our previous 3OB set established for O, N, C, and H, thus the resulting parameters can be used to describe a broad set of org. and biol. relevant mols. The 3d orbitals are included in the parametrization, and the electronic parameters are chosen to minimize errors in the atomization energies. The parameters are tested using a fairly diverse set of mols. of biol. relevance, focusing on the geometries, reaction energies, proton affinities, and hydrogen bonding interactions of these mols.; vibrational frequencies are also examd., although less systematically. The results of DFTB3/3OB are compared to those from DFT (B3LYP and PBE), ab initio (MP2, G3B3), and several popular semiempirical methods (PM6 and PDDG), as well as predictions of DFTB3 with the older parametrization (the MIO set). In general, DFTB3/3OB is a major improvement over the previous parametrization (DFTB3/MIO), and for the majority cases tested here, it also outperforms PM6 and PDDG, esp. for structural properties, vibrational frequencies, hydrogen bonding interactions, and proton affinities. For reaction energies, DFTB3/3OB exhibits major improvement over DFTB3/MIO, due mainly to significant redn. of errors in atomization energies; compared to PM6 and PDDG, DFTB3/3OB also generally performs better, although the magnitude of improvement is more modest. Compared to high-level calcns., DFTB3/3OB is most successful at predicting geometries; larger errors are found in the energies, although the results can be greatly improved by computing single point energies at a high level with DFTB3 geometries. There are several remaining issues with the DFTB3/3OB approach, most notably its difficulty in describing phosphate hydrolysis reactions involving a change in the coordination no. of the phosphorus, for which a specific parametrization (3OB/OPhyd) is developed as a temporary soln.; this suggests that the current DFTB3 methodol. has limited transferability for complex phosphorus chem. at the level of accuracy required for detailed mechanistic investigations. Therefore, fundamental improvements in the DFTB3 methodol. are needed for a reliable method that describes phosphorus chem. without ad hoc parameters. Nevertheless, DFTB3/3OB is expected to be a competitive QM method in QM/MM calcns. for studying phosphorus/sulfur chem. in condensed phase systems, esp. as a low-level method that drives the sampling in a dual-level QM/MM framework. - 29Laio, A.; Parrinello, M. Escaping free-energy minima. Proc. Natl. Acad. Sci. U. S. A. 2002, 99 (20), 12562– 12566, DOI: 10.1073/pnas.202427399[Crossref], [PubMed], [CAS], Google Scholar29https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD38XnvFGiurc%253D&md5=48d5bc7436f3ef9d78369671e70fa608Escaping free-energy minimaLaio, Alessandro; Parrinello, MicheleProceedings of the National Academy of Sciences of the United States of America (2002), 99 (20), 12562-12566CODEN: PNASA6; ISSN:0027-8424. (National Academy of Sciences)We introduce a powerful method for exploring the properties of the multidimensional free energy surfaces (FESs) of complex many-body systems by means of coarse-grained non-Markovian dynamics in the space defined by a few collective coordinates. A characteristic feature of these dynamics is the presence of a history-dependent potential term that, in time, fills the min. in the FES, allowing the efficient exploration and accurate detn. of the FES as a function of the collective coordinates. We demonstrate the usefulness of this approach in the case of the dissocn. of a NaCl mol. in water and in the study of the conformational changes of a dialanine in soln.
- 30Grimme, S. Exploration of Chemical Compound, Conformer, and Reaction Space with Meta-Dynamics Simulations Based on Tight-Binding Quantum Chemical Calculations. J. Chem. Theory Comput. 2019, 15 (5), 2847– 2862, DOI: 10.1021/acs.jctc.9b00143[ACS Full Text
], [CAS], Google Scholar
30https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXms1ahs7Y%253D&md5=ec5d26600f13710436a96c608c2b743dExploration of Chemical Compound, Conformer, and Reaction Space with Meta-Dynamics Simulations Based on Tight-Binding Quantum Chemical CalculationsGrimme, StefanJournal of Chemical Theory and Computation (2019), 15 (5), 2847-2862CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)The semiempirical tight-binding based quantum chem. method GFN2-xTB is used in the framework of meta-dynamics (MTD) to globally explore chem. compd., conformer, and reaction space. The biasing potential given as a sum of Gaussian functions is expressed with the root-mean-square-deviation (RMSD) in Cartesian space as a metric for the collective variables. This choice makes the approach robust and generally applicable to three common problems (i.e., conformer search, chem. reaction space exploration in a virtual nanoreactor, and for guessing reaction paths). Because of the inherent locality of the at. RMSD, functional group or fragment selective treatments are possible facilitating the investigation of catalytic processes where, for example, only the substrate is thermally activated. Due to the approx. character of the GFN2-xTB method, the resulting structure ensembles require further refinement with more sophisticated, for example, d. functional or wave function theory methods. However, the approach is extremely efficient running routinely on common laptop computers in minutes to hours of computation time even for realistically sized mols. with a few hundred atoms. Furthermore, the underlying potential energy surface for mols. contg. almost all elements (Z = 1-86) is globally consistent including the covalent dissocn. process and electronically complicated situations in, for example, transition metal systems. As examples, thermal decompn., ethyne oligomerization, the oxidn. of hydrocarbons (by oxygen and a P 450 enzyme model), a Miller-Urey model system, a thermally forbidden dimerization, and a multistep intramol. cyclization reaction are shown. For typical conformational search problems of org. drug mols., the new MTD(RMSD) algorithm yields lower energy structures and more complete conformer ensembles at reduced computational effort compared with its already well performing predecessor. - 31Kaczor, A.; Reva, I. D.; Proniewicz, L. M.; Fausto, R. Importance of Entropy in the Conformational Equilibrium of Phenylalanine: A Matrix-Isolation Infrared Spectroscopy and Density Functional Theory Study. J. Phys. Chem. A 2006, 110 (7), 2360– 2370, DOI: 10.1021/jp0550715[ACS Full Text
], [CAS], Google Scholar
31https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD28Xnslegug%253D%253D&md5=f404ca3987ca47b83de2b44adbeedde3Importance of Entropy in the Conformational Equilibrium of Phenylalanine: A Matrix-Isolation Infrared Spectroscopy and Density Functional Theory StudyKaczor, A.; Reva, I. D.; Proniewicz, L. M.; Fausto, R.Journal of Physical Chemistry A (2006), 110 (7), 2360-2370CODEN: JPCAFH; ISSN:1089-5639. (American Chemical Society)The conformational behavior and IR spectrum of L-phenylalanine were studied by matrix-isolation IR spectroscopy and DFT [B3LYP/6-311++G(d,p)] calcns. The fourteen most stable structures were predicted to differ in energy by less than 10 kJ mol-1, eight of them with abundances higher than 5% at the temp. of evapn. of the compd. (423 K). Exptl. results suggest that six conformers contribute to the spectrum of the isolated compd., whereas two conformers (IIb3 and IIIb3) relax in matrix to a more stable form (IIb2) due to low energy barriers for conformational isomerization (conformational cooling). The two lowest-energy conformers (Ib1, Ia) differ only in the arrangement of the amino acid group relative to the Ph ring; they exhibit a relatively strong stabilizing intramol. hydrogen bond of the O-H···N type and the carboxylic group in the trans configuration (O:C-O-H dihedral angle ca. 180°). Type II conformers have a weaker H-bond of the N-H···O=C type, but they bear the more favorable cis arrangement of the carboxylic group. Being considerably more flexible, type II conformers are stabilized by entropy and the relative abundances of two conformers of this type (IIb2 and IIc1) are shown to significantly increase with temp. due to entropic stabilization. At 423 K, these conformers are found to be the first and third most abundant species present in the conformational equil., with relative populations of ca. 15% each, whereas their populations could be expected to be only ca. 5% if entropy effects were not taken into consideration. Indeed, phenylalanine can be considered a notable example of a mol. where entropy plays an essential role in detg. the relative abundance of the possible low-energy conformational states and then, the thermodn. of the compd., even at moderate temps. Upon UV irradn. (λ > 235 nm) of the matrix-isolated compd., unimol. photodecompn. of phenylalanine is obsd. with prodn. of CO2 and phenethylamine. - 32Ess, D. H.; Wheeler, S. E.; Iafe, R. G.; Xu, L.; Çelebi-Ölçüm, N.; Houk, K. N. Bifurcations on Potential Energy Surfaces of Organic Reactions. Angew. Chem., Int. Ed. 2008, 47 (40), 7592– 7601, DOI: 10.1002/anie.200800918[Crossref], [CAS], Google Scholar32https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD1cXht1GltLzJ&md5=03ccbec3508f982d3f39941887485ef3Bifurcations on potential energy surfaces of organic reactionsEss, Daniel H.; Wheeler, Steven E.; Iafe, Robert G.; Xu, Lai; Celebi-Olcum, Nihan; Houk, Kendall N.Angewandte Chemie, International Edition (2008), 47 (40), 7592-7601CODEN: ACIEF5; ISSN:1433-7851. (Wiley-VCH Verlag GmbH & Co. KGaA)A review. A single transition state may lead to multiple intermediates or products if there is a post-transition-state reaction pathway bifurcation. These bifurcations arise when there are sequential transition states with no intervening energy min. For such systems, the shape of the potential energy surface and dynamic effects, rather than transition-state energetics, control selectivity. This Mini review covers recent investigations of org. reactions exhibiting reaction pathway bifurcations. Such phenomena are surprisingly general and affect exptl. observables such as kinetic isotope effects and product distributions.
- 33Rehbein, J.; Carpenter, B. K. Do We Fully Understand What Controls Chemical Selectivity?. Phys. Chem. Chem. Phys. 2011, 13 (47), 20906, DOI: 10.1039/c1cp22565k[Crossref], [PubMed], [CAS], Google Scholar33https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3MXhsFaiu77L&md5=93dfbfa2f6514a4c343503142b536d6aDo we fully understand what controls chemical selectivity?Rehbein, Julia; Carpenter, Barry K.Physical Chemistry Chemical Physics (2011), 13 (47), 20906-20922CODEN: PPCPFQ; ISSN:1463-9076. (Royal Society of Chemistry)Reaction rates and product selectivity of kinetically controlled reactions are not always sufficiently described by std. RRKM or TST theory. Reactions taking place on potential energy surfaces featuring a valley ridge inflection point belong to this class of reactions. Though various research groups could show that reaction path bifurcations are far from being an exception in org. reactions the underlying principles that govern product distributions of those bifurcating reaction pathways are yet not fully understood. This Perspective has the intention to provide an overview of how far our understanding and the development of the theor. foundation have progressed.
- 34Schreiner, P. R.; Reisenauer, H. P.; Ley, D.; Gerbig, D.; Wu, C.-H.; Allen, W. D. Methylhydroxycarbene: Tunneling Control of a Chemical Reaction. Science 2011, 332 (6035), 1300– 1303, DOI: 10.1126/science.1203761[Crossref], [PubMed], [CAS], Google Scholar34https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3MXntVGltL8%253D&md5=777ec16ef3e1128376c7b12cd2872c85Methylhydroxycarbene: Tunneling Control of a Chemical ReactionSchreiner, Peter R.; Reisenauer, Hans Peter; Ley, David; Gerbig, Dennis; Wu, Chia-Hua; Allen, Wesley D.Science (Washington, DC, United States) (2011), 332 (6035), 1300-1303CODEN: SCIEAS; ISSN:0036-8075. (American Association for the Advancement of Science)Chem. reactivity is conventionally understood in broad terms of kinetic vs. thermodn. control, wherein the decisive factor is the lowest activation barrier among the various reaction paths or the lowest free energy of the final products, resp. We demonstrate that quantum-mech. tunneling can supersede traditional kinetic control and direct a reaction exclusively to a product whose reaction path has a higher barrier. Specifically, we prepd. methylhydroxycarbene (H3C-C-OH) via vacuum pyrolysis of pyruvic acid at about 1200 K (K), followed by argon matrix trapping at 11 K. The previously elusive carbene, characterized by UV and IR spectroscopy as well as exacting quantum-mech. computations, undergoes a facile [1,2]hydrogen shift to acetaldehyde via tunneling under a barrier of 28.0 kcal per mol (kcal mol-1), with a half-life of around 1 h. The analogous isomerization to vinyl alc. has a substantially lower barrier of 22.6 kcal mol-1 but is precluded at low temp. by the greater width of the potential energy profile for tunneling.
- 35Plata, R. E.; Singleton, D. A. A Case Study of the Mechanism of Alcohol-Mediated Morita Baylis-Hillman Reactions. The Importance of Experimental Observations. J. Am. Chem. Soc. 2015, 137 (11), 3811– 3826, DOI: 10.1021/ja5111392[ACS Full Text
], [CAS], Google Scholar
35https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2MXjsVWksb8%253D&md5=dcd1973465edea3142926862a225beaeA Case Study of the Mechanism of Alcohol-Mediated Morita Baylis-Hillman Reactions. The Importance of Experimental ObservationsPlata, R. Erik; Singleton, Daniel A.Journal of the American Chemical Society (2015), 137 (11), 3811-3826CODEN: JACSAT; ISSN:0002-7863. (American Chemical Society)The mechanism of the Morita Baylis-Hillman reaction has been heavily studied in the literature, and a long series of computational studies have defined complete theor. energy profiles in these reactions. We employ here a combination of mechanistic probes, including the observation of intermediates, the independent generation and partitioning of intermediates, thermodn. and kinetic measurements on the main reaction and side reactions, isotopic incorporation from solvent, and kinetic isotope effects, to define the mechanism and an exptl. mechanistic free-energy profile for a prototypical Morita Baylis-Hillman reaction in methanol. The results are then used to critically evaluate the ability of computations to predict the mechanism. The most notable prediction of the many computational studies, that of a proton-shuttle pathway, is refuted in favor of a simple but computationally intractable acid-base mechanism. Computational predictions vary vastly, and it is not clear that any significant accurate information that was not already apparent from expt. could have been garnered from computations. With care, entropy calcns. are only a minor contributor to the larger computational error, while literature entropy-correction processes lead to absurd free-energy predictions. The computations aid in interpreting observations but fail utterly as a replacement for expt. - 36Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning; Springer Series in Statistics; Springer New York: New York, NY, 2009.
- 37Fukunishi, H.; Watanabe, O.; Takada, S. On the Hamiltonian Replica Exchange Method for Efficient Sampling of Biomolecular Systems: Application to Protein Structure Prediction. J. Chem. Phys. 2002, 116 (20), 9058– 9067, DOI: 10.1063/1.1472510[Crossref], [CAS], Google Scholar37https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD38XjsFKmsLo%253D&md5=7ac571a5afdd63b0b4b29cfdec06f53bOn the Hamiltonian replica exchange method for efficient sampling of biomolecular systems: Application to protein structure predictionFukunishi, Hiroaki; Watanabe, Osamu; Takada, ShojiJournal of Chemical Physics (2002), 116 (20), 9058-9067CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)Motivated by the protein structure prediction problem, we develop two variants of the Hamiltonian replica exchange methods (REMs) for efficient configuration sampling, (1) the scaled hydrophobicity REM and (2) the phantom chain REM, and compare their performance with the ordinary REM. We first point out that the ordinary REM has a shortage for the application to large systems such as biomols. and that the Hamiltonian REM, an alternative formulation of the REM, can give a remedy for it. We then propose two examples of the Hamiltonian REM that are suitable for a coarse-grained protein model. (1) The scaled hydrophobicity REM preps. replicas that are characterized by various strengths of hydrophobic interaction. The strongest interaction that mimics aq. soln. environment makes proteins folding, while weakened hydrophobicity unfolds proteins as in org. solvent. Exchange between these environments enables proteins to escape from misfolded traps and accelerate conformational search. This resembles the roles of mol. chaperone that assist proteins to fold in vivo. (2) The phantom chain REM uses replicas that allow various degrees of at. overlaps. By allowing at. overlap in some of replicas, the peptide chain can cross over itself, which can accelerate conformation sampling. Using a coarse-gained model we developed, we compute equil. probability distributions for poly-alanine 16-mer and for a small protein by these REMs and compare the accuracy of the results. We see that the scaled hydrophobicity REM is the most efficient method among the three REMs studied.
- 38Okur, A.; Roe, D. R.; Cui, G.; Hornak, V.; Simmerling, C. Improving Convergence of Replica Exchange Simulations through Coupling to a High-Temperature Structure Reservoir. J. Chem. Theory Comput. 2007, 3 (2), 557– 568, DOI: 10.1021/ct600263e[ACS Full Text
], [CAS], Google Scholar
38https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD2sXjtVeitQ%253D%253D&md5=94557a5a1fc0d6750f988638d3ced8aeImproving Convergence of Replica-Exchange Simulations through Coupling to a High-Temperature Structure ReservoirOkur, Asim; Roe, Daniel R.; Cui, Guanglei; Hornak, Viktor; Simmerling, CarlosJournal of Chemical Theory and Computation (2007), 3 (2), 557-568CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)Parallel tempering or replica-exchange mol. dynamics (REMD) significantly increases the efficiency of conformational sampling for complex mol. systems. However, obtaining converged data with REMD remains challenging, esp. for large systems with complex topologies. We propose a new variant to REMD where the replicas are also permitted to exchange with an ensemble of structures that have been generated in advance using high-temp. MD simulations, similar in spirit to J-walking methods. We tested this approach on two non-trivial model systems, a β-hairpin and a 3-stranded β-sheet and compared the results to those obtained from very long ( > 100 ns) std. REMD simulations. The resulting ensembles were indistinguishable, including relative populations of different conformations on the unfolded state. The use of the reservoir is shown to significantly reduce the time required for convergence. - 39Brémond, É.; Golubev, N.; Steinmann, S. N.; Corminboeuf, C. How Important Is Self-Consistency for the dDsC Density Dependent Dispersion Correction?. J. Chem. Phys. 2014, 140, 18A516, DOI: 10.1063/1.4867195
- 40Petraglia, R.; Steinmann, S. N.; Corminboeuf, C. A Fast Charge-Dependent Atom-Pairwise Dispersion Correction for DFTB3. Int. J. Quantum Chem. 2015, 115 (18), 1265– 1272, DOI: 10.1002/qua.24887[Crossref], [CAS], Google Scholar40https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2MXks12gtbo%253D&md5=766623367f245520ce3076f429544665A fast charge-Dependent atom-pairwise dispersion correction for DFTB3Petraglia, Riccardo; Steinmann, Stephan N.; Corminboeuf, ClemenceInternational Journal of Quantum Chemistry (2015), 115 (18), 1265-1272CODEN: IJQCB2; ISSN:0020-7608. (John Wiley & Sons, Inc.)Org. electronic materials remarkably illustrate the importance of the "weak" dispersion interactions that are neglected in the most cost-efficient electronic structure approaches. This work introduces a fast atom-pairwise dispersion correction, dDMC that is compatible with the most recent variant of self-consistent charge d. functional tight binding (SCC-DFTB). The emphasis is placed on improving the description of π-π stacked motifs featuring sulfur-contg. mols. that are known to be esp. challenging for DFTB. Our scheme relies upon the use of Mulliken charges using minimal basis set that are readily available from the DFTB computations at no addnl. cost. The performance and efficiency of the dDMC correction are validated on examples targeting energies, geometries, and mol. dynamic trajectories.
- 41Mashraqui, S. H.; Sangvikar, Y. S.; Meetsma, A. Synthesis and Structures of Thieno[2,3-b]Thiophene Incorporated [3.3]Dithiacyclophanes. Enhanced First Hyperpolarizability in an Unsymmetrically Polarized Cyclophane. Tetrahedron Lett. 2006, 47 (31), 5599– 5602, DOI: 10.1016/j.tetlet.2006.05.098[Crossref], [CAS], Google Scholar41https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD28XmsVyjs7k%253D&md5=db81a2a4f340553bb169377f90323e7bSynthesis and structures of thieno[2,3-b]thiophene incorporated [3.3]dithiacyclophanes. Enhanced first hyperpolarizability in an unsymmetrically polarized cyclophaneMashraqui, Sabir H.; Sangvikar, Yogesh S.; Meetsma, AukeTetrahedron Letters (2006), 47 (31), 5599-5602CODEN: TELEAY; ISSN:0040-4039. (Elsevier B.V.)Dithiacyclophanes incorporating thieno[2,3-b]thiophene have been synthesized, in order to investigate the nonlinear optical properties of donor-acceptor cyclophane I. I displayed significantly higher first hyperpolarizability β (21.6 × 10-30 esu) compared to model II (9.58 × 10-30esu). Relatively higher β in I presumably arises from an extra electron redistribution arising from through-space charge transfer, a feature lacking in II. Moreover, the thermal decompn. temp. of I (300 °C) is higher than that reported for the NLO prototype DANS (295 °C).
- 42Dijkstra, G. D. H.; Kellogg, R. M.; Wynberg, H.; Svendsen, J. S.; Marko, I.; Sharpless, K. B. Conformational Study of Cinchona Alkaloids. A Combined NMR, Molecular Mechanics and x-Ray Approach. J. Am. Chem. Soc. 1989, 111 (21), 8069, DOI: 10.1021/ja00203a001[ACS Full Text
], [CAS], Google Scholar
42https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADyaL1MXlslCnu7g%253D&md5=700cb5ab06c4760120e47704e698111aConformational study of cinchona alkaloids. A combined NMR, molecular mechanics and x-ray approachDijkstra, Gerard D. H.; Kellogg, Richard M.; Wynberg, Hans; Svendsen, John S.; Marko, Istvan; Sharpless, K. BarryJournal of the American Chemical Society (1989), 111 (21), 8069-76CODEN: JACSAT; ISSN:0002-7863.A conformational study of cinchona alkaloids is presented. The conformations of the alkaloids in soln. have been detd. by NOE difference and NOESY NMR technique. The necessary chem. shift assignments of the cinchona alkaloids have been elucidated by combination of COSY and NOE NMR expts. Min. energy conformations of the alkaloids have been calcd. by mol. mechanics. The first known x-ray structure of a cinchona alkaloid with a closed conformation is presented. Finally, the conformational effects of protonation and complexation on the tertiary nitrogen of the quinuclidine ring have been investigated. - 43Zhou, J.; Wakchaure, V.; Kraft, P.; List, B. Primary-Amine-Catalyzed Enantioselective Intramolecular Aldolizations. Angew. Chem., Int. Ed. 2008, 47 (40), 7656, DOI: 10.1002/anie.200802497[Crossref], [CAS], Google Scholar43https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD1cXht1GltL3I&md5=b400840d85d1f3ddc2f7858dd87f5779Primary-amine-catalyzed enantioselective intramolecular aldolizationsZhou, Jian; Wakchaure, Vijay; Kraft, Philip; List, BenjaminAngewandte Chemie, International Edition (2008), 47 (40), 7656-7658CODEN: ACIEF5; ISSN:1433-7851. (Wiley-VCH Verlag GmbH & Co. KGaA)Aldol cyclodehydration of 4-substituted-2,6-heptanediones leads to enantiomerically enriched 5-substituted-3-methyl-2-cyclohexene-1-ones, which serve as perfume ingredients and valuable synthetic building blocks. Primary amines derived from cinchona alkaloids in combination with acetic acid are efficient catalysts for this transformation, which deliver both enantiomers of the celery ketone.
- 44Ramakrishnan, R.; Dral, P. O.; Rupp, M.; von Lilienfeld, O. A. Big Data Meets Quantum Chemistry Approximations: The Δ-Machine Learning Approach. J. Chem. Theory Comput. 2015, 11 (5), 2087– 2096, DOI: 10.1021/acs.jctc.5b00099[ACS Full Text
], [CAS], Google Scholar
44https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2MXmtlams7Y%253D&md5=a59b33f51a9dd6dbad95290f2642c306Big Data Meets Quantum Chemistry Approximations: The Δ-Machine Learning ApproachRamakrishnan, Raghunathan; Dral, Pavlo O.; Rupp, Matthias; von Lilienfeld, O. AnatoleJournal of Chemical Theory and Computation (2015), 11 (5), 2087-2096CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)Chem. accurate and comprehensive studies of the virtual space of all possible mols. are severely limited by the computational cost of quantum chem. We introduce a composite strategy that adds machine learning corrections to computationally inexpensive approx. legacy quantum methods. After training, highly accurate predictions of enthalpies, free energies, entropies, and electron correlation energies are possible, for significantly larger mol. sets than used for training. For thermochem. properties of up to 16k isomers of C7H10O2 we present numerical evidence that chem. accuracy can be reached. We also predict electron correlation energy in post Hartree-Fock methods, at the computational cost of Hartree-Fock, and we establish a qual. relationship between mol. entropy and electron correlation. The transferability of our approach is demonstrated, using semiempirical quantum chem. and machine learning models trained on 1 and 10% of 134k org. mols., to reproduce enthalpies of all remaining mols. at d. functional theory level of accuracy. - 45Fabregat, R. Modular Replica Exchange Simulatior. http://doi.org/10.5281/zenodo.3630553, 2020.Google ScholarThere is no corresponding record for this reference.
- 46Elstner, M.; Hobza, P.; Frauenheim, T.; Suhai, S.; Kaxiras, E. Hydrogen Bonding and Stacking Interactions of Nucleic Acid Base Pairs: A Density-Functional-Theory Based Treatment. J. Chem. Phys. 2001, 114 (12), 5149– 5155, DOI: 10.1063/1.1329889[Crossref], [CAS], Google Scholar46https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD3MXhvV2ktbk%253D&md5=6aa6bf45783f7706d44fa514092a2bccHydrogen bonding and stacking interactions of nucleic acid base pairs: A density-functional-theory based treatmentElstner, Marcus; Hobza, Pavel; Frauenheim, Thomas; Suhai, Sandor; Kaxiras, EfthimiosJournal of Chemical Physics (2001), 114 (12), 5149-5155CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)We extend an approx. d. functional theory (DFT) method for the description of long-range dispersive interactions which are normally neglected by construction, irresp. of the correlation function applied. An empirical formula, consisting of an R-6 term is introduced, which is appropriately damped for short distances; the corresponding C6 coeff., which is calcd. from exptl. at. polarizabilities, can be consistently added to the total energy expression of the method. We apply this approx. DFT plus dispersion energy method to describe the hydrogen bonding and stacking interactions of nucleic acid base pairs. Comparison to MP2/6-31G*(0.25) results shows that the method is capable of reproducing hydrogen bonding as well as the vertical and twist dependence of the interaction energy very accurately.
- 47Aradi, B.; Hourahine, B.; Frauenheim, T. DFTB+, a Sparse Matrix-Based Implementation of the DFTB Method. J. Phys. Chem. A 2007, 111 (26), 5678– 5684, DOI: 10.1021/jp070186p[ACS Full Text
], [CAS], Google Scholar
47https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD2sXmsVaju7c%253D&md5=f75a28788c2a83fdb00bb395178548ddDFTB+, a Sparse Matrix-Based Implementation of the DFTB MethodAradi, B.; Hourahine, B.; Frauenheim, Th.Journal of Physical Chemistry A (2007), 111 (26), 5678-5684CODEN: JPCAFH; ISSN:1089-5639. (American Chemical Society)A new Fortran 95 implementation of the DFTB (d. functional-based tight binding) method was developed, where the sparsity of the DFTB system of equations was exploited. Conventional dense algebra was used only to evaluate the eigenproblems of the system and long-range Coulombic terms, but drop-in O(N) or O(N2) modules were planned to replace the small code sections that these entail. The developed sparse storage structure is discussed in detail, and a short overview of other features of the new code is given. - 48Riplinger, C.; Neese, F. An Efficient and near Linear Scaling Pair Natural Orbital Based Local Coupled Cluster Method. J. Chem. Phys. 2013, 138 (3), 034106, DOI: 10.1063/1.4773581[Crossref], [PubMed], [CAS], Google Scholar48https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3sXpslOqtw%253D%253D&md5=4327115b95524107245acb44ff4aaa7bAn efficient and near linear scaling pair natural orbital based local coupled cluster methodRiplinger, Christoph; Neese, FrankJournal of Chemical Physics (2013), 138 (3), 034106/1-034106/18CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)In previous publications, it was shown that an efficient local coupled cluster method with single- and double excitations can be based on the concept of pair natural orbitals (PNOs) . The resulting local pair natural orbital-coupled-cluster single double (LPNO-CCSD) method has since been proven to be highly reliable and efficient. For large mols., the no. of amplitudes to be detd. is reduced by a factor of 105-106 relative to a canonical CCSD calcn. on the same system with the same basis set. In the original method, the PNOs were expanded in the set of canonical virtual orbitals and single excitations were not truncated. This led to a no. of fifth order scaling steps that eventually rendered the method computationally expensive for large mols. (e.g., >100 atoms). In the present work, these limitations are overcome by a complete redesign of the LPNO-CCSD method. The new method is based on the combination of the concepts of PNOs and projected AOs (PAOs). Thus, each PNO is expanded in a set of PAOs that in turn belong to a given electron pair specific domain. In this way, it is possible to fully exploit locality while maintaining the extremely high compactness of the original LPNO-CCSD wavefunction. No terms are dropped from the CCSD equations and domains are chosen conservatively. The correlation energy loss due to the domains remains below <0.05%, which implies typically 15-20 but occasionally up to 30 atoms per domain on av. The new method has been given the acronym DLPNO-CCSD ("domain based LPNO-CCSD"). The method is nearly linear scaling with respect to system size. The original LPNO-CCSD method had three adjustable truncation thresholds that were chosen conservatively and do not need to be changed for actual applications. In the present treatment, no addnl. truncation parameters have been introduced. Any addnl. truncation is performed on the basis of the three original thresholds. There are no real-space cutoffs. Single excitations are truncated using singles-specific natural orbitals. Pairs are prescreened according to a multipole expansion of a pair correlation energy est. based on local orbital specific virtual orbitals (LOSVs). Like its LPNO-CCSD predecessor, the method is completely of black box character and does not require any user adjustments. It is shown here that DLPNO-CCSD is as accurate as LPNO-CCSD while leading to computational savings exceeding one order of magnitude for larger systems. The largest calcns. reported here featured >8800 basis functions and >450 atoms. In all larger test calcns. done so far, the LPNO-CCSD step took less time than the preceding Hartree-Fock calcn., provided no approxns. have been introduced in the latter. Thus, based on the present development reliable CCSD calcns. on large mols. with unprecedented efficiency and accuracy are realized. (c) 2013 American Institute of Physics.
- 49Neese, F.; Valeev, E. F. Revisiting the Atomic Natural Orbital Approach for Basis Sets: Robust Systematic Basis Sets for Explicitly Correlated and Conventional Correlated ab initio Methods?. J. Chem. Theory Comput. 2011, 7 (1), 33– 43, DOI: 10.1021/ct100396y[ACS Full Text
], [CAS], Google Scholar
49https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3cXhsFSkurjE&md5=65cebb90f419ae18f35f28de2e1169e7Revisiting the Atomic Natural Orbital Approach for Basis Sets: Robust Systematic Basis Sets for Explicitly Correlated and Conventional Correlated ab initio Methods?Neese, Frank; Valeev, Edward F.Journal of Chemical Theory and Computation (2011), 7 (1), 33-43CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)The performance of several families of basis sets for correlated wave function calcns. on mols. is studied. The widely used correlation-consistent basis set family cc-pVXZ (n = D, T, Q, 5) is compared to a systematic series of at. natural orbital basis sets (ano-pVXZ). These basis sets are built from the cc-pV6Z primitives in at. multireference av. coupled pair functional (MR-ACPF) calcns. Segmented basis sets optimized for SCF calcns. (def2-SVP, def2-TZVPP, and def2-QZVPP as well as "pc-n", n = 1, 2, 3) were also tested. Ref. Hartree-Fock energies are detd. with the uncontracted aug-cc-pV6Z basis set for a set of 21 small mols. built from H, B, C, N, O, and F. Ref. coupled cluster CCSD(T) correlation energies were detd. from extrapolation at the cc-pV5Z/cc-pV6Z level. It is found that the ano-pVXZ basis sets outperform the other basis sets. The error in the SCF energies compared to cc-pVXZ basis sets is reduced by about a factor of 3 at each cardinal no. In addn., the ano-pVXZ consistently recovers more correlation energy than their competitors at each cardinal no. The ability of the four families of basis sets to extrapolate SCF and correlation energies to the basis set limit has been investigated. A conclusion by Truhlar is confirmed that the optimum exponent for correlation energy extrapolations at the DZ/TZ level is ∼2.4. All TZ/QZ basis set pairs lead to an optimum exponent close to the expected value of 3. The SCF energy extrapolation proposed by Petersson and co-workers is found to be effective. At the DZ/TZ level, errors in total energies of less than 2 mEh are found for the test set, while at the TZ/QZ level one obtains the total energies within ∼0.3 mEh of the basis set limit. For extrapolation, the "cc" and "ano" bases are found to be similarly successful. Extrapolation results were compared to explicitly correlated calcns. with dedicated basis sets (cc-pVXZ-F12) as well as the ano-pVXZ bases. It is found that the ano-pVXZ + basis sets perform as well as the cc-pVXZ-F12 family (both are of comparable size); addnl. improvement should be possible by reoptimizing the ANO basis sets for explicitly correlated calcns. The error of the extrapolated energies is about 2-3 times smaller than what was found in the explicitly correlated calcns. However, the error in the explicitly correlated calcns. is more systematic, and hence the same conclusion may not hold for the computation of energy differences. - 50Neese, F. The ORCA Program System. Wiley Interdiscip. Rev.: Comput. Mol. Sci. 2012, 2 (1), 73– 78, DOI: 10.1002/wcms.81[Crossref], [CAS], Google Scholar50https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC38XhvFGls7s%253D&md5=a753e33a6f9a326553295596f5c754e5The ORCA program systemNeese, FrankWiley Interdisciplinary Reviews: Computational Molecular Science (2012), 2 (1), 73-78CODEN: WIRCAH; ISSN:1759-0884. (Wiley-Blackwell)A review. ORCA is a general-purpose quantum chem. program package that features virtually all modern electronic structure methods (d. functional theory, many-body perturbation and coupled cluster theories, and multireference and semiempirical methods). It is designed with the aim of generality, extendibility, efficiency, and user friendliness. Its main field of application is larger mols., transition metal complexes, and their spectroscopic properties. ORCA uses std. Gaussian basis functions and is fully parallelized. The article provides an overview of its current possibilities and documents its efficiency.
- 51Dunning, T. H. Gaussian Basis Sets for Use in Correlated Molecular Calculations. I. The Atoms Boron through Neon and Hydrogen. J. Chem. Phys. 1989, 90 (2), 1007– 1023, DOI: 10.1063/1.456153[Crossref], [CAS], Google Scholar51https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADyaL1MXksVGmtrk%253D&md5=c6cd67a3748dc61692a9cb622d2694a0Gaussian basis sets for use in correlated molecular calculations. I. The atoms boron through neon and hydrogenDunning, Thom H., Jr.Journal of Chemical Physics (1989), 90 (2), 1007-23CODEN: JCPSA6; ISSN:0021-9606.Guided by the calcns. on oxygen in the literature, basis sets for use in correlated at. and mol. calcns. were developed for all of the first row atoms from boron through neon, and for hydrogen. As in the oxygen atom calcns., the incremental energy lowerings, due to the addn. of correlating functions, fall into distinct groups. This leads to the concept of correlation-consistent basis sets, i.e., sets which include all functions in a given group as well as all functions in any higher groups. Correlation-consistent sets are given for all of the atoms considered. The most accurate sets detd. in this way, [5s4p3d2f1g], consistently yield 99% of the correlation energy obtained with the corresponding at.-natural-orbital sets, even though the latter contains 50% more primitive functions and twice as many primitive polarization functions. It is estd. that this set yields 94-97% of the total (HF + 1 + 2) correlation energy for the atoms neon through boron.
- 52Kendall, R. A.; Dunning, T. H.; Harrison, R. J. Electron Affinities of the First-Row Atoms Revisited. Systematic Basis Sets and Wave Functions. J. Chem. Phys. 1992, 96 (9), 6796– 6806, DOI: 10.1063/1.462569[Crossref], [CAS], Google Scholar52https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADyaK38XktFClurw%253D&md5=948a06eee10604a8fa37eae2b2ada4beElectron affinities of the first-row atoms revisited. Systematic basis sets and wave functionsKendall, Rick A.; Dunning, Thom H., Jr.; Harrison, Robert J.Journal of Chemical Physics (1992), 96 (9), 6796-806CODEN: JCPSA6; ISSN:0021-9606.The authors describe a reliable procedure for calcg. the electron affinity of an atom and present results for H, B, C, O, and F (H is included for completeness). This procedure involves the use of the recently proposed correlation-consistent basis sets augmented with functions to describe the more diffuse character of the at. anion coupled with a straightforward, uniform expansion of the ref. space for multireference singles and doubles configuration-interaction (MRSD-CI) calcns. A comparison is given with previous results and with corresponding full CI calcns. The most accurate EAs obtained from the MRSD-CI calcns. are (with exptl. values in parentheses): H 0.740 eV (0.754), B 0.258 (0.277), C 1.245 (1.263), O 1.384 (1.461), and F 3.337 (3.401). The EAs obtained from the MR-SDCI calcns. differ by less than 0.03 eV from those predicted by the full CI calcns.
- 53Adamo, C.; Barone, V. Toward Reliable Density Functional Methods without Adjustable Parameters: The PBE0Model. J. Chem. Phys. 1999, 110 (13), 6158– 6170, DOI: 10.1063/1.478522[Crossref], [CAS], Google Scholar53https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADyaK1MXitVCmt7Y%253D&md5=cad4185c69f9232753497f5203d6dc9fToward reliable density functional methods without adjustable parameters: the PBE0 modelAdamo, Carlo; Barone, VincenzoJournal of Chemical Physics (1999), 110 (13), 6158-6170CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)We present an anal. of the performances of a parameter free d. functional model (PBE0) obtained combining the so called PBE generalized gradient functional with a predefined amt. of exact exchange. The results obtained for structural, thermodn., kinetic and spectroscopic (magnetic, IR and electronic) properties are satisfactory and not far from those delivered by the most reliable functionals including heavy parameterization. The way in which the functional is derived and the lack of empirical parameters fitted to specific properties make the PBE0 model a widely applicable method for both quantum chem. and condensed matter physics.
- 54Grimme, S.; Antony, J.; Ehrlich, S.; Krieg, H. A Consistent and Accurate Ab Initio Parametrization of Density Functional Dispersion Correction (DFT-D) for the 94 Elements H-Pu. J. Chem. Phys. 2010, 132 (15), 154104, DOI: 10.1063/1.3382344[Crossref], [PubMed], [CAS], Google Scholar54https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3cXkvVyks7o%253D&md5=2bca89d904579d5565537a0820dc2ae8A consistent and accurate ab initio parametrization of density functional dispersion correction (DFT-D) for the 94 elements H-PuGrimme, Stefan; Antony, Jens; Ehrlich, Stephan; Krieg, HelgeJournal of Chemical Physics (2010), 132 (15), 154104/1-154104/19CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)The method of dispersion correction as an add-on to std. Kohn-Sham d. functional theory (DFT-D) has been refined regarding higher accuracy, broader range of applicability, and less empiricism. The main new ingredients are atom-pairwise specific dispersion coeffs. and cutoff radii that are both computed from first principles. The coeffs. for new eighth-order dispersion terms are computed using established recursion relations. System (geometry) dependent information is used for the first time in a DFT-D type approach by employing the new concept of fractional coordination nos. (CN). They are used to interpolate between dispersion coeffs. of atoms in different chem. environments. The method only requires adjustment of two global parameters for each d. functional, is asymptotically exact for a gas of weakly interacting neutral atoms, and easily allows the computation of at. forces. Three-body nonadditivity terms are considered. The method has been assessed on std. benchmark sets for inter- and intramol. noncovalent interactions with a particular emphasis on a consistent description of light and heavy element systems. The mean abs. deviations for the S22 benchmark set of noncovalent interactions for 11 std. d. functionals decrease by 15%-40% compared to the previous (already accurate) DFT-D version. Spectacular improvements are found for a tripeptide-folding model and all tested metallic systems. The rectification of the long-range behavior and the use of more accurate C6 coeffs. also lead to a much better description of large (infinite) systems as shown for graphene sheets and the adsorption of benzene on an Ag(111) surface. For graphene it is found that the inclusion of three-body terms substantially (by about 10%) weakens the interlayer binding. We propose the revised DFT-D method as a general tool for the computation of the dispersion energy in mols. and solids of any kind with DFT and related (low-cost) electronic structure methods for large systems. (c) 2010 American Institute of Physics.
- 55Ufimtsev, I. S.; Martinez, T. J. Quantum Chemistry on Graphical Processing Units. 3. Analytical Energy Gradients, Geometry Optimization, and First Principles Molecular Dynamics. J. Chem. Theory Comput. 2009, 5 (10), 2619– 2628, DOI: 10.1021/ct9003004[ACS Full Text
], [CAS], Google Scholar
55https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD1MXhtVelurvJ&md5=f4e320ac3479e4b566a6b8bdb9e5add8Quantum Chemistry on Graphical Processing Units. 3. Analytical Energy Gradients, Geometry Optimization, and First Principles Molecular DynamicsUfimtsev, Ivan S.; Martinez, Todd J.Journal of Chemical Theory and Computation (2009), 5 (10), 2619-2628CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)We demonstrate that a video gaming machine contg. two consumer graphical cards can outpace a state-of-the-art quad-core processor workstation by a factor of more than 180× in Hartree-Fock energy + gradient calcns. Such performance makes it possible to run large scale Hartree-Fock and D. Functional Theory calcns., which typically require hundreds of traditional processor cores, on a single workstation. Benchmark Born-Oppenheimer mol. dynamics simulations are performed on two mol. systems using the 3-21G basis set - a hydronium ion solvated by 30 waters (94 atoms, 405 basis functions) and an aspartic acid mol. solvated by 147 waters (457 atoms, 2014 basis functions). Our GPU implementation can perform 27 ps/day and 0.7 ps/day of ab initio mol. dynamics simulation on a single desktop computer for these systems. - 56Titov, A. V.; Ufimtsev, I. S.; Luehr, N.; Martinez, T. J. Generating Efficient Quantum Chemistry Codes for Novel Architectures. J. Chem. Theory Comput. 2013, 9 (1), 213– 221, DOI: 10.1021/ct300321a[ACS Full Text
], [CAS], Google Scholar
56https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC38XhsFygtb7E&md5=8f7e43b6920dea4df1743eef3094401dGenerating Efficient Quantum Chemistry Codes for Novel ArchitecturesTitov, Alexey V.; Ufimtsev, Ivan S.; Luehr, Nathan; Martinez, Todd J.Journal of Chemical Theory and Computation (2013), 9 (1), 213-221CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)We describe an extension of our graphics processing unit (GPU) electronic structure program TeraChem to include atom-centered Gaussian basis sets with d angular momentum functions. This was made possible by a "meta-programming" strategy that leverages computer algebra systems for the derivation of equations and their transformation to correct code. We generate a multitude of code fragments that are formally math. equiv., but differ in their memory and floating-point operation footprints. We then select between different code fragments using empirical testing to find the highest performing code variant. This leads to an optimal balance of floating-point operations and memory bandwidth for a given target architecture without laborious manual tuning. We show that this approach is capable of similar performance compared to our hand-tuned GPU kernels for basis sets with s and p angular momenta. We also demonstrate that mixed precision schemes (using both single and double precision) remain stable and accurate for mols. with d functions. We provide benchmarks of the execution time of entire SCF calcns. using our GPU code and compare to mature CPU based codes, showing the benefits of the GPU architecture for electronic structure theory with appropriately redesigned algorithms. We suggest that the meta-programming and empirical performance optimization approach may be important in future computational chem. applications, esp. in the face of quickly evolving computer architectures. - 57Pearlman, D. A.; Case, D. A.; Caldwell, J. W.; Ross, W. S.; Cheatham, T. E.; DeBolt, S.; Ferguson, D.; Seibel, G.; Kollman, P. AMBER, a Package of Computer Programs for Applying Molecular Mechanics, Normal Mode Analysis, Molecular Dynamics and Free Energy Calculations to Simulate the Structural and Energetic Properties of Molecules. Comput. Phys. Commun. 1995, 91 (1), 1– 41, DOI: 10.1016/0010-4655(95)00041-D[Crossref], [CAS], Google Scholar57https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADyaK2MXps1Wrtrw%253D&md5=8dc71939a46bbc17d5da08782d4e6ec8"AMBER", a package of computer programs for applying molecular mechanics, normal mode analysis, molecular dynamics and free energy calculations to stimulate the structural and energetic properties of moleculesPearlman, David A.; Case, David A.; Caldwell, James W.; Ross, Wilson S.; Cheatham, Thomas E. III; DeBolt, Steve; Ferguson, David; Seibel, George; Kollman, PeterComputer Physics Communications (1995), 91 (1-3), 1-42CODEN: CPHCBZ; ISSN:0010-4655. (Elsevier)We describe the development, current features, and some directions for future development of the AMBER package of computer programs. This package has evolved from a program that was constructed to do Assisted Model Building and Energy Refinement to a group of programs embodying a no. of the powerful tools of modern computational chem.-mol. dynamics and free energy calcns.
- 58Marenich, A. V.; Cramer, C. J.; Truhlar, D. G. Universal Solvation Model Based on Solute Electron Density and on a Continuum Model of the Solvent Defined by the Bulk Dielectric Constant and Atomic Surface Tensions. J. Phys. Chem. B 2009, 113 (18), 6378– 6396, DOI: 10.1021/jp810292n[ACS Full Text
], [CAS], Google Scholar
58https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD1MXksV2is74%253D&md5=54931a64c70d28445ee53876a8b1a4b9Universal Solvation Model Based on Solute Electron Density and on a Continuum Model of the Solvent Defined by the Bulk Dielectric Constant and Atomic Surface TensionsMarenich, Aleksandr V.; Cramer, Christopher J.; Truhlar, Donald G.Journal of Physical Chemistry B (2009), 113 (18), 6378-6396CODEN: JPCBFK; ISSN:1520-6106. (American Chemical Society)We present a new continuum solvation model based on the quantum mech. charge d. of a solute mol. interacting with a continuum description of the solvent. The model is called SMD, where the "D" stands for "d." to denote that the full solute electron d. is used without defining partial at. charges. "Continuum" denotes that the solvent is not represented explicitly but rather as a dielec. medium with surface tension at the solute-solvent boundary. SMD is a universal solvation model, where "universal" denotes its applicability to any charged or uncharged solute in any solvent or liq. medium for which a few key descriptors are known (in particular, dielec. const., refractive index, bulk surface tension, and acidity and basicity parameters). The model separates the observable solvation free energy into two main components. The first component is the bulk electrostatic contribution arising from a self-consistent reaction field treatment that involves the soln. of the nonhomogeneous Poisson equation for electrostatics in terms of the integral-equation-formalism polarizable continuum model (IEF-PCM). The cavities for the bulk electrostatic calcn. are defined by superpositions of nuclear-centered spheres. The second component is called the cavity-dispersion-solvent-structure term and is the contribution arising from short-range interactions between the solute and solvent mols. in the first solvation shell. This contribution is a sum of terms that are proportional (with geometry-dependent proportionality consts. called at. surface tensions) to the solvent-accessible surface areas of the individual atoms of the solute. The SMD model has been parametrized with a training set of 2821 solvation data including 112 aq. ionic solvation free energies, 220 solvation free energies for 166 ions in acetonitrile, methanol, and DMSO, 2346 solvation free energies for 318 neutral solutes in 91 solvents (90 nonaq. org. solvents and water), and 143 transfer free energies for 93 neutral solutes between water and 15 org. solvents. The elements present in the solutes are H, C, N, O, F, Si, P, S, Cl, and Br. The SMD model employs a single set of parameters (intrinsic at. Coulomb radii and at. surface tension coeffs.) optimized over six electronic structure methods: M05-2X/MIDI!6D, M05-2X/6-31G*, M05-2X/6-31+G**, M05-2X/cc-pVTZ, B3LYP/6-31G*, and HF/6-31G*. Although the SMD model has been parametrized using the IEF-PCM protocol for bulk electrostatics, it may also be employed with other algorithms for solving the nonhomogeneous Poisson equation for continuum solvation calcns. in which the solute is represented by its electron d. in real space. This includes, for example, the conductor-like screening algorithm. With the 6-31G* basis set, the SMD model achieves mean unsigned errors of 0.6-1.0 kcal/mol in the solvation free energies of tested neutrals and mean unsigned errors of 4 kcal/mol on av. for ions with either Gaussian03 or GAMESS. - 59Huang, B.; von Lilienfeld, O. A. The “DNA” of Chemistry: Scalable Quantum Machine Learning with “Amons”. 2017 arXiv:1707.04146. https://arxiv.org/abs/1707.04146 (accessed March 23, 2020)Google ScholarThere is no corresponding record for this reference.
- 60Christensen Faber, H. A Python Toolkit for Quantum Machine Learning. http://doi.org/10.5281/zenodo.817332 2017.Google ScholarThere is no corresponding record for this reference.
- 61Rupp, M.; Tkatchenko, A.; Müller, K. R.; Lilienfeld, V.; Anatole, O. Fast and Accurate Modeling of Molecular Atomization Energies with Machine Learning. Phys. Rev. Lett. 2012, 108 (5), 058301, DOI: 10.1103/PhysRevLett.108.058301[Crossref], [PubMed], [CAS], Google Scholar61https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC38XisFant70%253D&md5=059b8ed186638101842c8df69173679fFast and accurate modeling of molecular atomization energies with machine learningRupp, Matthias; Tkatchenko, Alexandre; Mueller, Klaus-Robert; von Lilienfeld, O. AnatolePhysical Review Letters (2012), 108 (5), 058301/1-058301/5CODEN: PRLTAO; ISSN:0031-9007. (American Physical Society)The authors introduce a machine learning model to predict atomization energies of a diverse set of org. mols., based on nuclear charges and at. positions only. The problem of solving the mol. Schrodinger equation is mapped onto a nonlinear statistical regression problem of reduced complexity. Regression models are trained on and compared to atomization energies computed with hybrid d.-functional theory. Cross validation over more than seven thousand org. mols. yields a mean abs. error of ∼10 kcal/mol. Applicability is demonstrated for the prediction of mol. atomization potential energy curves.
- 62Hansen, K.; Biegler, F.; Ramakrishnan, R.; Pronobis, W.; Von Lilienfeld, O. A.; Müller, K. R.; Tkatchenko, A. Machine Learning Predictions of Molecular Properties: Accurate Many-Body Potentials and Nonlocality in Chemical Space. J. Phys. Chem. Lett. 2015, 6 (12), 2326– 2331, DOI: 10.1021/acs.jpclett.5b00831[ACS Full Text
], [CAS], Google Scholar
62https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2MXpsFOmsrg%253D&md5=402cfd080b9ab0aae17932dffda375cfMachine Learning Predictions of Molecular Properties: Accurate Many-Body Potentials and Nonlocality in Chemical SpaceHansen, Katja; Biegler, Franziska; Ramakrishnan, Raghunathan; Pronobis, Wiktor; von Lilienfeld, O. Anatole; Mueller, Klaus-Robert; Tkatchenko, AlexandreJournal of Physical Chemistry Letters (2015), 6 (12), 2326-2331CODEN: JPCLCD; ISSN:1948-7185. (American Chemical Society)Simultaneously accurate and efficient prediction of mol. properties throughout chem. compd. space is a crit. ingredient toward rational compd. design in chem. and pharmaceutical industries. Aiming toward this goal, we develop and apply a systematic hierarchy of efficient empirical methods to est. atomization and total energies of mols. These methods range from a simple sum over atoms, to addn. of bond energies, to pairwise interat. force fields, reaching to the more sophisticated machine learning approaches that are capable of describing collective interactions between many atoms or bonds. In the case of equil. mol. geometries, even simple pairwise force fields demonstrate prediction accuracy comparable to benchmark energies calcd. using d. functional theory with hybrid exchange-correlation functionals; however, accounting for the collective many-body interactions proves to be essential for approaching the "holy grail" of chem. accuracy of 1 kcal/mol for both equil. and out-of-equil. geometries. This remarkable accuracy is achieved by a vectorized representation of mols. (so-called Bag of Bonds model) that exhibits strong nonlocality in chem. space. In addn., the same representation allows us to predict accurate electronic properties of mols., such as their polarizability and mol. frontier orbital energies. - 63Nelder, J. A.; Mead, R. A Simplex Method for Function Minimization. Comp. J. 1965, 7 (4), 308– 313, DOI: 10.1093/comjnl/7.4.308
- 64Fukunishi, H.; Watanabe, O.; Takada, S. On the Hamiltonian Replica Exchange Method for Efficient Sampling of Biomolecular Systems: Application to Protein Structure Prediction. J. Chem. Phys. 2002, 116 (20), 9058– 9067, DOI: 10.1063/1.1472510[Crossref], [CAS], Google Scholar64https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD38XjsFKmsLo%253D&md5=7ac571a5afdd63b0b4b29cfdec06f53bOn the Hamiltonian replica exchange method for efficient sampling of biomolecular systems: Application to protein structure predictionFukunishi, Hiroaki; Watanabe, Osamu; Takada, ShojiJournal of Chemical Physics (2002), 116 (20), 9058-9067CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)Motivated by the protein structure prediction problem, we develop two variants of the Hamiltonian replica exchange methods (REMs) for efficient configuration sampling, (1) the scaled hydrophobicity REM and (2) the phantom chain REM, and compare their performance with the ordinary REM. We first point out that the ordinary REM has a shortage for the application to large systems such as biomols. and that the Hamiltonian REM, an alternative formulation of the REM, can give a remedy for it. We then propose two examples of the Hamiltonian REM that are suitable for a coarse-grained protein model. (1) The scaled hydrophobicity REM preps. replicas that are characterized by various strengths of hydrophobic interaction. The strongest interaction that mimics aq. soln. environment makes proteins folding, while weakened hydrophobicity unfolds proteins as in org. solvent. Exchange between these environments enables proteins to escape from misfolded traps and accelerate conformational search. This resembles the roles of mol. chaperone that assist proteins to fold in vivo. (2) The phantom chain REM uses replicas that allow various degrees of at. overlaps. By allowing at. overlap in some of replicas, the peptide chain can cross over itself, which can accelerate conformation sampling. Using a coarse-gained model we developed, we compute equil. probability distributions for poly-alanine 16-mer and for a small protein by these REMs and compare the accuracy of the results. We see that the scaled hydrophobicity REM is the most efficient method among the three REMs studied.
- 65Swendsen, R. H.; Wang, J.-S. Replica Monte Carlo Simulation of Spin-Glasses. Phys. Rev. Lett. 1986, 57 (21), 2607, DOI: 10.1103/PhysRevLett.57.2607[Crossref], [PubMed], [CAS], Google Scholar65https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A280%3ADC%252BC2sfotFaisQ%253D%253D&md5=6f748450b1063d67006b4e0ff4273fddReplica Monte Carlo simulation of spin glassesSwendsen; WangPhysical review letters (1986), 57 (21), 2607-2609 ISSN:.There is no expanded citation for this reference.
- 66Affentranger, R.; Tavernelli, I.; Di Iorio, E. E. A Novel Hamiltonian Replica Exchange MD Protocol to Enhance Protein Conformational Space Sampling. J. Chem. Theory Comput. 2006, 2 (2), 217– 228, DOI: 10.1021/ct050250b[ACS Full Text
], [CAS], Google Scholar
66https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD28XjsFCntw%253D%253D&md5=973b0c5a6cc780bf5edf4cf16c0d63f3A Novel Hamiltonian Replica Exchange MD Protocol to Enhance Protein Conformational Space SamplingAffentranger, Roman; Tavernelli, Ivano; Di Iorio, Ernesto E.Journal of Chemical Theory and Computation (2006), 2 (2), 217-228CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)Limited searching in the conformational space is one of the major obstacles for investigating protein dynamics by numerical approaches. For this reason, classical all-atom mol. dynamics (MD) simulations of proteins tend to be confined to local energy min., particularly when the bulk solvent is treated explicitly. To overcome this problem, the authors have developed a novel replica exchange protocol that uses modified force-field parameters to treat interparticle nonbonded potentials within the protein and between protein and solvent atoms, leaving unperturbed those relative to solvent-solvent interactions. The authors have tested the new protocol on the 18-residue-long tip of the P domain of calreticulin in an explicit solvent. With only eight replicas, the authors have been able to considerably enhance the conformational space sampled during a 100 ns simulation, compared to as many parallel classical mol. dynamics simulations of the same length or to a single one lasting 450 ns. A direct comparison between the various simulations has been possible thanks to the implementation of the weighted histogram anal. method, by which conformations simulated with modified force-field parameters can be assigned different wts. Interatom, inter-residue distances in the structural ensembles obtained with the authors' novel replica exchange approach and by classical MD simulations compare equally well with those derived from NMR data. Rare events, such as unfolding and refolding, occur with reasonable statistical frequency. Visiting of conformations characterized by very small Boltzmann wts. is also possible. Despite their low probability, such regions of the conformational space may play an important role in the search for local potential-energy min. and in dynamically controlled functions. - 67Wang, L.; Friesner, R. A.; Berne, B. J. Replica Exchange with Solute Scaling: A More Efficient Version of Replica Exchange with Solute Tempering (REST2). J. Phys. Chem. B 2011, 115 (30), 9431– 9438, DOI: 10.1021/jp204407d[ACS Full Text
], [CAS], Google Scholar
67https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3MXosFymurs%253D&md5=8acb0d670bcb70eacfc30e32d5dfb6ddReplica Exchange with Solute Scaling: A More Efficient Version of Replica Exchange with Solute Tempering (REST2)Wang, Lingle; Friesner, Richard A.; Berne, B. J.Journal of Physical Chemistry B (2011), 115 (30), 9431-9438CODEN: JPCBFK; ISSN:1520-5207. (American Chemical Society)A small change in the Hamiltonian scaling in Replica Exchange with Solute Tempering (REST) is found to improve its sampling efficiency greatly, esp. for the sampling of aq. protein solns. in which there are large-scale solute conformation changes. Like the original REST (REST1), the new version (which the authors call REST2) also bypasses the poor scaling with system size of the std. Temp. Replica Exchange Method (TREM), reducing the no. of replicas (parallel processes) from what must be used in TREM. This redn. is accomplished by deforming the Hamiltonian function for each replica in such a way that the acceptance probability for the exchange of replica configurations does not depend on the no. of explicit water mols. in the system. For proof of concept, REST2 is compared with TREM and with REST1 for the folding of the trpcage and β-hairpin in water. The comparisons confirm that REST2 greatly reduces the no. of CPUs required by regular replica exchange and greatly increases the sampling efficiency over REST1. This method reduces the CPU time required for calcg. thermodn. avs. and for the ab initio folding of proteins in explicit water. - 68Ceriotti, M.; More, J.; Manolopoulos, D. E. I-PI: A Python Interface for Ab Initio Path Integral Molecular Dynamics Simulations. Comput. Phys. Commun. 2014, 185 (3), 1019– 1026, DOI: 10.1016/j.cpc.2013.10.027[Crossref], [CAS], Google Scholar68https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3sXhvVSltL3I&md5=a7a21c0d8665ac7d2a4a01b0cc280057i-PI: A Python interface for ab initio path integral molecular dynamics simulationsCeriotti, Michele; More, Joshua; Manolopoulos, David E.Computer Physics Communications (2014), 185 (3), 1019-1026CODEN: CPHCBZ; ISSN:0010-4655. (Elsevier B.V.)Recent developments in path integral methodol. have significantly reduced the computational expense of including quantum mech. effects in the nuclear motion in ab initio mol. dynamics simulations. However, the implementation of these developments requires a considerable programming effort, which has hindered their adoption. Here we describe i-PI, an interface written in Python that has been designed to minimise the effort required to bring state-of-the-art path integral techniques to an electronic structure program. While it is best suited to first principles calcns. and path integral mol. dynamics, i-PI can also be used to perform classical mol. dynamics simulations, and can just as easily be interfaced with an empirical forcefield code. To give just one example of the many potential applications of the interface, we use it in conjunction with the CP2K electronic structure package to showcase the importance of nuclear quantum effects in high-pressure water.
- 69Kapil, V.; Rossi, M.; Marsalek, O.; Petraglia, R.; Litman, Y.; Spura, T.; Cheng, B.; Cuzzocrea, A.; Meißner, R. H.; Wilkins, D. M.; Juda, P.; Bienvenue, S. P.; Fang, W.; Kessler, J.; Poltavsky, I.; Vandenbrande, S.; Wieme, J.; Corminboeuf, C.; Kühne, T. D.; Manolopoulos, D. E.; Markland, T. E.; Richardson, J. O.; Tkatchenko, A.; Tribello, G. A.; Van Speybroeck, V. V.; Ceriotti, M. i-PI 2.0: A universal force engine for advanced molecular simulations. Comput. Phys. Commun. 2019, 236, 214– 223, DOI: 10.1016/j.cpc.2018.09.020[Crossref], [CAS], Google Scholar69https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXhvFWqtL3I&md5=02da73b3d98059a9a849ca4dc8266d99i-PI 2.0: A universal force engine for advanced molecular simulationsKapil, Venkat; Rossi, Mariana; Marsalek, Ondrej; Petraglia, Riccardo; Litman, Yair; Spura, Thomas; Cheng, Bingqing; Cuzzocrea, Alice; Meissner, Robert H.; Wilkins, David M.; Helfrecht, Benjamin A.; Juda, Przemyslaw; Bienvenue, Sebastien P.; Fang, Wei; Kessler, Jan; Poltavsky, Igor; Vandenbrande, Steven; Wieme, Jelle; Corminboeuf, Clemence; Kuhne, Thomas D.; Manolopoulos, David E.; Markland, Thomas E.; Richardson, Jeremy O.; Tkatchenko, Alexandre; Tribello, Gareth A.; Van Speybroeck, Veronique; Ceriotti, MicheleComputer Physics Communications (2019), 236 (), 214-223CODEN: CPHCBZ; ISSN:0010-4655. (Elsevier B.V.)Progress in the at.-scale modeling of matter over the past decade has been tremendous. This progress has been brought about by improvements in methods for evaluating interat. forces that work by either solving the electronic structure problem explicitly, or by computing accurate approxns. of the soln. and by the development of techniques that use the Born-Oppenheimer (BO) forces to move the atoms on the BO potential energy surface. As a consequence of these developments it is now possible to identify stable or metastable states, to sample configurations consistent with the appropriate thermodn. ensemble, and to est. the kinetics of reactions and phase transitions. All too often, however, progress is slowed down by the bottleneck assocd. with implementing new optimization algorithms and/or sampling techniques into the many existing electronic-structure and empirical-potential codes. To address this problem, we are thus releasing a new version of the i-PI software. This piece of software is an easily extensible framework for implementing advanced atomistic simulation techniques using interat. potentials and forces calcd. by an external driver code. While the original version of the code (Ceriotti et al., 2014) was developed with a focus on path integral mol. dynamics techniques, this second release of i-PI not only includes several new advanced path integral methods, but also offers other classes of algorithms. In other words, i-PI is moving towards becoming a universal force engine that is both modular and tightly coupled to the driver codes that evaluate the potential energy surface and its derivs.
- 70Frenkel, D.; Smit, B. Understanding Molecular Simulation: From Algorithms to Applications, 2nd ed.; Elsevier: London, UK, 2001.Google ScholarThere is no corresponding record for this reference.
- 71Kunsch, H. R. The Jackknife and the Bootstrap for General Stationary Observations. Ann. Stat. 1989, 17 (3), 1217– 1241, DOI: 10.1214/aos/1176347265
- 72Rathore, N.; Chopra, M.; de Pablo, J. J. Optimal Allocation of Replicas in Parallel Tempering Simulations. J. Chem. Phys. 2005, 122 (2), 024111, DOI: 10.1063/1.1831273[Crossref], [PubMed], [CAS], Google Scholar72https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD2MXislahug%253D%253D&md5=dea2c2021aaebda0c54b603858650bd4Optimal allocation of replicas in parallel tempering simulationsRathore, Nitin; Chopra, Manan; de Pablo, Juan J.Journal of Chemical Physics (2005), 122 (2), 024111/1-024111/8CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)We have studied the efficiency of parallel tempering simulations for a variety of systems including a coarse-grained protein, an atomistic model polypeptide, and the Lennard-Jones fluid. A scheme is proposed for the optimal allocation of temps. in these simulations. The method is compared to the existing empirical approaches used for this purpose. Accuracy assocd. with the computed thermodn. quantities such as sp. heat is also computed and their dependence on the trial-exchange acceptance rate is reported.
- 73Jones, E.; Pearson, P.; Travis, O. SciPy: Open Source Scientific Tools for Python. http://www.scipy.org, 2001.Google ScholarThere is no corresponding record for this reference.
- 74Vannay, L.; Meyer, B.; Petraglia, R.; Sforazzini, G.; Ceriotti, M.; Corminboeuf, C. Analyzing Fluxional Molecules Using DORI. J. Chem. Theory Comput. 2018, 14 (5), 2370– 2379, DOI: 10.1021/acs.jctc.7b01176[ACS Full Text
], [CAS], Google Scholar
74https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXlvVKlsLw%253D&md5=3fd2d742bc7bce9cf3ef9eefbd58b4dcAnalyzing Fluxional Molecules Using DORIVannay, Laurent; Meyer, Benjamin; Petraglia, Riccardo; Sforazzini, Giuseppe; Ceriotti, Michele; Corminboeuf, ClemenceJournal of Chemical Theory and Computation (2018), 14 (5), 2370-2379CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)The D. Overlap Region Indicator (DORI) is a d.-based scalar field that reveals covalent bonding patterns and non-covalent interactions in the same value range. This work goes beyond the traditional static quantum chem. use of scalar fields and illustrates the suitability of DORI for analyzing geometrical and electronic signatures in highly fluxional mol. systems. Examples include a dithiocyclophane, which possesses multiple local min. with differing extents of π-stacking interactions and a temp. dependent rotation of a mol. rotor, where the descriptor is employed to capture fingerprints of CH-π and π-π interactions. Finally, DORI serves to examine the fluctuating π-conjugation pathway of a photochromic torsional switch (PTS). Attention is also placed on post-processing the large amt. of generated data and juxtaposing DORI with a data-driven low-dimensional representation of the structural landscape. - 75Kapil, V.; Engel, E.; Rossi, M.; Ceriotti, M. Assessment of Approximate Methods for Anharmonic Free Energies. J. Chem. Theory Comput. 2019, 15 (11), 5845– 5857, DOI: 10.1021/acs.jctc.9b00596[ACS Full Text
], [CAS], Google Scholar
75https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXhsl2htL3N&md5=57a4331cecec93c76be026ec5b04b7d9Assessment of Approximate Methods for Anharmonic Free EnergiesKapil, Venkat; Engel, Edgar; Rossi, Mariana; Ceriotti, MicheleJournal of Chemical Theory and Computation (2019), 15 (11), 5845-5857CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)Quant. evaluations of the thermodn. properties of materials - most notably their stability, as measured by the free energy - must take into account the role of thermal and zero-point energy fluctuations. While these effects can easily be estd. within a harmonic approxn., corrections arising from the anharmonic nature of the interat. potential are often crucial and require computationally costly path integral simulations. Consequently, different approx. frameworks for computing affordable ests. of the anharmonic free energies have been developed over the years. Understanding which of the approxns. involved are justified for a given system, and therefore choosing the most suitable method, is complicated by the lack of comparative benchmarks. To facilitate this choice we assess the accuracy and efficiency of some of the most commonly used approx. methods - the independent mode framework, the vibrational SCF and self-consistent phonons - by comparing the anharmonic correction to the Helmholtz free energy against ref. path integral calcns. These benchmarks are performed for a diverse set of systems, ranging from simple quasi-harmonic solids to flexible mol. crystals with freely-rotating units. Our results suggest that for simple solids such as allotropes of carbon these methods yield results that are in excellent agreement with the ref. calcns., at a considerably lower computational cost. For more complex mol. systems such as polymorphs of ice and paracetamol the methods do not consistently provide a reliable approxn. of the anharmonic correction. Despite substantial cancellation of errors when comparing the stability of different phases, we do not observe a systematic improvement over the harmonic approxn. even for relative free-energies. Our results suggest that efforts towards obtaining computationally-feasible anharmonic free-energies for flexible mol. solids should therefore be directed towards reducing the expense of path integral methods. - 76Bürgi, T.; Baiker, A. Conformational Behavior of Cinchonidine in Different Solvents: A Combined NMR and Ab Initio Investigation. J. Am. Chem. Soc. 1998, 120 (49), 12920– 12926, DOI: 10.1021/ja982466b[ACS Full Text
], [CAS], Google Scholar
76https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADyaK1cXnsFKmu78%253D&md5=6cac8e8ea2a3adf51ac14dcf7e63b3bdConformational Behavior of Cinchonidine in Different Solvents: A Combined NMR and ab Initio InvestigationBuergi, Thomas; Baiker, AlfonsJournal of the American Chemical Society (1998), 120 (49), 12920-12926CODEN: JACSAT; ISSN:0002-7863. (American Chemical Society)The conformation of cinchonidine in soln. has been investigated by NMR techniques as well as theor. Three conformers of cinchonidine are shown to be substantially populated at room temp., Closed(1), Closed(2), and Open(3), with the latter being the most stable in apolar solvents. The stability of the closed conformers relative to that of Open(3), however, increases with solvent polarity. In polar solvents the three conformers have similar energy. The relationship between relative energies and the dielec. const. of the solvent is not linear but resembles the form of an Onsager function. Ab initio and d. functional reaction field calcns. using cavity shapes detd. by an isodensity surface are in good agreement with expt. for solvents which do not show strong specific interaction with cinchonidine. The role of the conformational behavior of cinchonidine is illustrated using the example of the platinum-catalyzed enantioselective hydrogenation of ketopantolactone in different solvents. - 77Huang, M.; Dissanayake, T.; Kuechler, E.; Radak, B. K.; Lee, T. S.; Giese, T. J.; York, D. M. A Multidimensional B-Spline Correction for Accurate Modeling Sugar Puckering in QM/MM Simulations. J. Chem. Theory Comput. 2017, 13 (9), 3975– 3984, DOI: 10.1021/acs.jctc.7b00161[ACS Full Text
], [CAS], Google Scholar
77https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2sXht1Oht7%252FF&md5=b978922b7b7dbb942a0f5d3c90a9bfdeA Multidimensional B-Spline Correction for Accurate Modeling Sugar Puckering in QM/MM SimulationsHuang, Ming; Dissanayake, Thakshila; Kuechler, Erich; Radak, Brian K.; Lee, Tai-Sung; Giese, Timothy J.; York, Darrin M.Journal of Chemical Theory and Computation (2017), 13 (9), 3975-3984CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)The computational efficiency of approx. quantum mech. methods allows their use for the construction of multidimensional reaction free energy profiles. It has recently been demonstrated that quantum models based on the NDDO approxn. have difficulty modeling deoxyribose and ribose sugar ring puckers and thus limit their predictive value in the study of RNA and DNA systems. A method has been introduced in our previous work to improve the description of the sugar puckering conformational landscape that uses a multidimensional B-spline correction map (BMAP correction) for systems involving intrinsically coupled torsion angles. This method greatly improved the adiabatic potential energy surface profiles of DNA and RNA sugar rings relative to high-level ab initio methods even for highly problematic NDDO-based models. In the present work, a BMAP correction is developed, implemented, and tested in mol. dynamics simulations using the AM1/d-PhoT semiempirical Hamiltonian for biol. phosphoryl transfer reactions. Results are presented for gas-phase adiabatic potential energy surfaces of RNA transesterification model reactions and condensed-phase QM/MM free energy surfaces for nonenzymic and RNase A-catalyzed transesterification reactions. The results show that the BMAP correction is stable, efficient, and leads to improvement in both the potential energy and free energy profiles for the reactions studied, as compared with ab initio and exptl. ref. data. Exploration of the effect of the size of the quantum mech. region indicates the best agreement with exptl. reaction barriers occurs when the full CpA dinucleotide substrate is treated quantum mech. with the sugar pucker correction.
Supporting Information
Supporting Information
ARTICLE SECTIONSThe Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jctc.0c00100.
Histogram of the cost of computations, learning curves, energy comparisons, basin potential energies, energies scatter plot, free energy landscapes and convergences, and detailed information on the training procedure (PDF)
Terms & Conditions
Most electronic Supporting Information files are available without a subscription to ACS Web Editions. Such files may be downloaded by article for research use (if there is a public use license linked to the relevant article, that license may permit other uses). Permission may be obtained from ACS for other uses through requests via the RightsLink permission system: http://pubs.acs.org/page/copyright/permissions.html.