Machine Learning Adaptive Basis Sets for Efficient Large Scale Density Functional Theory SimulationClick to copy article linkArticle link copied!
- Ole Schütt
- Joost VandeVondele*Joost VandeVondele*E-mail: [email protected]Department of Materials, ETH Zürich, 8093 Zürich, SwitzerlandSwiss National Supercomputing Centre (CSCS), 6900 Lugano, SwitzerlandMore by Joost VandeVondele
Abstract
It is chemically intuitive that an optimal atom centered basis set must adapt to its atomic environment, for example by polarizing toward nearby atoms. Adaptive basis sets of small size can be significantly more accurate than traditional atom centered basis sets of the same size. The small size and well conditioned nature of these basis sets leads to large saving in computational cost, in particular in a linear scaling framework. Here, it is shown that machine learning can be used to predict such adaptive basis sets using local geometrical information only. As a result, various properties of standard DFT calculations can be easily obtained at much lower costs, including nuclear gradients. In our approach, a rotationally invariant parametrization of the basis is obtained by employing a potential anchored on neighboring atoms to ultimately construct a rotation matrix that turns a traditional atom centered basis set into a suitable adaptive basis set. The method is demonstrated using MD simulations of liquid water, where it is shown that minimal basis sets yield structural properties in fair agreement with basis set converged results, while reducing the computational cost in the best case by a factor of 200 and the required flops by 4 orders of magnitude. Already a very small training set yields satisfactory results as the variational nature of the method provides robustness.
1. Introduction
2. Methods
2.1. Polarized Atomic Orbitals





2.2. Potential Parametrization



Explicit Form of the Potential Terms




2.3. Machine Learning


Figure 1
Figure 1. Overview of the PAO-ML scheme for using the potential parametrization and machine learning to calculate the PAO basis from given atomic positions.
2.4. Analytic Forces

2.5. Training Data Acquisition
Regularization


3. Results
3.1. Learning Curve
Figure 2
Figure 2. Learning curve showing the decreasing error of PAO-ML (blue) with increased training set size. For comparison the error of a variationally optimized PAO basis (green) and a traditional minimal SZV-MOLOPT-GTH (red) basis set are shown. With very little training data, the variational limit is approached by the ML method.
3.2. Consistency of Energy and Forces
Figure 3
Figure 3. Energy fluctuation during a series of MD simulation of a water dimer using the PAO-ML scheme. The simulations were conducted in the NVE ensemble using different time steps Δt to demonstrate the consistency of the forces and thus the controllability of the integration error.
3.3. PAO-ML Molecular Dynamics of Liquid Water
Figure 4
Figure 4. Shown are oxygen–oxygen pair correlation functions for liquid water at 300 K. As reference the experimental (green, ref (63)) and TZV2P-MOLOPT-GTH basis sets (blue) results are shown. The SZV-MOLOPT-GTH curve (red) and DFTB (orange) are examples of results typically obtained from a minimal basis sets. The adaptive basis set PAO-ML (black) reproduces the reference (TZV2P) better than any of the alternative minimal basis set methods.
3.4. Check for Unphysical Minima
3.5. PAO-ML Speedup
nodes | 64 | 100 | 169 | 256 | 400 |
---|---|---|---|---|---|
PAO-ML | |||||
full | 87 | 58 | 41 | 33 | 24 |
mult | 23 | 17 | 13 | 11 | 8 |
DZVP-MOLOPT-GTH | |||||
full | 5215 | 2765 | 1996 | 1840 | 1201 |
mult | 5036 | 2655 | 1922 | 1779 | 1165 |
The PAO-ML method outperforms a standard DFT run with a DZVP-MOLOPT-GTH basis by a factor of at least 50×.
3.6. Computational Setup
4. Discussion and Conclusions
Supporting Information
The Supporting Information is available free of charge on the ACS Publications website at DOI: 10.1021/acs.jctc.8b00378.
Representative input files for most simulations (ZIP)
Terms & Conditions
Most electronic Supporting Information files are available without a subscription to ACS Web Editions. Such files may be downloaded by article for research use (if there is a public use license linked to the relevant article, that license may permit other uses). Permission may be obtained from ACS for other uses through requests via the RightsLink permission system: http://pubs.acs.org/page/copyright/permissions.html.
References
This article references 71 other publications.
- 1Goedecker, S. Linear scaling electronic structure methods. Rev. Mod. Phys. 1999, 71, 1085, DOI: 10.1103/RevModPhys.71.1085Google Scholar1Linear scaling electronic structure methodsGoedecker, StefanReviews of Modern Physics (1999), 71 (4), 1085-1123CODEN: RMPHAT; ISSN:0034-6861. (American Physical Society)A review with many refs. Methods exhibiting linear scaling with respect to the size of the system, the so-called O(N) methods, are an essential tool for the calcn. of the electronic structure of large systems contg. many atoms. They are based on algorithms that take advantage of the decay properties of the d. matrix. In this article the phys. decay properties of the d. matrix are be studied for both metals and insulators. Several strategies for constructing O(N) algorithms are presented and critically examd. Some issues that are relevant only for self-consistent O(N) methods, such as the calcn. of the Hartree potential and mixing issues, are also discussed. Some typical applications of O(N) methods are briefly described.
- 2Bowler, D. R.; Miyazaki, T.
methods in electronic structure calculations. Rep. Prog. Phys. 2012, 75, 036503, DOI: 10.1088/0034-4885/75/3/036503
Google Scholar2(N) methods in electronic structure calculationsBowler, D. R.; Miyazaki, T.Reports on Progress in Physics (2012), 75 (3), 036503/1-036503/43CODEN: RPPHAG; ISSN:0034-4885. (Institute of Physics Publishing)A review. Linear-scaling methods, or O(N) methods, have computational and memory requirements which scale linearly with the no. of atoms in the system, N, in contrast to std. approaches which scale with the cube of the no. of atoms. These methods, which rely on the short-ranged nature of electronic structure, will allow accurate, ab initio simulations of systems of unprecedented size. The theory behind the locality of electronic structure is described and related to phys. properties of systems to be modelled, along with a survey of recent developments in real-space methods which are important for efficient use of high-performance computers. The linear-scaling methods proposed to date can be divided into seven different areas, and the applicability, efficiency and advantages of the methods proposed in these areas are then discussed. The applications of linear-scaling methods, as well as the implementations available as computer programs, are considered. Finally, the prospects for and the challenges facing linear-scaling methods are discussed. - 3VandeVondele, J.; Borštnik, U.; Hutter, J. Linear Scaling Self-Consistent Field Calculations with Millions of Atoms in the Condensed Phase. J. Chem. Theory Comput. 2012, 8, 3565– 3573, DOI: 10.1021/ct200897xGoogle Scholar3Linear Scaling Self-Consistent Field Calculations with Millions of Atoms in the Condensed PhaseVandeVondele, Joost; Borstnik, Urban; Hutter, JurgJournal of Chemical Theory and Computation (2012), 8 (10), 3565-3573CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)The applicability and performance of a linear scaling algorithm is investigated for three-dimensional condensed phase systems. A simple but robust approach based on the matrix sign function is employed together with a thresholding matrix multiplication that does not require a prescribed sparsity pattern. Semiempirical methods and d. functional theory have been tested. We demonstrate that self-consistent calcns. with 1 million atoms are feasible for simple systems. With this approach, the computational cost of the calcn. depends strongly on basis set quality. In the current implementation, high quality calcns. for dense systems are limited to a few hundred thousand atoms. We report on the sparsities of the involved matrixes as obtained at convergence and for intermediate iterations. We investigate how detg. the chem. potential impacts the computational cost for very large systems.
- 4Bowler, D. R.; Miyazaki, T. Calculations for millions of atoms with density functional theory: linear scaling shows its potential. J. Phys.: Condens. Matter 2010, 22, 074207, DOI: 10.1088/0953-8984/22/7/074207Google Scholar4Calculations for millions of atoms with density functional theory: linear scaling shows its potentialBowler, D. R.; Miyazaki, T.Journal of Physics: Condensed Matter (2010), 22 (7), 074207/1-0074207/6CODEN: JCOMEL; ISSN:0953-8984. (Institute of Physics Publishing)An overview of the CONQUEST linear scaling d. functional theory (DFT) code is given, focusing particularly on the scaling behavior on modern high-performance computing platforms. We demonstrate that essentially perfect linear scaling and weak parallel scaling (with fixed no. of atoms per processor core) can be achieved, and that DFT calcns. on millions of atoms are now possible.
- 5Mulliken, R. S. Criteria for the Construction of Good Self-Consistent-Field Molecular Orbital Wave Functions, and the Significance of LCAO-MO Population Analysis. J. Chem. Phys. 1962, 36, 3428– 3439, DOI: 10.1063/1.1732476Google Scholar5Criteria for the construction of good self-consistent-field molecular orbital wave functions, and the significance of L.C.A.O.M.O. population analysisMulliken, R. S.Journal of Chemical Physics (1962), 36 (), 3428-39CODEN: JCPSA6; ISSN:0021-9606.Criteria for the optimal choice of finite linear combinations of Slater-type orbitals (S.T.O.) adequate to approx. closely S.C.F. M.O. were examd. in the light of computations on HF and other mols. Some aspects of the A.O. (generalized Heitler-London) method were discussed. The inherent limitations on the meaning of charges on atoms in a mol., or of degree of ionic character, were discussed. Unacceptable at. charges are found from an L.C.A.O. mol. orbital population analysis made on S.C.F. M.O. wave functions, approximated by using unbalanced S.T.O. sets, while acceptable results are obtained with judiciously balanced and sufficiently complete S.T.O. sets. The effects of insufficiently complete and unbalanced S.T.O. sets on the computed dipole moments of S.C.F. M.O. wave functions were discussed, with examples.
- 6Davidson, E. R. Electronic Population Analysis of Molecular Wavefunctions. J. Chem. Phys. 1967, 46, 3320– 3324, DOI: 10.1063/1.1841219Google Scholar6Electronic population analysis of molecular wavefunctionsDavidson, Ernest RoyJournal of Chemical Physics (1967), 46 (9), 3320-4CODEN: JCPSA6; ISSN:0021-9606.A general method for carrying out population analysis of wavefunctions calcd. with arbitrary basis sets is presented. The difficulties in defining orbital populations is discussed. Results are presented for the sequence BF, CO, and N2 which indicate that back-transfer of charge in the π bond cancels the normal transfer of the charge in the σ bond. Contrary to popular belief, the more electroneg. element has the larger degree of hybridization in each case. The amt. of promotion of 2s electrons is greater on the less-electroneg. element, as expected.
- 7Roby, K. R. Quantum theory of chemical valence concepts. Mol. Phys. 1974, 27, 81– 104, DOI: 10.1080/00268977400100071Google ScholarThere is no corresponding record for this reference.
- 8Heinzmann, R.; Ahlrichs, R. Population analysis based on occupation numbers of modified atomic orbitals (MAOs). Theor. Chim. Acta. 1976, 42, 33– 45, DOI: 10.1007/BF00548289Google Scholar8Population analysis based on occupation numbers of modified atomic orbitals (MAOs)Heinzmann, Rolf; Ahlrichs, ReinhartTheoretica Chimica Acta (1976), 42 (1), 33-45CODEN: TCHAAM; ISSN:0040-5744.A new interpretational scheme is proposed for the anal. of mol. wavefunctions. Starting from the mol. d. operator a minimal set of MAOs is constructed from the requirement that the MOs can be represented as closely as possible by the MAOs. The MAOs are used to compute at. occupation nos. N and shared electron nos. σ. The mol. d. is then discussed in terms of N and σ. This approach has the following advantages: (1) it is generally applicable, (2) the quantities N and σ are virtually basis set independent, (3) the quantities N and σ fulfil the intuitively expected boundary conditions, (4) the simultaneous consideration of N and σ allows for a more reliable description of chem. bonding than consideration of at. charges only.
- 9Ehrhardt, C.; Ahlrichs, R. Population analysis based on occupation numbers II. Relationship between shared electron numbers and bond energies and characterization of hypervalent contributions. Theor. Chim. Acta. 1985, 68, 231– 245, DOI: 10.1007/BF00526774Google Scholar9Population analysis based on occupation numbers. II. Relationship between shared electron numbers and bond energies and characterization of hypervalent contributionsEhrhardt, Claus; Ahlrichs, ReinhartTheoretica Chimica Acta (1985), 68 (3), 231-45CODEN: TCHAAM; ISSN:0040-5744.The population anal. based on occupation nos. is briefly reviewed. A new way is proposed to det. modified AOs and to characterize hypervalent contributions. This is discussed in application to the mols. NSF, NSF3, SF6, OPCl, OPCl2, O2PCl, SO2, and ClO4-. The connection was studied between shared electron nos. σ - considered as a measure of covalent bond strength - and bond energies. The σ is a reliable measure of bond energies.
- 10Reed, A. E.; Curtiss, L. A.; Weinhold, F. Intermolecular interactions from a natural bond orbital, donor-acceptor viewpoint. Chem. Rev. 1988, 88, 899– 926, DOI: 10.1021/cr00088a005Google Scholar10Intermolecular interactions from a natural bond orbital, donor-acceptor viewpointReed, Alan E.; Curtiss, Larry A.; Weinhold, FrankChemical Reviews (Washington, DC, United States) (1988), 88 (6), 899-926CODEN: CHREAY; ISSN:0009-2665.A review with >135 refs. is given with discussion of natural bond orbital (NBO) anal., intermol. interaction models based on NBO anal., relation of donor-acceptor and electrostatic models.
- 11Lee, M. S.; Head-Gordon, M. Extracting polarized atomic orbitals from molecular orbital calculations. Int. J. Quantum Chem. 2000, 76, 169– 184, DOI: 10.1002/(SICI)1097-461X(2000)76:2<169::AID-QUA7>3.0.CO;2-GGoogle Scholar11Extracting polarized atomic orbitals from molecular orbital calculationsLee, Michael S.; Head-Gordon, MartinInternational Journal of Quantum Chemistry (2000), 76 (2), 169-184CODEN: IJQCB2; ISSN:0020-7608. (John Wiley & Sons, Inc.)We present a class of methods for extg. a polarized AO (EPAO) minimal basis set from a converged MO calcn. Unlike minimal basis sets obtained from previous approaches, EPAOs rigorously contain the occupied MO space. EPAOs achieve this exactness because their spatial extent is not restricted. Nonetheless, EPAOs are optimally localized with respect to a localization criterion and are essentially single-centered. EPAOs provide an alternative scheme for partitioning the electron d. into at. subspaces. Therefore, they can be used to det. at. and chem. group properties such as charge populations. Since EPAOs provide a compact description to the occupied space, they may have other computational applications such as in local correlation methods. Addnl., the EPAOs give a description of valence antibonding orbitals that may be appropriate for nondynamical electron correlation.
- 12Mayer, I. Orthogonal effective atomic orbitals in the topological theory of atoms. Can. J. Chem. 1996, 74, 939– 942, DOI: 10.1139/v96-103Google ScholarThere is no corresponding record for this reference.
- 13Cioslowski, J.; Liashenko, A. Atomic orbitals in molecules. J. Chem. Phys. 1998, 108, 4405– 4412, DOI: 10.1063/1.475853Google ScholarThere is no corresponding record for this reference.
- 14Lu, W. C.; Wang, C. Z.; Schmidt, M. W.; Bytautas, L.; Ho, K. M.; Ruedenberg, K. Molecule intrinsic minimal basis sets. I. Exact resolution of ab initio optimized molecular orbitals in terms of deformed atomic minimal-basis orbitals. J. Chem. Phys. 2004, 120, 2629– 2637, DOI: 10.1063/1.1638731Google Scholar14Molecule intrinsic minimal basis sets. I. Exact resolution of ab initio optimized molecular orbitals in terms of deformed atomic minimal-basis orbitalsLu, W. C.; Wang, C. Z.; Schmidt, M. W.; Bytautas, L.; Ho, K. M.; Ruedenberg, K.Journal of Chemical Physics (2004), 120 (6), 2629-2637CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)A method is presented for expressing the occupied SCF orbitals of a mol. exactly in terms of chem. deformed at. minimal-basis-set orbitals that deviate as little as possible from free-atom SCF minimal-basis orbitals. The MOs referred to are the exact SCF orbitals, the free-atom orbitals referred to are the exact at. SCF orbitals, and the formulation of the deformed "quasiat. minimal-basis-sets" is independent of the calculational AO basis used. The resulting resoln. of MOs in terms of quasiat. minimal basis set orbitals is therefore intrinsic to the exact mol. wave functions. The deformations are analyzed in terms of interat. contributions. The Mulliken population anal. is formulated in terms of the quasiat. minimal-basis orbitals. In the virtual SCF orbital space the method leads to a quant. ab initio formulation of the qual. model of virtual valence orbitals, which are useful for calcg. electron correlation and the interpretation of reactions. The method is applicable to Kohn-Sham d. functional theory orbitals and is easily generalized to valence MCSCF orbitals.
- 15Laikov, D. N. Intrinsic minimal atomic basis representation of molecular electronic wavefunctions. Int. J. Quantum Chem. 2011, 111, 2851– 2867, DOI: 10.1002/qua.22767Google Scholar15Intrinsic minimal atomic basis representation of molecular electronic wavefunctionsLaikov, Dimitri N.International Journal of Quantum Chemistry (2011), 111 (12), 2851-2867CODEN: IJQCB2; ISSN:0020-7608. (John Wiley & Sons, Inc.)The problem of finding an effective minimal at. basis that spans the exact occupied wavefunctions of a mean-field theory at a given mol. geometry, which has a no. of special properties, is studied and a new general procedure is developed that (1) solves for a raw minimal set of strongly atom-centered functions-products of spherical harmonics and mol.-optimized radial parts-that approx. span the occupied mol. wavefunctions and minimize the sum of their energies, (2) uses projection operators to get a new set of deformed atom-centered functions that exactly span the occupied space and fall into core and valence subsets, (3) applies a new zero-bond-dipole orthogonalization scheme to the core-orthogonalized valence subset so that for each two-center product of these functions the projection of its dipole moment along the line going through the two centers is zero. The resulting effective minimal at. basis is intrinsic to the mol. problem and does not need a free-atoms input. Some interesting features of the zero-bond-dipole orthogonalization are showing up in the at. population anal. of a diverse set of mols. The new procedure may be useful for the interpretation of electronic structure, for the construction of model Hamiltonians in terms of transferable mol. integrals, and for the definition of active valence space in the treatment of electron correlation. © 2010 Wiley Periodicals, Inc. Int J Quantum Chem, 2011.
- 16Knizia, G. Intrinsic Atomic Orbitals: An Unbiased Bridge between Quantum Theory and Chemical Concepts. J. Chem. Theory Comput. 2013, 9, 4834– 4843, DOI: 10.1021/ct400687bGoogle Scholar16Intrinsic Atomic Orbitals: An Unbiased Bridge between Quantum Theory and Chemical ConceptsKnizia, GeraldJournal of Chemical Theory and Computation (2013), 9 (11), 4834-4843CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)Modern quantum chem. can make quant. predictions on an immense array of chem. systems. However, the interpretation of those predictions is often complicated by the complex wave function expansions used. Here we show that an exceptionally simple algebraic construction allows for defining at. core and valence orbitals, polarized by the mol. environment, which can exactly represent SCF wave functions. This construction provides an unbiased and direct connection between quantum chem. and empirical chem. concepts, and can be used, for example, to calc. the nature of bonding in mols., in chem. terms, from first principles. In particular, we find consistency with electronegativities (χ), C 1s core-level shifts, resonance substituent parameters (σR), Lewis structures, and oxidn. states of transition-metal complexes.
- 17Adams, W. H. On the Solution of the Hartree-Fock Equation in Terms of Localized Orbitals. J. Chem. Phys. 1961, 34, 89– 102, DOI: 10.1063/1.1731622Google Scholar17The solution of the Hartree-Fock equation in terms of localized orbitalsAdams, William H.Journal of Chemical Physics (1961), 34 (), 89-102CODEN: JCPSA6; ISSN:0021-9606.The Hartree-Fock method was discussed, with emphasis on the transformation properties of the Hartree-Fock equation. This equation can be solved in terms of nonorthogonal one-electron functions. In some cases it may be more convenient to choose such solns. Equations were developed which define the localized one-electron functions. For a system of closedshell atoms or ions, it was suggested that the localized orbitals of each atom or ion can be expanded in terms of functions centered on its nucleus. This was based on the success of the ionic theory of crystals. Because of the symmetry of a crystal, use of the localized orbitals could lead to expressions for the 1st-order, Hartree-Fock d. matrix and the Hartree-Fock energy of a crystal.
- 18Adams, W. H. Orbital Theories of Electronic Structure. J. Chem. Phys. 1962, 37, 2009– 2018, DOI: 10.1063/1.1733420Google ScholarThere is no corresponding record for this reference.
- 19Adams, W. Distortion of interacting atoms and ions. Chem. Phys. Lett. 1971, 12, 295– 298, DOI: 10.1016/0009-2614(71)85068-6Google ScholarThere is no corresponding record for this reference.
- 20Lee, M. S.; Head-Gordon, M. Polarized atomic orbitals for self-consistent field electronic structure calculations. J. Chem. Phys. 1997, 107, 9085– 9095, DOI: 10.1063/1.475199Google Scholar20Polarized atomic orbitals for self-consistent field electronic structure calculationsLee, Michael S.; Head-Gordon, MartinJournal of Chemical Physics (1997), 107 (21), 9085-9095CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)We present a new SCF approach which, given a large "secondary" basis set of AOs, variationally optimizes MOs in terms of a small "primary" basis set of distorted AOs, which are simultaneously optimized. If the primary basis is taken as a minimal basis, the resulting functions are termed polarized AOs (PAO's) because they are valence (or core) AOs which have distorted or polarized in an optimal way for their mol. environment. The PAO's derive their flexibility from the fact that they are formed from atom-centered linear-combinations of the larger set of secondary AOs. The variational conditions satisfied by PAO's are defined, and an iterative method for performing a PAO-SCF calcn. is introduced. We compare the PAO-SCF approach against full SCF calcns. for the energies, dipoles, and mol. geometries of various mols. The PAO's are potentially useful for studying large systems that are currently intractable with larger than minimal basis sets, as well as offering potential interpretative benefits relative to calcns. in extended basis sets.
- 21Lee, M. S.; Head-Gordon, M. Absolute and relative energies from polarized atomic orbital self-consistent field calculations and a second order correction.: Convergence with size and composition of the secondary basis. Comput. Chem. 2000, 24, 295– 301, DOI: 10.1016/S0097-8485(99)00086-8Google Scholar21Absolute and relative energies from polarized atomic orbital self-consistent field calculations and a second order correction. Convergence with size and composition of the secondary basisLee, Michael S.; Head-Gordon, MartinComputers & Chemistry (Oxford) (2000), 24 (3,4), 295-301CODEN: COCHDK; ISSN:0097-8485. (Elsevier Science Ltd.)Polarized AOs (PAO's) are mol.-adapted minimal basis functions that are variationally obtained as an atom-blocked transformation from a conventional extended basis set, as a Hartree-Fock calcn. is performed in the PAO basis. This approxn. yields a higher energy than a HF calcn. performed in the extended basis, although the two results converge to the same limit as the extended basis approaches completeness on each atom. To test the rate of convergence, PAO-HF calcns. were performed using cc-pVXZ and aug-cc-pVXZ basis sets for the water monomer and dimer, and six substituted ethylenes. The results show that the quality of PAO calcns. converges smoothly with X. The use of augmented functions is recommended. To correct a PAO-HF calcn. for residual deficiencies, a noniterative second order correction is introduced. This correction corresponds to an energy-weighted steepest descent step, and substantially improves the quality of PAO energies.
- 22Berghold, G.; Parrinello, M.; Hutter, J. Polarized atomic orbitals for linear scaling methods. J. Chem. Phys. 2002, 116, 1800– 1810, DOI: 10.1063/1.1431270Google ScholarThere is no corresponding record for this reference.
- 23Bowler, D. R.; Miyazaki, T.; Gillan, M. J. Recent progress in linear scaling ab initio electronic structure techniques. J. Phys.: Condens. Matter 2002, 14, 2781, DOI: 10.1088/0953-8984/14/11/303Google Scholar23Recent progress in linear scaling ab initio electronic structure techniquesBowler, D. R.; Miyazaki, T.; Gillan, M. J.Journal of Physics: Condensed Matter (2002), 14 (11), 2781-2798CODEN: JCOMEL; ISSN:0953-8984. (Institute of Physics Publishing)A review. We describe recent progress in developing linear scaling ab initio electronic structure methods, referring in particular to our highly parallel code CONQUEST. After reviewing the state of the field, we present the basic ideas underlying almost all linear scaling methods, and discuss specific practical details of the implementation. We also note the connection between linear scaling methods and embedding techniques.
- 24Torralba, A. S.; Todorović, M.; Brázdová, V.; Choudhury, R.; Miyazaki, T.; Gillan, M. J.; Bowler, D. R. Pseudo-atomic orbitals as basis sets for the
DFT code CONQUEST. J. Phys.: Condens. Matter 2008, 20 (29), 294206, DOI: 10.1088/0953-8984/20/29/294206
Google Scholar24Pseudo-atomic orbitals as basis sets for the O(N) DFT code CONQUESTTorralba, A. S.; Todorovic, M.; Brazdova, V.; Choudhury, R.; Miyazaki, T.; Gillan, M. J.; Bowler, D. R.Journal of Physics: Condensed Matter (2008), 20 (29), 294206/1-294206/8CODEN: JCOMEL; ISSN:0953-8984. (Institute of Physics Publishing)Various aspects of the implementation of pseudo-AOs (PAOs) as basis functions for the linear scaling CONQUEST code are presented. Preliminary results for the assignment of a large set of PAOs to a smaller space of support functions are encouraging, and an important related proof on the necessary symmetry of the support functions is shown. Details of the generation and integration schemes for the PAOs are also given. - 25Skylaris, C.-K.; Haynes, P. D.; Mostofi, A. A.; Payne, M. C. Introducing ONETEP: Linear-scaling density functional simulations on parallel computers. J. Chem. Phys. 2005, 122, 084119, DOI: 10.1063/1.1839852Google Scholar25Introducing ONETEP: Linear-scaling density functional simulations on parallel computersSkylaris, Chris-Kriton; Haynes, Peter D.; Mostofi, Arash A.; Payne, Mike C.Journal of Chemical Physics (2005), 122 (8), 084119/1-084119/10CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)We present ONETEP (order-N electronic total energy package), a d. functional program for parallel computers whose computational cost scales linearly with the no. of atoms and the no. of processors. ONETEP is based on our reformulation of the plane wave pseudopotential method which exploits the electronic localization that is inherent in systems with a nonvanishing band gap. We summarize the theor. developments that enable the direct optimization of strictly localized quantities expressed in terms of a delocalized plane wave basis. These same localized quantities lead us to a phys. way of dividing the computational effort among many processors to allow calcns. to be performed efficiently on parallel supercomputers. We show with examples that ONETEP achieves excellent speedups with increasing nos. of processors and confirm that the time taken by ONETEP as a function of increasing no. of atoms for a given no. of processors is indeed linear. What distinguishes our approach is that the localization is achieved in a controlled and math. consistent manner so that ONETEP obtains the same accuracy as conventional cubic-scaling plane wave approaches and offers fast and stable convergence. We expect that calcns. with ONETEP have the potential to provide quant. theor. predictions for problems involving thousands of atoms such as those often encountered in nanoscience and biophysics.
- 26Skylaris, C.-K.; Mostofi, A. A.; Haynes, P. D.; Diéguez, O.; Payne, M. C. Nonorthogonal generalized Wannier function pseudopotential plane-wave method. Phys. Rev. B: Condens. Matter Mater. Phys. 2002, 66, 035119, DOI: 10.1103/PhysRevB.66.035119Google Scholar26Nonorthogonal generalized Wannier function pseudopotential plane-wave methodSkylaris, Chris-Kriton; Mostofi, Arash A.; Haynes, Peter D.; Dieguez, Oswaldo; Payne, Mike C.Physical Review B: Condensed Matter and Materials Physics (2002), 66 (3), 035119/1-035119/12CODEN: PRBMDO; ISSN:0163-1829. (American Physical Society)We present a reformulation of the plane-wave pseudopotential method for insulators. This new approach allows us to perform d.-functional calcns. by solving directly for "nonorthogonal generalized Wannier functions" rather than extended Bloch states. We outline the theory on which our method is based and present test calcns. on a variety of systems. Comparison of our results with a std. plane-wave code shows that they are equiv. Apart from the usual advantages of the plane-wave approach such as the applicability to any lattice symmetry and the high accuracy, our method also benefits from the localization properties of our functions in real space. The localization of all our functions greatly facilitates the future extension of our method to linear-scaling schemes or calcns. of the elec. polarization of cryst. insulators.
- 27Mohr, S.; Ratcliff, L. E.; Genovese, L.; Caliste, D.; Boulanger, P.; Goedecker, S.; Deutsch, T. Accurate and efficient linear scaling DFT calculations with universal applicability. Phys. Chem. Chem. Phys. 2015, 17, 31360– 31370, DOI: 10.1039/C5CP00437CGoogle Scholar27Accurate and efficient linear scaling DFT calculations with universal applicabilityMohr, Stephan; Ratcliff, Laura E.; Genovese, Luigi; Caliste, Damien; Boulanger, Paul; Goedecker, Stefan; Deutsch, ThierryPhysical Chemistry Chemical Physics (2015), 17 (47), 31360-31370CODEN: PPCPFQ; ISSN:1463-9076. (Royal Society of Chemistry)D. functional theory calcns. are computationally extremely expensive for systems contg. many atoms due to their intrinsic cubic scaling. This fact has led to the development of so-called linear scaling algorithms during the last few decades. In this way it becomes possible to perform ab initio calcns. for several tens of thousands of atoms within reasonable walltimes. However, even though the use of linear scaling algorithms is phys. well justified, their implementation often introduces some small errors. Consequently most implementations offering such a linear complexity either yield only a limited accuracy or, if one wants to go beyond this restriction, require a tedious fine tuning of many parameters. In our linear scaling approach within the BigDFT package, we were able to overcome this restriction. Using an ansatz based on localized support functions expressed in an underlying Daubechies wavelet basis - which offers ideal properties for accurate linear scaling calcns. - we obtain an amazingly high accuracy and a universal applicability while still keeping the possibility of simulating large system with linear scaling walltimes requiring only a moderate demand of computing resources. We prove the effectiveness of our method on a wide variety of systems with different boundary conditions, for single-point calcns. as well as for geometry optimizations and mol. dynamics.
- 28Mohr, S.; Ratcliff, L. E.; Boulanger, P.; Genovese, L.; Caliste, D.; Deutsch, T.; Goedecker, S. Daubechies wavelets for linear scaling density functional theory. J. Chem. Phys. 2014, 140, 204110, DOI: 10.1063/1.4871876Google Scholar28Daubechies wavelets for linear scaling density functional theoryMohr, Stephan; Ratcliff, Laura E.; Boulanger, Paul; Genovese, Luigi; Caliste, Damien; Deutsch, Thierry; Goedecker, StefanJournal of Chemical Physics (2014), 140 (20), 204110/1-204110/16CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)We demonstrate that Daubechies wavelets can be used to construct a minimal set of optimized localized adaptively contracted basis functions in which the Kohn-Sham orbitals can be represented with an arbitrarily high, controllable precision. Ground state energies and the forces acting on the ions can be calcd. in this basis with the same accuracy as if they were calcd. directly in a Daubechies wavelets basis, provided that the amplitude of these adaptively contracted basis functions is sufficiently small on the surface of the localization region, which is guaranteed by the optimization procedure described in this work. This approach reduces the computational costs of d. functional theory calcns., and can be combined with sparse matrix algebra to obtain linear scaling with respect to the no. of electrons in the system. Calcns. on systems of 10 000 atoms or more thus become feasible in a systematic basis set with moderate computational resources. Further computational savings can be achieved by exploiting the similarity of the adaptively contracted basis functions for closely related environments, e.g., in geometry optimizations or combined calcns. of neutral and charged systems. (c) 2014 American Institute of Physics.
- 29Ozaki, T. Variationally optimized atomic orbitals for large-scale electronic structures. Phys. Rev. B: Condens. Matter Mater. Phys. 2003, 67, 155108, DOI: 10.1103/PhysRevB.67.155108Google Scholar29Variationally optimized atomic orbitals for large-scale electronic structuresOzaki, T.Physical Review B: Condensed Matter and Materials Physics (2003), 67 (15), 155108/1-155108/5CODEN: PRBMDO; ISSN:0163-1829. (American Physical Society)A simple and practical method for variationally optimizing numerical AOs used in d. functional calcns. is presented based on the force theorem. The derived equation provides the same procedure for the optimization of AOs as that for the geometry optimization. The optimized orbitals well reproduce convergent results calcd. by a larger no. of unoptimized orbitals. In addn., we demonstrate that the optimized orbitals significantly reduce the computational effort in the geometry optimization, while keeping a high degree of accuracy.
- 30Ozaki, T.; Kino, H. Numerical atomic basis orbitals from H to Kr. Phys. Rev. B: Condens. Matter Mater. Phys. 2004, 69, 195113, DOI: 10.1103/PhysRevB.69.195113Google Scholar30Numerical atomic basis orbitals from H to KrOzaki, T.; Kino, H.Physical Review B: Condensed Matter and Materials Physics (2004), 69 (19), 195113/1-195113/19CODEN: PRBMDO; ISSN:0163-1829. (American Physical Society)We present a systematic study for numerical at. basis orbitals ranging from H to Kr, which could be used in large scale O(N) electronic structure calcns. based on d.-functional theories (DFT). The comprehensive investigation of convergence properties with respect to our primitive basis orbitals provides a practical guideline in an optimum choice of basis sets for each element, which well balances the computational efficiency and accuracy. Moreover, starting from the primitive basis orbitals, a simple and practical method for variationally optimizing basis orbitals is presented based on the force theorem, which enables us to maximize both the computational efficiency and accuracy. The optimized orbitals well reproduce convergent results calcd. by a larger no. of primitive orbitals. As illustrations of the orbital optimization, we demonstrate two examples: the geometry optimization coupled with the orbital optimization of a C60 mol. and the preorbital optimization for a specific group such as proteins. They clearly show that the optimized orbitals significantly reduce the computational efforts, while keeping a high degree of accuracy, thus indicating that the optimized orbitals are quite suitable for large scale DFT calcns.
- 31Junquera, J.; Paz, O.; Sánchez-Portal, D.; Artacho, E. Numerical atomic orbitals for linear-scaling calculations. Phys. Rev. B: Condens. Matter Mater. Phys. 2001, 64, 235111, DOI: 10.1103/PhysRevB.64.235111Google Scholar31Numerical atomic orbitals for linear-scaling calculationsJunquera, Javier; Paz, Oscar; Sanchez-Portal, Daniel; Artacho, EmilioPhysical Review B: Condensed Matter and Materials Physics (2001), 64 (23), 235111/1-235111/9CODEN: PRBMDO; ISSN:0163-1829. (American Physical Society)The performance of basis sets made of numerical AOs is explored in d.-functional calcns. of solids and mols. With the aim of optimizing basis quality while maintaining strict localization of the orbitals, as needed for linear-scaling calcns., several schemes have been tried. The best performance is obtained for the basis sets generated according to a new scheme presented here, a flexibilization of previous proposals. Strict localization is maintained while ensuring the continuity of the basis-function deriv. at the cutoff radius. The basis sets are tested vs. converged plane-wave calcns. on a significant variety of systems, including covalent, ionic, and metallic. Satisfactory convergence is obtained for reasonably small basis sizes, with a clear improvement over previous schemes. The transferability of the obtained basis sets is tested in several cases and it is found to be satisfactory as well.
- 32Basanta, M.; Dappe, Y.; Jelínek, P.; Ortega, J. Optimized atomic-like orbitals for first-principles tight-binding molecular dynamics. Comput. Mater. Sci. 2007, 39, 759– 766, DOI: 10.1016/j.commatsci.2006.09.003Google Scholar32Optimized atomic-like orbitals for first-principles tight-binding molecular dynamicsBasanta, M. A.; Dappe, Y. J.; Jelinek, P.; Ortega, J.Computational Materials Science (2007), 39 (4), 759-766CODEN: CMMSEM; ISSN:0927-0256. (Elsevier B.V.)We analyze the optimization of at.-like minimal basis sets for the hydrocarbons and for materials made up only of C atoms, e.g. C-nanotubes. In our approach the optimized numerical AOs (NAOs) are obtained as a linear combination of only two primitive NAOs. We find that the optimized basis sets yield an important lowering of the total energy, and bond lengths in very good agreement with the exptl. evidence. Also, we find that a good "universal" minimal basis set for the hydrocarbons and C-only materials can be obtained using this simple method. The approach discussed is a promising tool for the simulation of complex org. materials, beyond the hydrocarbons, using optimized minimal basis sets.
- 33Rayson, M. J.; Briddon, P. R. Highly efficient method for Kohn-Sham density functional calculations of 500–10000 atom systems. Phys. Rev. B: Condens. Matter Mater. Phys. 2009, 80, 205104, DOI: 10.1103/PhysRevB.80.205104Google Scholar33Highly efficient method for Kohn-Sham density functional calculations of 500-10 000 atom systemsRayson, M. J.; Briddon, P. R.Physical Review B: Condensed Matter and Materials Physics (2009), 80 (20), 205104/1-205104/11CODEN: PRBMDO; ISSN:1098-0121. (American Physical Society)A method for the soln. of the self-consistent Kohn-Sham equations using Gaussian-type orbitals is presented. Accurate relative energies and forces are demonstrated to be achievable at a fraction of the computational expense for large systems. With this approach calcns. involving around 1000 atoms can easily be performed with a serial desktop computer and ∼10 000 atom systems are within reach of relatively modest parallel computational resources. The method is applicable to arbitrary systems including metals. The approach generates a minimal basis on the fly while retaining the accuracy of the large underpinning basis set. Convergence of energies and forces are given for clusters as well as cubic cells of silicon and aluminum, for which the formation energies of defects are calcd. in systems up to 8000 and 4000 atoms, resp. For these systems the method exhibits linear scaling with the no. of atoms in the presently important size range of ∼500-3000 atoms.
- 34Rayson, M. Rapid filtration algorithm to construct a minimal basis on the fly from a primitive Gaussian basis. Comput. Phys. Commun. 2010, 181, 1051– 1056, DOI: 10.1016/j.cpc.2010.02.012Google Scholar34Rapid filtration algorithm to construct a minimal basis on the fly from a primitive Gaussian basisRayson, M. J.Computer Physics Communications (2010), 181 (6), 1051-1056CODEN: CPHCBZ; ISSN:0010-4655. (Elsevier B.V.)In a recent work an implementation of filter diagonalization with localization constraints was shown to provide an accurate and highly efficient method to solve the Kohn-Sham equations using a primitive Gaussian basis for systems contg. several thousand electrons. In this work, an alternative filtration algorithm is proposed, based on a rational approxn. to the filtration function, to replace the kernel of this algorithm. This approach is considerably faster than the diagonalization approach used previously and also its performance is largely independent of the filtration temp., aiding a more flexible approach to the construction of filtered basis sets.
- 35Nakata, A.; Bowler, D. R.; Miyazaki, T. Efficient Calculations with Multisite Local Orbitals in a Large-Scale DFT Code CONQUEST. J. Chem. Theory Comput. 2014, 10, 4813– 4822, DOI: 10.1021/ct5004934Google Scholar35Efficient Calculations with Multisite Local Orbitals in a Large-Scale DFT Code CONQUESTNakata, Ayako; Bowler, David R.; Miyazaki, TsuyoshiJournal of Chemical Theory and Computation (2014), 10 (11), 4813-4822CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)Multisite local orbitals, which are formed from linear combinations of pseudoat. orbitals from a target atom and its neighbor atoms, have been introduced in the large-scale d. functional theory calcn. code CONQUEST. Multisite local orbitals correspond to local MOs so that the no. of required local orbitals can be minimal. The multisite support functions are detd. by using the localized filter diagonalization (LFD) method. Two new methods, the double cutoff method and the smoothing method, are introduced to the LFD method to improve efficiency and stability. The Hamiltonian and overlap matrixes with multisite local orbitals are constructed by efficient sparse-matrix multiplications in CONQUEST. The investigation of the calcd. energetic and geometrical properties and band structures of bulk Si, Al, and DNA systems demonstrate the accuracy and the computational efficiency of the present method. The representability of both occupied and unoccupied band structures with the present method has been also confirmed.
- 36Lin, L.; Lu, J.; Ying, L.; E, W. Adaptive local basis set for Kohn-Sham density functional theory in a discontinuous Galerkin framework I: Total energy calculation. J. Comput. Phys. 2012, 231, 2140– 2154, DOI: 10.1016/j.jcp.2011.11.032Google ScholarThere is no corresponding record for this reference.
- 37Lin, L.; Lu, J.; Ying, L.; E, W. Optimized local basis set for Kohn-Sham density functional theory. J. Comput. Phys. 2012, 231, 4515– 4529, DOI: 10.1016/j.jcp.2012.03.009Google ScholarThere is no corresponding record for this reference.
- 38Mao, Y.; Horn, P. R.; Mardirossian, N.; Head-Gordon, T.; Skylaris, C.-K.; Head-Gordon, M. Approaching the basis set limit for DFT calculations using an environment-adapted minimal basis with perturbation theory: Formulation, proof of concept, and a pilot implementation. J. Chem. Phys. 2016, 145, 044109, DOI: 10.1063/1.4959125Google Scholar38Approaching the basis set limit for DFT calculations using an environment-adapted minimal basis with perturbation theory: Formulation, proof of concept, and a pilot implementationMao, Yuezhi; Horn, Paul R.; Mardirossian, Narbe; Head-Gordon, Teresa; Skylaris, Chris-Kriton; Head-Gordon, MartinJournal of Chemical Physics (2016), 145 (4), 044109/1-044109/17CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)Recently developed d. functionals have good accuracy for both thermochem. (TC) and non-covalent interactions (NC) if very large AO basis sets are used. To approach the basis set limit with potentially lower computational cost, a new SCF scheme is presented that employs minimal adaptive basis (MAB) functions. The MAB functions are optimized on each at. site by minimizing a surrogate function. High accuracy is obtained by applying a perturbative correction (PC) to the MAB calcn., similar to dual basis approaches. Compared to exact SCF results, using this MAB-SCF (PC) approach with the same large target basis set produces <0.15 kcal/mol root-mean-square deviations for most of the tested TC datasets, and <0.1 kcal/mol for most of the NC datasets. The performance of d. functionals near the basis set limit can be even better reproduced. With further improvement to its implementation, MAB-SCF (PC) is a promising lower-cost substitute for conventional large-basis calcns. as a method to approach the basis set limit of modern d. functionals. (c) 2016 American Institute of Physics.
- 39Ramakrishnan, R.; von Lilienfeld, O. A. Rev. Comput. Chem.; John Wiley & Sons, Inc., 2017; pp 225– 256.Google ScholarThere is no corresponding record for this reference.
- 40Hansen, K.; Montavon, G.; Biegler, F.; Fazli, S.; Rupp, M.; Scheffler, M.; von Lilienfeld, O. A.; Tkatchenko, A.; Müller, K.-R. Assessment and Validation of Machine Learning Methods for Predicting Molecular Atomization Energies. J. Chem. Theory Comput. 2013, 9, 3404– 3419, DOI: 10.1021/ct400195dGoogle Scholar40Assessment and Validation of Machine Learning Methods for Predicting Molecular Atomization EnergiesHansen, Katja; Montavon, Gregoire; Biegler, Franziska; Fazli, Siamac; Rupp, Matthias; Scheffler, Matthias; von Lilienfeld, O. Anatole; Tkatchenko, Alexandre; Mueller, Klaus-RobertJournal of Chemical Theory and Computation (2013), 9 (8), 3404-3419CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)The accurate and reliable prediction of properties of mols. typically requires computationally intensive quantum-chem. calcns. Recently, machine learning techniques applied to ab initio calcns. have been proposed as an efficient approach for describing the energies of mols. in their given ground-state structure throughout chem. compd. space. In this paper we outline a no. of established machine learning techniques and investigate the influence of the mol. representation on the methods performance. The best methods achieve prediction errors of 3 kcal/mol for the atomization energies of a wide variety of mols. Rationales for this performance improvement are given together with pitfalls and challenges when applying machine learning approaches to the prediction of quantum-mech. observables.
- 41Handley, C. M.; Popelier, P. L. A. Potential Energy Surfaces Fitted by Artificial Neural Networks. J. Phys. Chem. A 2010, 114, 3371– 3383, DOI: 10.1021/jp9105585Google Scholar41Potential Energy Surfaces Fitted by Artificial Neural NetworksHandley, Chris M.; Popelier, Paul L. A.Journal of Physical Chemistry A (2010), 114 (10), 3371-3383CODEN: JPCAFH; ISSN:1089-5639. (American Chemical Society)A review. Mol. mechanics is the tool of choice for the modeling of systems that are so large or complex that it is impractical or impossible to model them by ab initio methods. For this reason there is a need for accurate potentials that are able to quickly reproduce ab initio quality results at the fraction of the cost. The interactions within force fields are represented by a no. of functions. Some interactions are well understood and can be represented by simple math. functions while others are not so well understood and their functional form is represented in a simplistic manner or not even known. In the last 20 years there have been the first examples of a new design ethic, where novel and contemporary methods using machine learning, in particular, artificial neural networks, have been used to find the nature of the underlying functions of a force field. Here we appraise what has been achieved over this time and what requires further improvements, while offering some insight and guidance for the development of future force fields.
- 42Behler, J. Neural network potential-energy surfaces in chemistry: a tool for large-scale simulations. Phys. Chem. Chem. Phys. 2011, 13, 17930– 17955, DOI: 10.1039/c1cp21668fGoogle Scholar42Neural network potential-energy surfaces in chemistry: a tool for large-scale simulationsBehler, JoergPhysical Chemistry Chemical Physics (2011), 13 (40), 17930-17955CODEN: PPCPFQ; ISSN:1463-9076. (Royal Society of Chemistry)A review. The accuracy of the results obtained in mol. dynamics or Monte Carlo simulations crucially depends on a reliable description of the at. interactions. A large variety of efficient potentials has been proposed in the literature, but often the optimum functional form is difficult to find and strongly depends on the particular system. In recent years, artificial neural networks (NN) have become a promising new method to construct potentials for a wide range of systems. They offer a no. of advantages: they are very general and applicable to systems as different as small mols., semiconductors and metals; they are numerically very accurate and fast to evaluate; and they can be constructed using any electronic structure method. Significant progress has been made in recent years and a no. of successful applications demonstrate the capabilities of neural network potentials. In this Perspective, the current status of NN potentials is reviewed, and their advantages and limitations are discussed.
- 43Morawietz, T.; Singraber, A.; Dellago, C.; Behler, J. How van der Waals interactions determine the unique properties of water. Proc. Natl. Acad. Sci. U. S. A. 2016, 113, 8368– 8373, DOI: 10.1073/pnas.1602375113Google Scholar43How van der Waals interactions determine the unique properties of waterMorawietz, Tobias; Singraber, Andreas; Dellago, Christoph; Behler, JoergProceedings of the National Academy of Sciences of the United States of America (2016), 113 (30), 8368-8373CODEN: PNASA6; ISSN:0027-8424. (National Academy of Sciences)Whereas the interactions between water mols. are dominated by strongly directional hydrogen bonds (HBs), it was recently proposed that relatively weak, isotropic van der Waals (vdW) forces are essential for understanding the properties of liq. water and ice. This insight was derived from ab initio computer simulations, which provide an unbiased description of water at the at. level and yield information on the underlying mol. forces. However, the high computational cost of such simulations prevents the systematic investigation of the influence of vdW forces on the thermodn. anomalies of water. Here, we develop efficient ab initio-quality neural network potentials and use them to demonstrate that vdW interactions are crucial for the formation of water's d. max. and its neg. vol. of melting. Both phenomena can be explained by the flexibility of the HB network, which is the result of a delicate balance of weak vdW forces, causing, e.g., a pronounced expansion of the second solvation shell upon cooling that induces the d. max.
- 44Snyder, J. C.; Rupp, M.; Hansen, K.; Blooston, L.; Mueller, K.-R.; Burke, K. Orbital-free bond breaking via machine learning. J. Chem. Phys. 2013, 139, 224104, DOI: 10.1063/1.4834075Google Scholar44Orbital-free bond breaking via machine learningSnyder, John C.; Rupp, Matthias; Hansen, Katja; Blooston, Leo; Mueller, Klaus-Robert; Burke, KieronJournal of Chemical Physics (2013), 139 (22), 224104/1-224104/10CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)Using a one-dimensional model, we explore the ability of machine learning to approx. the non-interacting kinetic energy d. functional of diatomics. This nonlinear interpolation between Kohn-Sham ref. calcns. can (i) accurately dissoc. a diat., (ii) be systematically improved with increased ref. data and (iii) generate accurate self-consistent densities via a projection method that avoids directions with no data. With relatively few densities, the error due to the interpolation is smaller than typical errors in std. exchange-correlation functionals. (c) 2013 American Institute of Physics.
- 45Schütt, K. T.; Glawe, H.; Brockherde, F.; Sanna, A.; Müller, K. R.; Gross, E. K. U. How to represent crystal structures for machine learning: Towards fast prediction of electronic properties. Phys. Rev. B: Condens. Matter Mater. Phys. 2014, 89, 205118, DOI: 10.1103/PhysRevB.89.205118Google Scholar45How to represent crystal structures for machine learning: towards fast prediction of electronic propertiesSchuett, K. T.; Glawe, H.; Brockherde, F.; Sanna, A.; Mueller, K. R.; Gross, E. K. U.Physical Review B: Condensed Matter and Materials Physics (2014), 89 (20), 205118/1-205118/5, 5 pp.CODEN: PRBMDO; ISSN:1098-0121. (American Physical Society)High-throughput d. functional calcns. of solids are highly time-consuming. As an alternative, we propose a machine learning approach for the fast prediction of solid-state properties. To achieve this, local spin-d. approxn. calcns. are used as a training set. We focus on predicting the value of the d. of electronic states at the Fermi energy. We find that conventional representations of the input data, such as the Coulomb matrix, are not suitable for the training of learning machines in the case of periodic solids. We propose a novel crystal structure representation for which learning and competitive prediction accuracies become possible within an unrestricted class of spd systems of arbitrary unit-cell size.
- 46Dral, P. O.; von Lilienfeld, O. A.; Thiel, W. Machine Learning of Parameters for Accurate Semiempirical Quantum Chemical Calculations. J. Chem. Theory Comput. 2015, 11, 2120– 2125, DOI: 10.1021/acs.jctc.5b00141Google Scholar46Machine Learning of Parameters for Accurate Semiempirical Quantum Chemical CalculationsDral, Pavlo O.; von Lilienfeld, O. Anatole; Thiel, WalterJournal of Chemical Theory and Computation (2015), 11 (5), 2120-2125CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)We investigate possible improvements in the accuracy of semiempirical quantum chem. (SQC) methods through the use of machine learning (ML) models for the parameters. For a given class of compds., ML techniques require sufficiently large training sets to develop ML models that can be used for adapting SQC parameters to reflect changes in mol. compn. and geometry. The ML-SQC approach allows the automatic tuning of SQC parameters for individual mols., thereby improving the accuracy without deteriorating transferability to mols. with mol. descriptors very different from those in the training set. The performance of this approach is demonstrated for the semiempirical OM2 method using a set of 6095 constitutional isomers C7H10O2, for which accurate ab initio atomization enthalpies are available. The ML-OM2 results show improved av. accuracy and a much reduced error range compared with those of std. OM2 results, with mean abs. errors in atomization enthalpies dropping from 6.3 to 1.7 kcal/mol. They are also found to be superior to the results from specific OM2 reparameterizations (rOM2) for the same set of isomers. The ML-SQC approach thus holds promise for fast and reasonably accurate high-throughput screening of materials and mols.
- 47Kranz, J. J.; Kubillus, M.; Ramakrishnan, R.; von Lilienfeld, O. A.; Elstner, M. Generalized Density-Functional Tight-Binding Repulsive Potentials from Unsupervised Machine Learning. J. Chem. Theory Comput. 2018, 14, 2341– 2352, DOI: 10.1021/acs.jctc.7b00933Google Scholar47Generalized Density-Functional Tight-Binding Repulsive Potentials from Unsupervised Machine LearningKranz, Julian J.; Kubillus, Maximilian; Ramakrishnan, Raghunathan; von Lilienfeld, O. Anatole; Elstner, MarcusJournal of Chemical Theory and Computation (2018), 14 (5), 2341-2352CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)We combine the approx. d.-functional tight-binding (DFTB) method with unsupervised machine learning. This allows us to improve transferability and accuracy, make use of large quantum chem. data sets for the parametrization, and efficiently automatize the parametrization process of DFTB. For this purpose, generalized pair-potentials are introduced, where the chem. environment is included during the learning process, leading to more specific effective two-body potentials. We train on energies and forces of equil. and nonequil. structures of 2100 mols., and test on ∼130,000 org. mols. contg. O, N, C, H, and F atoms. Atomization energies of the ref. method can be reproduced within an error of ∼2.6 kcal/mol, indicating drastic improvement over std. DFTB.
- 48White, C. A.; Maslen, P.; Lee, M. S.; Head-Gordon, M. The tensor properties of energy gradients within a non-orthogonal basis. Chem. Phys. Lett. 1997, 276, 133– 138, DOI: 10.1016/S0009-2614(97)88046-3Google ScholarThere is no corresponding record for this reference.
- 49Bartók, A. P.; Kondor, R.; Csányi, G. On representing chemical environments. Phys. Rev. B: Condens. Matter Mater. Phys. 2013, 87, 184115, DOI: 10.1103/PhysRevB.87.184115Google Scholar49On representing chemical environmentsBartok, Albert P.; Kondor, Risi; Csanyi, GaborPhysical Review B: Condensed Matter and Materials Physics (2013), 87 (18), 184115/1-184115/16CODEN: PRBMDO; ISSN:1098-0121. (American Physical Society)We review some recently published methods to represent at. neighborhood environments, and analyze their relative merits in terms of their faithfulness and suitability for fitting potential energy surfaces. The crucial properties that such representations (sometimes called descriptors) must have are differentiability with respect to moving the atoms and invariance to the basic symmetries of physics: rotation, reflection, translation, and permutation of atoms of the same species. We demonstrate that certain widely used descriptors that initially look quite different are specific cases of a general approach, in which a finite set of basis functions with increasing angular wave nos. are used to expand the at. neighborhood d. function. Using the example system of small clusters, we quant. show that this expansion needs to be carried to higher and higher wave nos. as the no. of neighbors increases in order to obtain a faithful representation, and that variants of the descriptors converge at very different rates. We also propose an altogether different approach, called Smooth Overlap of Atomic Positions, that sidesteps these difficulties by directly defining the similarity between any two neighborhood environments, and show that it is still closely connected to the invariant descriptors. We test the performance of the various representations by fitting models to the potential energy surface of small silicon clusters and the bulk crystal.
- 50De, S.; Bartok, A. P.; Csanyi, G.; Ceriotti, M. Comparing molecules and solids across structural and alchemical space. Phys. Chem. Chem. Phys. 2016, 18, 13754– 13769, DOI: 10.1039/C6CP00415FGoogle Scholar50Comparing molecules and solids across structural and alchemical spaceDe, Sandip; Bartok, Albert P.; Csanyi, Gabor; Ceriotti, MichelePhysical Chemistry Chemical Physics (2016), 18 (20), 13754-13769CODEN: PPCPFQ; ISSN:1463-9076. (Royal Society of Chemistry)Evaluating the (dis)similarity of cryst., disordered and mol. compds. is a crit. step in the development of algorithms to navigate automatically the configuration space of complex materials. For instance, a structural similarity metric is crucial for classifying structures, searching chem. space for better compds. and materials, and driving the next generation of machine-learning techniques for predicting the stability and properties of mols. and materials. In the last few years several strategies have been designed to compare at. coordination environments. In particular, the smooth overlap of at. positions (SOAPs) has emerged as an elegant framework to obtain translation, rotation and permutation-invariant descriptors of groups of atoms, underlying the development of various classes of machine-learned inter-at. potentials. Here we discuss how one can combine such local descriptors using a regularized entropy match (REMatch) approach to describe the similarity of both whole mol. and bulk periodic structures, introducing powerful metrics that enable the navigation of alchem. and structural complexities within a unified framework. Furthermore, using this kernel and a ridge regression method we can predict atomization energies for a database of small org. mols. with a mean abs. error below 1 kcal mol-1, reaching an important milestone in the application of machine-learning techniques for the evaluation of mol. properties.
- 51Zhu, L.; Amsler, M.; Fuhrer, T.; Schäfer, B.; Faraji, S.; Rostami, S.; Ghasemi, S. A.; Sadeghi, A.; Grauzinyte, M.; Wolverton, C.; Goedecker, S. A fingerprint based metric for measuring similarities of crystalline structures. J. Chem. Phys. 2016, 144, 034203, DOI: 10.1063/1.4940026Google Scholar51A fingerprint based metric for measuring similarities of crystalline structuresZhu, Li; Amsler, Maximilian; Fuhrer, Tobias; Schaefer, Bastian; Faraji, Somayeh; Rostami, Samare; Ghasemi, S. Alireza; Sadeghi, Ali; Grauzinyte, Migle; Wolverton, Chris; Goedecker, StefanJournal of Chemical Physics (2016), 144 (3), 034203/1-034203/9CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)Measuring similarities/dissimilarities between at. structures is important for the exploration of potential energy landscapes. However, the cell vectors together with the coordinates of the atoms, which are generally used to describe periodic systems, are quantities not directly suitable as fingerprints to distinguish structures. Based on a characterization of the local environment of all atoms in a cell, we introduce crystal fingerprints that can be calcd. easily and define configurational distances between cryst. structures that satisfy the math. properties of a metric. This distance between two configurations is a measure of their similarity/dissimilarity and it allows in particular to distinguish structures. The new method can be a useful tool within various energy landscape exploration schemes, such as min. hopping, random search, swarm intelligence algorithms, and high-throughput screenings. (c) 2016 American Institute of Physics.
- 52Sadeghi, A.; Ghasemi, S. A.; Schäfer, B.; Mohr, S.; Lill, M. A.; Goedecker, S. Metrics for measuring distances in configuration spaces. J. Chem. Phys. 2013, 139, 184118, DOI: 10.1063/1.4828704Google Scholar52Metrics for measuring distances in configuration spacesSadeghi, Ali; Ghasemi, S. Alireza; Schaefer, Bastian; Mohr, Stephan; Lill, Markus A.; Goedecker, StefanJournal of Chemical Physics (2013), 139 (18), 184118/1-184118/11CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)In order to characterize mol. structures we introduce configurational fingerprint vectors which are counterparts of quantities used exptl. to identify structures. The Euclidean distance between the configurational fingerprint vectors satisfies the properties of a metric and can therefore safely be used to measure dissimilarities between configurations in the high dimensional configuration space. In particular we show that these metrics are a perfect and computationally cheap replacement for the root-mean-square distance (RMSD) when one has to decide whether two noise contaminated configurations are identical or not. We introduce a Monte Carlo approach to obtain the global min. of the RMSD between configurations, which is obtained from a global minimization over all translations, rotations, and permutations of at. indexes. (c) 2013 American Institute of Physics.
- 53Rupp, M.; Tkatchenko, A.; Müller, K.-R.; von Lilienfeld, O. A. Fast and Accurate Modeling of Molecular Atomization Energies with Machine Learning. Phys. Rev. Lett. 2012, 108, 058301, DOI: 10.1103/PhysRevLett.108.058301Google Scholar53Fast and accurate modeling of molecular atomization energies with machine learningRupp, Matthias; Tkatchenko, Alexandre; Mueller, Klaus-Robert; von Lilienfeld, O. AnatolePhysical Review Letters (2012), 108 (5), 058301/1-058301/5CODEN: PRLTAO; ISSN:0031-9007. (American Physical Society)The authors introduce a machine learning model to predict atomization energies of a diverse set of org. mols., based on nuclear charges and at. positions only. The problem of solving the mol. Schrodinger equation is mapped onto a nonlinear statistical regression problem of reduced complexity. Regression models are trained on and compared to atomization energies computed with hybrid d.-functional theory. Cross validation over more than seven thousand org. mols. yields a mean abs. error of ∼10 kcal/mol. Applicability is demonstrated for the prediction of mol. atomization potential energy curves.
- 54Behler, J. Constructing high-dimensional neural network potentials: A tutorial review. Int. J. Quantum Chem. 2015, 115, 1032– 1050, DOI: 10.1002/qua.24890Google Scholar54Constructing high-dimensional neural network potentials: A tutorial reviewBehler, JoergInternational Journal of Quantum Chemistry (2015), 115 (16), 1032-1050CODEN: IJQCB2; ISSN:0020-7608. (John Wiley & Sons, Inc.)A lot of progress has been made in recent years in the development of atomistic potentials using machine learning (ML) techniques. In contrast to most conventional potentials, which are based on phys. approxns. and simplifications to derive an analytic functional relation between the at. configuration and the potential-energy, ML potentials rely on simple but very flexible math. terms without a direct phys. meaning. Instead, in case of ML potentials the topol. of the potential-energy surface is "learned" by adjusting a no. of parameters with the aim to reproduce a set of ref. electronic structure data as accurately as possible. Due to this bias-free construction, they are applicable to a wide range of systems without changes in their functional form, and a very high accuracy close to the underlying first-principles data can be obtained. Neural network potentials (NNPs), which have first been proposed about two decades ago, are an important class of ML potentials. Although the first NNPs have been restricted to small mols. with only a few degrees of freedom, they are now applicable to high-dimensional systems contg. thousands of atoms, which enables addressing a variety of problems in chem., physics, and materials science. In this tutorial review, the basic ideas of NNPs are presented with a special focus on developing NNPs for high-dimensional condensed systems. A recipe for the construction of these potentials is given and remaining limitations of the method are discussed. © 2015 Wiley Periodicals, Inc.
- 55Rasmussen, C. E.; Williams, C. K. I. Gaussian Processes for Machine Learning; The MIT Press, 2006; http://www.gaussianprocess.org/gpml/ (accessed Apr. 20, 2018).Google ScholarThere is no corresponding record for this reference.
- 56Tikhonov, A. N.; Goncharsky, A. V.; Stepanov, V. V.; Yagola, A. G. Numerical Methods for the Solution of Ill-Posed Problems; Kluwer Academic, 1995.Google ScholarThere is no corresponding record for this reference.
- 57Gillan, M. J.; Alfé, D.; Michaelides, A. Perspective: How good is DFT for water. J. Chem. Phys. 2016, 144, 130901, DOI: 10.1063/1.4944633Google Scholar57Perspective: How good is DFT for water?Gillan, Michael J.; Alfe, Dario; Michaelides, AngelosJournal of Chemical Physics (2016), 144 (13), 130901/1-130901/33CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)A review. Kohn-Sham d. functional theory (DFT) has become established as an indispensable tool for investigating aq. systems of all kinds, including those important in chem., surface science, biol., and the earth sciences. Nevertheless, many widely used approxns. for the exchange-correlation (XC) functional describe the properties of pure water systems with an accuracy that is not fully satisfactory. The explicit inclusion of dispersion interactions generally improves the description, but there remain large disagreements between the predictions of different dispersion-inclusive methods. We present here a review of DFT work on water clusters, ice structures, and liq. water, with the aim of elucidating how the strengths and weaknesses of different XC approxns. manifest themselves across this variety of water systems. Our review highlights the crucial role of dispersion in describing the delicate balance between compact and extended structures of many different water systems, including the liq. By referring to a wide range of published work, we argue that the correct description of exchange-overlap interactions is also extremely important, so that the choice of semi-local or hybrid functional employed in dispersion-inclusive methods is crucial. The origins and consequences of beyond-2-body errors of approx. XC functionals are noted, and we also discuss the substantial differences between different representations of dispersion. We propose a simple numerical scoring system that rates the performance of different XC functionals in describing water systems, and we suggest possible future developments. (c) 2016 American Institute of Physics.
- 58Del Ben, M.; Schönherr, M.; Hutter, J.; VandeVondele, J. Bulk Liquid Water at Ambient Temperature and Pressure from MP2 Theory. J. Phys. Chem. Lett. 2013, 4, 3753– 3759, DOI: 10.1021/jz401931fGoogle Scholar58Bulk Liquid Water at Ambient Temperature and Pressure from MP2 TheoryDel Ben, Mauro; Schonherr, Mandes; Hutter, Jurg; Vande Vondele, JoostJournal of Physical Chemistry Letters (2013), 4 (21), 3753-3759CODEN: JPCLCD; ISSN:1948-7185. (American Chemical Society)MP2 provides a good description of hydrogen bonding in water clusters and includes long-range dispersion interactions without the need to introduce empirical elements in the description of the interat. potential. To assess its performance for bulk liq. water under ambient conditions, an isobaric-isothermal (NpT) Monte Carlo simulation at the second-order Moller-Plesset perturbation theory level (MP2) has been performed. The obtained value of the water d. is excellent (1.02 g/mL), and the calcd. radial distribution functions are in fair agreement with exptl. data. The MP2 results are compared to a few d. functional approxns., including semilocal functionals, hybrid functionals, and functionals including empirical dispersion corrections. These results demonstrate the feasibility of directly sampling the potential energy surface of condensed-phase systems using correlated wave function theory, and their quality paves the way for further applications.
- 59Del Ben, M.; Hutter, J.; VandeVondele, J. Probing the structural and dynamical properties of liquid water with models including non-local electron correlation. J. Chem. Phys. 2015, 143, 054506, DOI: 10.1063/1.4927325Google Scholar59Probing the structural and dynamical properties of liquid water with models including non-local electron correlationDel Ben, Mauro; Hutter, Jurg; Vande Vondele, JoostJournal of Chemical Physics (2015), 143 (5), 054506/1-054506/12CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)Water is a ubiquitous liq. that displays a wide range of anomalous properties and has a delicate structure that challenges expt. and simulation alike. The various intermol. interactions that play an important role, such as repulsion, polarization, hydrogen bonding, and van der Waals interactions, are often difficult to reproduce faithfully in atomistic models. Here, electronic structure theories including all these interactions at equal footing, which requires the inclusion of non-local electron correlation, are used to describe structure and dynamics of bulk liq. water. Isobaric-isothermal (NpT) ensemble simulations based on the RPA yield excellent d. (0.994 g/mL) and fair radial distribution functions, while various other d. functional approxns. produce scattered results (0.8-1.2 g/mL). Mol. dynamics simulation in the microcanonical (NVE) ensemble based on Moller-Plesset perturbation theory (MP2) yields dynamical properties in the condensed phase, namely, the IR spectrum and diffusion const. At the MP2 and RPA levels of theory, ice is correctly predicted to float on water, resolving one of the anomalies as resulting from a delicate balance between van der Waals and hydrogen bonding interactions. For several properties, obtaining quant. agreement with expt. requires correction for nuclear quantum effects (NQEs), highlighting their importance, for structure, dynamics, and electronic properties. A computed NQE shift of 0.6 eV for the band gap and absorption spectrum illustrates the latter. Giving access to both structure and dynamics of condensed phase systems, non-local electron correlation will increasingly be used to study systems where weak interactions are of paramount importance. (c) 2015 American Institute of Physics.
- 60Verlet, L. Computer ”Experiments” on Classical Fluids. I. Thermodynamical Properties of Lennard-Jones Molecules. Phys. Rev. 1967, 159, 98– 103, DOI: 10.1103/PhysRev.159.98Google Scholar60Computer "experiments" on classical fluids. I. Thermodynamical properties of Lennard-Jones moleculesVerlet, LoupPhysical Review (1967), 159 (1), 98-103CODEN: PHRVAO; ISSN:0031-899X.The equation of motion of a system of 864 particles interacting through a Lennard-Jones potential was integrated for various values of the temp. and d., relative, generally, to a fluid state. The equil. properties agree with the corresponding properties of Ar. The equil. state of Ar can be described through a 2-body potential.
- 61Elstner, M.; Porezag, D.; Jungnickel, G.; Elsner, J.; Haugk, M.; Frauenheim, T.; Suhai, S.; Seifert, G. Self-consistent-charge density-functional tight-binding method for simulations of complex materials properties. Phys. Rev. B: Condens. Matter Mater. Phys. 1998, 58, 7260– 7268, DOI: 10.1103/PhysRevB.58.7260Google Scholar61Self-consistent-charge density-functional tight-binding method for simulations of complex materials propertiesElstner, M.; Porezag, D.; Jungnickel, G.; Elsner, J.; Haugk, M.; Frauenheim, Th.; Suhai, S.; Seifert, G.Physical Review B: Condensed Matter and Materials Physics (1998), 58 (11), 7260-7268CODEN: PRBMDO; ISSN:0163-1829. (American Physical Society)We outline details about an extension of the tight-binding (TB) approach to improve total energies, forces, and transferability. The method is based on a second-order expansion of the Kohn-Sham total energy in d.-functional theory (DFT) with respect to charge-d. fluctuations. The zeroth-order approach is equiv. to a common std. non-self-consistent (TB) scheme, while at second-order a transparent, parameter-free, and readily calculable expression for generalized Hamiltonian matrix elements may be derived. These are modified by a self-consistent redistribution of Mulliken charges (SCC). Besides the usual "band structure" and short-range repulsive terms the final approx. Kohn-Sham energy addnl. includes a Coulomb interaction between charge fluctuations. At large distances this accounts for long-range electrostatic forces between two point charges and approx. includes self-interaction contributions of a given atom if the charges are located at one and the same atom. We apply the new SCC scheme to problems where deficiencies within the non-SCC std. TB approach become obvious. We thus considerably improve transferability.
- 62Hu, H.; Lu, Z.; Elstner, M.; Hermans, J.; Yang, W. Simulating Water with the Self-Consistent-Charge Density Functional Tight Binding Method: From Molecular Clusters to the Liquid State. J. Phys. Chem. A 2007, 111, 5685– 5691, DOI: 10.1021/jp070308dGoogle Scholar62Simulating Water with the Self-Consistent-Charge Density Functional Tight Binding Method: From Molecular Clusters to the Liquid StateHu, Hao; Lu, Zhenyu; Elstner, Marcus; Hermans, Jan; Yang, WeitaoJournal of Physical Chemistry A (2007), 111 (26), 5685-5691CODEN: JPCAFH; ISSN:1089-5639. (American Chemical Society)The recently developed self-consistent-charge d. functional tight binding (SCCDFTB) method provides an accurate and inexpensive quantum mech. soln. to many mol. systems of interests. To examine the performance of the SCCDFTB method on (liq.) water, the most fundamental yet indispensable mol. in biol. systems, we report here the simulation results of water in sizes ranging from mol. clusters to the liq. state. The latter simulation was achieved through the use of the linear scaling divide-and-conquer approach. The results of liq. water simulation indicate that the SCCDFTB method can describe the structural and energetics of liq. water in qual. agreement with expts., and the results for water clusters suggest potential future improvements of the SCCDFTB method.
- 63Skinner, L. B.; Huang, C.; Schlesinger, D.; Pettersson, L. G. M.; Nilsson, A.; Benmore, C. J. Benchmark oxygen-oxygen pair-distribution function of ambient water from x-ray diffraction measurements with a wide Q-range. J. Chem. Phys. 2013, 138, 074506, DOI: 10.1063/1.4790861Google Scholar63Benchmark oxygen-oxygen pair-distribution function of ambient water from x-ray diffraction measurements with a wide Q-rangeSkinner, Lawrie B.; Huang, Congcong; Schlesinger, Daniel; Pettersson, Lars G. M.; Nilsson, Anders; Benmore, Chris J.Journal of Chemical Physics (2013), 138 (7), 074506/1-074506/12CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)Four recent x-ray diffraction measurements of ambient liq. water are reviewed here. Each of these measurements represents a significant development of the x-ray diffraction technique applied to the study of liq. water. Sources of uncertainty from statistical noise, Q-range, Compton scattering, and self-scattering are discussed. The oxygen-hydrogen contribution to the measured x-ray scattering pattern was subtracted using literature data to yield an exptl. detn., with error bars, of the oxygen-oxygen pair-distribution function, gOO(r), which essentially describes the distribution of mol. centers. The extended Q-range and low statistical noise of these measurements has significantly reduced truncation effects and related errors in the gOO(r) functions obtained. From these measurements and error anal., the position and height of the nearest neighbor max. in gOO(r) were found to be 2.80(1) Å and 2.57(5) resp. Numerical data for the coherent differential x-ray scattering cross-section IX(Q), the oxygen-oxygen structure factor SOO(Q), and the derived gOO(r) are provided as benchmarks for calibrating force-fields for water. (c) 2013 American Institute of Physics.
- 64The CP2K developers group. CP2K , 2018; https://www.cp2k.org, (accessed Apr. 20, 2018).Google ScholarThere is no corresponding record for this reference.
- 65Hutter, J.; Iannuzzi, M.; Schiffmann, F.; VandeVondele, J. CP2K: atomistic simulations of condensed matter systems. Wiley Interdiscip. Rev.: Comput. Mol. Sci. 2014, 4, 15– 25, DOI: 10.1002/wcms.1159Google Scholar65cp2k: atomistic simulations of condensed matter systemsHutter, Juerg; Iannuzzi, Marcella; Schiffmann, Florian; VandeVondele, JoostWiley Interdisciplinary Reviews: Computational Molecular Science (2014), 4 (1), 15-25CODEN: WIRCAH; ISSN:1759-0884. (Wiley-Blackwell)A review. Cp2k has become a versatile open-source tool for the simulation of complex systems on the nanometer scale. It allows for sampling and exploring potential energy surfaces that can be computed using a variety of empirical and first principles models. Excellent performance for electronic structure calcns. is achieved using novel algorithms implemented for modern and massively parallel hardware. This review briefly summarizes the main capabilities and illustrates with recent applications the science cp2k has enabled in the field of atomistic simulation. WIREs Comput Mol Sci 2014, 4:15-25. doi: 10.1002/wcms.1159 The authors have declared no conflicts of interest in relation to this article. For further resources related to this article, please visit the WIREs website.
- 66VandeVondele, J.; Krack, M.; Mohamed, F.; Parrinello, M.; Chassaing, T.; Hutter, J. Quickstep: Fast and accurate density functional calculations using a mixed Gaussian and plane waves approach. Comput. Phys. Commun. 2005, 167, 103– 128, DOI: 10.1016/j.cpc.2004.12.014Google Scholar66QUICKSTEP: fast and accurate density functional calculations using a mixed Gaussian and plane waves approachVandeVondele, Joost; Krack, Matthias; Mohamed, Fawzi; Parrinello, Michele; Chassaing, Thomas; Hutter, JuergComputer Physics Communications (2005), 167 (2), 103-128CODEN: CPHCBZ; ISSN:0010-4655. (Elsevier B.V.)We present the Gaussian and plane waves (GPW) method and its implementation in which is part of the freely available program package CP2K. The GPW method allows for accurate d. functional calcns. in gas and condensed phases and can be effectively used for mol. dynamics simulations. We show how derivs. of the GPW energy functional, namely ionic forces and the Kohn-Sham matrix, can be computed in a consistent way. The computational cost of computing the total energy and the Kohn-Sham matrix is scaling linearly with the system size, even for condensed phase systems of just a few tens of atoms. The efficiency of the method allows for the use of large Gaussian basis sets for systems up to 3000 atoms, and we illustrate the accuracy of the method for various basis sets in gas and condensed phases. Agreement with basis set free calcns. for single mols. and plane wave based calcns. in the condensed phase is excellent. Wave function optimization with the orbital transformation technique leads to good parallel performance, and outperforms traditional diagonalisation methods. Energy conserving Born-Oppenheimer dynamics can be performed, and a highly efficient scheme is obtained using an extrapolation of the d. matrix. We illustrate these findings with calcns. using commodity PCs as well as supercomputers.
- 67Lippert, G.; Hutter, J.; Parrinello, M. A hybrid Gaussian and plane wave density functional scheme. Mol. Phys. 1997, 92, 477– 488, DOI: 10.1080/00268979709482119Google Scholar67A hybrid Gaussian and plane wave density functional schemeLippert, Gerald; Hutter, Juerg; Parrinello, MicheleMolecular Physics (1997), 92 (3), 477-487CODEN: MOPHAM; ISSN:0026-8976. (Taylor & Francis)A d.-functional theory-based algorithm for periodic and nonperiodic ab initio calcns. is presented. This scheme uses pseudopotentials in order to integrate out the core electrons from the problem. The valence pseudo wave functions are expanded in Gaussian-type orbitals and the d. is represented in a plane wave auxiliary basis. The Gaussian basis functions make it possible to use the efficient anal. integration schemes and screening algorithms of quantum chem. Novel recursion relations are developed for the calcn. of the matrix elements of the d.-dependent Kohn-Sham self-consistent potential. At the same time the use of a plane wave basis for the electron d. permits efficient calcn. of the Hartree energy using fast Fourier transforms, thus circumventing one of the major bottlenecks of std. Gaussian based calcns. Furthermore, this algorithm avoids the fitting procedures that go along with intermediate basis sets for the charge d. The performance and accuracy of this new scheme are discussed and selected examples are given.
- 68Perdew, J. P.; Burke, K.; Ernzerhof, M. Generalized Gradient Approximation Made Simple. Phys. Rev. Lett. 1996, 77, 3865– 3868, DOI: 10.1103/PhysRevLett.77.3865Google Scholar68Generalized gradient approximation made simplePerdew, John P.; Burke, Kieron; Ernzerhof, MatthiasPhysical Review Letters (1996), 77 (18), 3865-3868CODEN: PRLTAO; ISSN:0031-9007. (American Physical Society)Generalized gradient approxns. (GGA's) for the exchange-correlation energy improve upon the local spin d. (LSD) description of atoms, mols., and solids. We present a simple derivation of a simple GGA, in which all parameters (other than those in LSD) are fundamental consts. Only general features of the detailed construction underlying the Perdew-Wang 1991 (PW91) GGA are invoked. Improvements over PW91 include an accurate description of the linear response of the uniform electron gas, correct behavior under uniform scaling, and a smoother potential.
- 69Goedecker, S.; Teter, M.; Hutter, J. Separable dual-space Gaussian pseudopotentials. Phys. Rev. B: Condens. Matter Mater. Phys. 1996, 54, 1703– 1710, DOI: 10.1103/PhysRevB.54.1703Google Scholar69Separable dual-space Gaussian pseudopotentialsGoedecker, S.; Teter, M.; Hutter, J.Physical Review B: Condensed Matter (1996), 54 (3), 1703-1710CODEN: PRBMDO; ISSN:0163-1829. (American Physical Society)We present pseudopotential coeffs. for the first two rows of the Periodic Table. The pseudopotential is of an analytic form that gives optimal efficiency in numerical calculations using plane waves as a basis set. At most, even coeffs. are necessary to specify its analytic form. It is separable and has optimal decay properties in both real and Fourier space. Because of this property, the application of the nonlocal part of the pseudopotential to a wave function can be done efficiently on a grid in real space. Real space integration is much faster for large systems than ordinary multiplication in Fourier space, since it shows only quadratic scaling with respect to the size of the system. We systematically verify the high accuracy of these pseudopotentials by extensive at. and mol. test calcns.
- 70Niklasson, A. M.; Tymczak, C.; Challacombe, M. Trace resetting density matrix purification in
self-consistent-field theory. J. Chem. Phys. 2003, 118, 8611– 8620, DOI: 10.1063/1.1559913
Google ScholarThere is no corresponding record for this reference. - 71Schütt, O. Enabling Large Scale DFT Simulation with GPU Acceleration and Machine Learning. Ph.D. Thesis, ETH Zürich, 2017.Google ScholarThere is no corresponding record for this reference.
Cited By
Smart citations by scite.ai include citation statements extracted from the full text of the citing article. The number of the statements may be higher than the number of citations provided by ACS Publications if one paper cites another multiple times or lower if scite has not yet processed some of the citing articles.
This article is cited by 37 publications.
- Fabian Schwarz, Senja Barthel, Amber Mace. Understanding Mobile Particles in Solid-State Materials: From the Perspective of Potential Energy Surfaces. Chemistry of Materials 2024, 36
(23)
, 11359-11376. https://doi.org/10.1021/acs.chemmater.4c01822
- Edoardo Cignoni, Divya Suman, Jigyasa Nigam, Lorenzo Cupellini, Benedetta Mennucci, Michele Ceriotti. Electronic Excited States from Physically Constrained Machine Learning. ACS Central Science 2024, 10
(3)
, 637-648. https://doi.org/10.1021/acscentsci.3c01480
- Alexander Hagg, Karl N. Kirschner. Open-Source Machine Learning in Computational Chemistry. Journal of Chemical Information and Modeling 2023, 63
(15)
, 4505-4532. https://doi.org/10.1021/acs.jcim.3c00643
- Bing Huang, O. Anatole von Lilienfeld. Ab Initio Machine Learning in Chemical Compound Space. Chemical Reviews 2021, 121
(16)
, 10001-10036. https://doi.org/10.1021/acs.chemrev.0c01303
- Peng-Jian Yang, Mahito Sugiyama, Koji Tsuda, Takeshi Yanai. Artificial Neural Networks Applied as Molecular Wave Function Solvers. Journal of Chemical Theory and Computation 2020, 16
(6)
, 3513-3529. https://doi.org/10.1021/acs.jctc.9b01132
- Pavlo O. Dral. Quantum Chemistry in the Age of Machine Learning. The Journal of Physical Chemistry Letters 2020, 11
(6)
, 2336-2347. https://doi.org/10.1021/acs.jpclett.9b03664
- Jeremy P. Coe. Machine Learning Configuration Interaction for ab Initio Potential Energy Curves. Journal of Chemical Theory and Computation 2019, 15
(11)
, 6179-6189. https://doi.org/10.1021/acs.jctc.9b00828
- Geng Sun, Philippe Sautet. Toward Fast and Reliable Potential Energy Surfaces for Metallic Pt Clusters by Hierarchical Delta Neural Networks. Journal of Chemical Theory and Computation 2019, 15
(10)
, 5614-5627. https://doi.org/10.1021/acs.jctc.9b00465
- Onur Çaylak, Anil Yaman, Björn Baumeier. Evolutionary Approach to Constructing a Deep Feedforward Neural Network for Prediction of Electronic Coupling Elements in Molecular Materials. Journal of Chemical Theory and Computation 2019, 15
(3)
, 1777-1784. https://doi.org/10.1021/acs.jctc.8b01285
- Haichen Li, Christopher Collins, Matteus Tanha, Geoffrey J. Gordon, David J. Yaron. A Density Functional Tight Binding Layer for Deep Learning of Chemical Hamiltonians. Journal of Chemical Theory and Computation 2018, 14
(11)
, 5764-5776. https://doi.org/10.1021/acs.jctc.8b00873
- Deming Xia, Fanqi Zeng, Wanting Chen, Hui Zhao, Hong-bin Xie, Jingwen Chen, Joseph S. Francisco. Accelerated peptide bond formation at air–water interfaces. Proceedings of the National Academy of Sciences 2025, 122
(12)
https://doi.org/10.1073/pnas.2501323122
- Diata Traore, Olivier Adjoua, César Feniou, Ioanna-Maria Lygatsika, Yvon Maday, Evgeny Posenitskiy, Kerstin Hammernik, Alberto Peruzzo, Julien Toulouse, Emmanuel Giner, Jean-Philip Piquemal. Shortcut to chemically accurate quantum computing via density-based basis-set correction. Communications Chemistry 2024, 7
(1)
https://doi.org/10.1038/s42004-024-01348-3
- Xiao Zhao, Jishi Wei, Tianbing Song, Zhengren Wang, Dawei Yang, Xirong Zhang, Feng Huo, Yanqiang Zhang, Huan-Ming Xiong. Computational insights into carbon dots: Evolution of structural models and structure–activity relationships. Chemical Engineering Journal 2024, 481 , 148779. https://doi.org/10.1016/j.cej.2024.148779
- Chao Xin, Bingqian Song, Guangyong Jin, Yongli Song, Feng Pan. Advancements in High‐Throughput Screening and Machine Learning Design for 2D Ferromagnetism: A Comprehensive Review. Advanced Theory and Simulations 2023, 6
(12)
https://doi.org/10.1002/adts.202300475
- Mohammad Haidar, Marko J. Rančić, Thomas Ayral, Yvon Maday, Jean‐Philip Piquemal. Open source variational quantum eigensolver extension of the quantum learning machine for quantum chemistry. WIREs Computational Molecular Science 2023, 13
(5)
https://doi.org/10.1002/wcms.1664
- Hyuk‐Yong Kwon, Gregory M. Curtin, Zachary Morrow, C. T. Kelley, Elena Jakubikova. Adaptive basis sets for practical quantum computing. International Journal of Quantum Chemistry 2023, 123
(14)
https://doi.org/10.1002/qua.27123
- William Dawson, Eisuke Kawashima, Laura E. Ratcliff, Muneaki Kamiya, Luigi Genovese, Takahito Nakajima. Complexity reduction in density functional theory: Locality in space and energy. The Journal of Chemical Physics 2023, 158
(16)
https://doi.org/10.1063/5.0142652
- Israel Perez. Ab initio
methods for the computation of physical properties and performance parameters of electrochemical energy storage devices. Physical Chemistry Chemical Physics 2023, 25
(3)
, 1476-1503. https://doi.org/10.1039/D2CP03611H
- Sergei V. Kalinin, Colin Ophus, Paul M. Voyles, Rolf Erni, Demie Kepaptsoglou, Vincenzo Grillo, Andrew R. Lupini, Mark P. Oxley, Eric Schwenker, Maria K. Y. Chan, Joanne Etheridge, Xiang Li, Grace G. D. Han, Maxim Ziatdinov, Naoya Shibata, Stephen J. Pennycook. Machine learning in scanning transmission electron microscopy. Nature Reviews Methods Primers 2022, 2
(1)
https://doi.org/10.1038/s43586-022-00095-w
- Abeer Abdulaziz Alarfaj, Hanan Ahmed Hosni Mahmoud. Feature Fusion Deep Learning Model for Defects Prediction in Crystal Structures. Crystals 2022, 12
(9)
, 1324. https://doi.org/10.3390/cryst12091324
- L. Fiedler, K. Shah, M. Bussmann, A. Cangi. Deep dive into machine learning density functional theory for materials science and chemistry. Physical Review Materials 2022, 6
(4)
https://doi.org/10.1103/PhysRevMaterials.6.040301
- Behnam Parsaeifard, Stefan Goedecker. Manifolds of quasi-constant SOAP and ACSF fingerprints and the resulting failure to machine learn four-body interactions. The Journal of Chemical Physics 2022, 156
(3)
https://doi.org/10.1063/5.0070488
- Jigyasa Nigam, Michael J. Willatt, Michele Ceriotti. Equivariant representations for molecular Hamiltonians and
N
-center atomic-scale properties. The Journal of Chemical Physics 2022, 156
(1)
https://doi.org/10.1063/5.0072784
- Ying Shi Teh, Swarnava Ghosh, Kaushik Bhattacharya. Machine-learned prediction of the electronic fields in a crystal. Mechanics of Materials 2021, 163 , 104070. https://doi.org/10.1016/j.mechmat.2021.104070
- Juan I. Gómez-Peralta, Nidia G. García-Peña, Xim Bokhimi. Crystal-Site-Based Artificial Neural Networks for Material Classification. Crystals 2021, 11
(9)
, 1039. https://doi.org/10.3390/cryst11091039
- Julia Westermayr, Michael Gastegger, Kristof T. Schütt, Reinhard J. Maurer. Perspective on integrating machine learning into computational chemistry and
materials science. The Journal of Chemical Physics 2021, 154
(23)
https://doi.org/10.1063/5.0047760
- Behnam Parsaeifard, Deb Sankar De, Anders S Christensen, Felix A Faber, Emir Kocer, Sandip De, Jörg Behler, O Anatole von Lilienfeld, Stefan Goedecker. An assessment of the structural resolution of various fingerprints commonly used in machine learning. Machine Learning: Science and Technology 2021, 2
(1)
, 015018. https://doi.org/10.1088/2632-2153/abb212
- Siwar Chibani, François-Xavier Coudert. Machine learning approaches for the prediction of materials properties. APL Materials 2020, 8
(8)
https://doi.org/10.1063/5.0018384
- Jorge Arturo Hernandez Zeledon, Aldo H. Romero, Pengju Ren, Xiaodong Wen, Yongwang Li, James P. Lewis. The structural information filtered features (SIFF) potential: Maximizing information stored in machine-learning descriptors for materials prediction. Journal of Applied Physics 2020, 127
(21)
https://doi.org/10.1063/5.0002252
- Thomas D. Kühne, Marcella Iannuzzi, Mauro Del Ben, Vladimir V. Rybkin, Patrick Seewald, Frederick Stein, Teodoro Laino, Rustam Z. Khaliullin, Ole Schütt, Florian Schiffmann, Dorothea Golze, Jan Wilhelm, Sergey Chulkov, Mohammad Hossein Bani-Hashemian, Valéry Weber, Urban Borštnik, Mathieu Taillefumier, Alice Shoshana Jakobovits, Alfio Lazzaro, Hans Pabst, Tiziano Müller, Robert Schade, Manuel Guidon, Samuel Andermatt, Nico Holmberg, Gregory K. Schenter, Anna Hehn, Augustin Bussy, Fabian Belleflamme, Gloria Tabacchi, Andreas Glöß, Michael Lass, Iain Bethune, Christopher J. Mundy, Christian Plessl, Matt Watkins, Joost VandeVondele, Matthias Krack, Jürg Hutter. CP2K: An electronic structure and molecular dynamics software package - Quickstep: Efficient and accurate electronic structure calculations. The Journal of Chemical Physics 2020, 152
(19)
https://doi.org/10.1063/5.0007045
- Sergei Manzhos. Machine learning for the solution of the Schrödinger equation. Machine Learning: Science and Technology 2020, 1
(1)
, 013002. https://doi.org/10.1088/2632-2153/ab7d30
- Tudor Luca Mitran, George Alexandru Nemnes. Prediction of Energy Gaps in Graphene—Hexagonal Boron Nitride Nanoflakes Using Artificial Neural Networks. 2020, 197-209. https://doi.org/10.1007/978-3-030-37790-8_11
- Pavlo O. Dral. Quantum chemistry assisted by machine learning. 2020, 291-324. https://doi.org/10.1016/bs.aiq.2020.05.002
- Harper R. Grimsley, Sophia E. Economou, Edwin Barnes, Nicholas J. Mayhall. An adaptive variational algorithm for exact molecular simulations on a quantum computer. Nature Communications 2019, 10
(1)
https://doi.org/10.1038/s41467-019-10988-2
- Guillaume Fraux, Siwar Chibani, François-Xavier Coudert. Modelling of framework materials at multiple scales: current practices and open questions. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 2019, 377
(2149)
, 20180220. https://doi.org/10.1098/rsta.2018.0220
- Gabriel R Schleder, Antonio C M Padilha, Carlos Mera Acosta, Marcio Costa, Adalberto Fazzio. From DFT to machine learning: recent approaches to materials science–a review. Journal of Physics: Materials 2019, 2
(3)
, 032001. https://doi.org/10.1088/2515-7639/ab084b
- Andreas Lynge Vishart, Nicolai Ree, Kurt V. Mikkelsen. Graphical user interface for an easy and reliable construction of input files to CP2K. Journal of Molecular Modeling 2019, 25
(5)
https://doi.org/10.1007/s00894-019-3987-6
Article Views are the COUNTER-compliant sum of full text article downloads since November 2008 (both PDF and HTML) across all institutions and individuals. These metrics are regularly updated to reflect usage leading up to the last few days.
Citations are the number of other articles citing this article, calculated by Crossref and updated daily. Find more information about Crossref citation counts.
The Altmetric Attention Score is a quantitative measure of the attention that a research article has received online. Clicking on the donut icon will load a page at altmetric.com with additional details about the score and the social media presence for the given article. Find more information on the Altmetric Attention Score and how the score is calculated.
Recommended Articles
Abstract
Figure 1
Figure 1. Overview of the PAO-ML scheme for using the potential parametrization and machine learning to calculate the PAO basis from given atomic positions.
Figure 2
Figure 2. Learning curve showing the decreasing error of PAO-ML (blue) with increased training set size. For comparison the error of a variationally optimized PAO basis (green) and a traditional minimal SZV-MOLOPT-GTH (red) basis set are shown. With very little training data, the variational limit is approached by the ML method.
Figure 3
Figure 3. Energy fluctuation during a series of MD simulation of a water dimer using the PAO-ML scheme. The simulations were conducted in the NVE ensemble using different time steps Δt to demonstrate the consistency of the forces and thus the controllability of the integration error.
Figure 4
Figure 4. Shown are oxygen–oxygen pair correlation functions for liquid water at 300 K. As reference the experimental (green, ref (63)) and TZV2P-MOLOPT-GTH basis sets (blue) results are shown. The SZV-MOLOPT-GTH curve (red) and DFTB (orange) are examples of results typically obtained from a minimal basis sets. The adaptive basis set PAO-ML (black) reproduces the reference (TZV2P) better than any of the alternative minimal basis set methods.
References
This article references 71 other publications.
- 1Goedecker, S. Linear scaling electronic structure methods. Rev. Mod. Phys. 1999, 71, 1085, DOI: 10.1103/RevModPhys.71.10851Linear scaling electronic structure methodsGoedecker, StefanReviews of Modern Physics (1999), 71 (4), 1085-1123CODEN: RMPHAT; ISSN:0034-6861. (American Physical Society)A review with many refs. Methods exhibiting linear scaling with respect to the size of the system, the so-called O(N) methods, are an essential tool for the calcn. of the electronic structure of large systems contg. many atoms. They are based on algorithms that take advantage of the decay properties of the d. matrix. In this article the phys. decay properties of the d. matrix are be studied for both metals and insulators. Several strategies for constructing O(N) algorithms are presented and critically examd. Some issues that are relevant only for self-consistent O(N) methods, such as the calcn. of the Hartree potential and mixing issues, are also discussed. Some typical applications of O(N) methods are briefly described.
- 2Bowler, D. R.; Miyazaki, T.
methods in electronic structure calculations. Rep. Prog. Phys. 2012, 75, 036503, DOI: 10.1088/0034-4885/75/3/036503
2(N) methods in electronic structure calculationsBowler, D. R.; Miyazaki, T.Reports on Progress in Physics (2012), 75 (3), 036503/1-036503/43CODEN: RPPHAG; ISSN:0034-4885. (Institute of Physics Publishing)A review. Linear-scaling methods, or O(N) methods, have computational and memory requirements which scale linearly with the no. of atoms in the system, N, in contrast to std. approaches which scale with the cube of the no. of atoms. These methods, which rely on the short-ranged nature of electronic structure, will allow accurate, ab initio simulations of systems of unprecedented size. The theory behind the locality of electronic structure is described and related to phys. properties of systems to be modelled, along with a survey of recent developments in real-space methods which are important for efficient use of high-performance computers. The linear-scaling methods proposed to date can be divided into seven different areas, and the applicability, efficiency and advantages of the methods proposed in these areas are then discussed. The applications of linear-scaling methods, as well as the implementations available as computer programs, are considered. Finally, the prospects for and the challenges facing linear-scaling methods are discussed. - 3VandeVondele, J.; Borštnik, U.; Hutter, J. Linear Scaling Self-Consistent Field Calculations with Millions of Atoms in the Condensed Phase. J. Chem. Theory Comput. 2012, 8, 3565– 3573, DOI: 10.1021/ct200897x3Linear Scaling Self-Consistent Field Calculations with Millions of Atoms in the Condensed PhaseVandeVondele, Joost; Borstnik, Urban; Hutter, JurgJournal of Chemical Theory and Computation (2012), 8 (10), 3565-3573CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)The applicability and performance of a linear scaling algorithm is investigated for three-dimensional condensed phase systems. A simple but robust approach based on the matrix sign function is employed together with a thresholding matrix multiplication that does not require a prescribed sparsity pattern. Semiempirical methods and d. functional theory have been tested. We demonstrate that self-consistent calcns. with 1 million atoms are feasible for simple systems. With this approach, the computational cost of the calcn. depends strongly on basis set quality. In the current implementation, high quality calcns. for dense systems are limited to a few hundred thousand atoms. We report on the sparsities of the involved matrixes as obtained at convergence and for intermediate iterations. We investigate how detg. the chem. potential impacts the computational cost for very large systems.
- 4Bowler, D. R.; Miyazaki, T. Calculations for millions of atoms with density functional theory: linear scaling shows its potential. J. Phys.: Condens. Matter 2010, 22, 074207, DOI: 10.1088/0953-8984/22/7/0742074Calculations for millions of atoms with density functional theory: linear scaling shows its potentialBowler, D. R.; Miyazaki, T.Journal of Physics: Condensed Matter (2010), 22 (7), 074207/1-0074207/6CODEN: JCOMEL; ISSN:0953-8984. (Institute of Physics Publishing)An overview of the CONQUEST linear scaling d. functional theory (DFT) code is given, focusing particularly on the scaling behavior on modern high-performance computing platforms. We demonstrate that essentially perfect linear scaling and weak parallel scaling (with fixed no. of atoms per processor core) can be achieved, and that DFT calcns. on millions of atoms are now possible.
- 5Mulliken, R. S. Criteria for the Construction of Good Self-Consistent-Field Molecular Orbital Wave Functions, and the Significance of LCAO-MO Population Analysis. J. Chem. Phys. 1962, 36, 3428– 3439, DOI: 10.1063/1.17324765Criteria for the construction of good self-consistent-field molecular orbital wave functions, and the significance of L.C.A.O.M.O. population analysisMulliken, R. S.Journal of Chemical Physics (1962), 36 (), 3428-39CODEN: JCPSA6; ISSN:0021-9606.Criteria for the optimal choice of finite linear combinations of Slater-type orbitals (S.T.O.) adequate to approx. closely S.C.F. M.O. were examd. in the light of computations on HF and other mols. Some aspects of the A.O. (generalized Heitler-London) method were discussed. The inherent limitations on the meaning of charges on atoms in a mol., or of degree of ionic character, were discussed. Unacceptable at. charges are found from an L.C.A.O. mol. orbital population analysis made on S.C.F. M.O. wave functions, approximated by using unbalanced S.T.O. sets, while acceptable results are obtained with judiciously balanced and sufficiently complete S.T.O. sets. The effects of insufficiently complete and unbalanced S.T.O. sets on the computed dipole moments of S.C.F. M.O. wave functions were discussed, with examples.
- 6Davidson, E. R. Electronic Population Analysis of Molecular Wavefunctions. J. Chem. Phys. 1967, 46, 3320– 3324, DOI: 10.1063/1.18412196Electronic population analysis of molecular wavefunctionsDavidson, Ernest RoyJournal of Chemical Physics (1967), 46 (9), 3320-4CODEN: JCPSA6; ISSN:0021-9606.A general method for carrying out population analysis of wavefunctions calcd. with arbitrary basis sets is presented. The difficulties in defining orbital populations is discussed. Results are presented for the sequence BF, CO, and N2 which indicate that back-transfer of charge in the π bond cancels the normal transfer of the charge in the σ bond. Contrary to popular belief, the more electroneg. element has the larger degree of hybridization in each case. The amt. of promotion of 2s electrons is greater on the less-electroneg. element, as expected.
- 7Roby, K. R. Quantum theory of chemical valence concepts. Mol. Phys. 1974, 27, 81– 104, DOI: 10.1080/00268977400100071There is no corresponding record for this reference.
- 8Heinzmann, R.; Ahlrichs, R. Population analysis based on occupation numbers of modified atomic orbitals (MAOs). Theor. Chim. Acta. 1976, 42, 33– 45, DOI: 10.1007/BF005482898Population analysis based on occupation numbers of modified atomic orbitals (MAOs)Heinzmann, Rolf; Ahlrichs, ReinhartTheoretica Chimica Acta (1976), 42 (1), 33-45CODEN: TCHAAM; ISSN:0040-5744.A new interpretational scheme is proposed for the anal. of mol. wavefunctions. Starting from the mol. d. operator a minimal set of MAOs is constructed from the requirement that the MOs can be represented as closely as possible by the MAOs. The MAOs are used to compute at. occupation nos. N and shared electron nos. σ. The mol. d. is then discussed in terms of N and σ. This approach has the following advantages: (1) it is generally applicable, (2) the quantities N and σ are virtually basis set independent, (3) the quantities N and σ fulfil the intuitively expected boundary conditions, (4) the simultaneous consideration of N and σ allows for a more reliable description of chem. bonding than consideration of at. charges only.
- 9Ehrhardt, C.; Ahlrichs, R. Population analysis based on occupation numbers II. Relationship between shared electron numbers and bond energies and characterization of hypervalent contributions. Theor. Chim. Acta. 1985, 68, 231– 245, DOI: 10.1007/BF005267749Population analysis based on occupation numbers. II. Relationship between shared electron numbers and bond energies and characterization of hypervalent contributionsEhrhardt, Claus; Ahlrichs, ReinhartTheoretica Chimica Acta (1985), 68 (3), 231-45CODEN: TCHAAM; ISSN:0040-5744.The population anal. based on occupation nos. is briefly reviewed. A new way is proposed to det. modified AOs and to characterize hypervalent contributions. This is discussed in application to the mols. NSF, NSF3, SF6, OPCl, OPCl2, O2PCl, SO2, and ClO4-. The connection was studied between shared electron nos. σ - considered as a measure of covalent bond strength - and bond energies. The σ is a reliable measure of bond energies.
- 10Reed, A. E.; Curtiss, L. A.; Weinhold, F. Intermolecular interactions from a natural bond orbital, donor-acceptor viewpoint. Chem. Rev. 1988, 88, 899– 926, DOI: 10.1021/cr00088a00510Intermolecular interactions from a natural bond orbital, donor-acceptor viewpointReed, Alan E.; Curtiss, Larry A.; Weinhold, FrankChemical Reviews (Washington, DC, United States) (1988), 88 (6), 899-926CODEN: CHREAY; ISSN:0009-2665.A review with >135 refs. is given with discussion of natural bond orbital (NBO) anal., intermol. interaction models based on NBO anal., relation of donor-acceptor and electrostatic models.
- 11Lee, M. S.; Head-Gordon, M. Extracting polarized atomic orbitals from molecular orbital calculations. Int. J. Quantum Chem. 2000, 76, 169– 184, DOI: 10.1002/(SICI)1097-461X(2000)76:2<169::AID-QUA7>3.0.CO;2-G11Extracting polarized atomic orbitals from molecular orbital calculationsLee, Michael S.; Head-Gordon, MartinInternational Journal of Quantum Chemistry (2000), 76 (2), 169-184CODEN: IJQCB2; ISSN:0020-7608. (John Wiley & Sons, Inc.)We present a class of methods for extg. a polarized AO (EPAO) minimal basis set from a converged MO calcn. Unlike minimal basis sets obtained from previous approaches, EPAOs rigorously contain the occupied MO space. EPAOs achieve this exactness because their spatial extent is not restricted. Nonetheless, EPAOs are optimally localized with respect to a localization criterion and are essentially single-centered. EPAOs provide an alternative scheme for partitioning the electron d. into at. subspaces. Therefore, they can be used to det. at. and chem. group properties such as charge populations. Since EPAOs provide a compact description to the occupied space, they may have other computational applications such as in local correlation methods. Addnl., the EPAOs give a description of valence antibonding orbitals that may be appropriate for nondynamical electron correlation.
- 12Mayer, I. Orthogonal effective atomic orbitals in the topological theory of atoms. Can. J. Chem. 1996, 74, 939– 942, DOI: 10.1139/v96-103There is no corresponding record for this reference.
- 13Cioslowski, J.; Liashenko, A. Atomic orbitals in molecules. J. Chem. Phys. 1998, 108, 4405– 4412, DOI: 10.1063/1.475853There is no corresponding record for this reference.
- 14Lu, W. C.; Wang, C. Z.; Schmidt, M. W.; Bytautas, L.; Ho, K. M.; Ruedenberg, K. Molecule intrinsic minimal basis sets. I. Exact resolution of ab initio optimized molecular orbitals in terms of deformed atomic minimal-basis orbitals. J. Chem. Phys. 2004, 120, 2629– 2637, DOI: 10.1063/1.163873114Molecule intrinsic minimal basis sets. I. Exact resolution of ab initio optimized molecular orbitals in terms of deformed atomic minimal-basis orbitalsLu, W. C.; Wang, C. Z.; Schmidt, M. W.; Bytautas, L.; Ho, K. M.; Ruedenberg, K.Journal of Chemical Physics (2004), 120 (6), 2629-2637CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)A method is presented for expressing the occupied SCF orbitals of a mol. exactly in terms of chem. deformed at. minimal-basis-set orbitals that deviate as little as possible from free-atom SCF minimal-basis orbitals. The MOs referred to are the exact SCF orbitals, the free-atom orbitals referred to are the exact at. SCF orbitals, and the formulation of the deformed "quasiat. minimal-basis-sets" is independent of the calculational AO basis used. The resulting resoln. of MOs in terms of quasiat. minimal basis set orbitals is therefore intrinsic to the exact mol. wave functions. The deformations are analyzed in terms of interat. contributions. The Mulliken population anal. is formulated in terms of the quasiat. minimal-basis orbitals. In the virtual SCF orbital space the method leads to a quant. ab initio formulation of the qual. model of virtual valence orbitals, which are useful for calcg. electron correlation and the interpretation of reactions. The method is applicable to Kohn-Sham d. functional theory orbitals and is easily generalized to valence MCSCF orbitals.
- 15Laikov, D. N. Intrinsic minimal atomic basis representation of molecular electronic wavefunctions. Int. J. Quantum Chem. 2011, 111, 2851– 2867, DOI: 10.1002/qua.2276715Intrinsic minimal atomic basis representation of molecular electronic wavefunctionsLaikov, Dimitri N.International Journal of Quantum Chemistry (2011), 111 (12), 2851-2867CODEN: IJQCB2; ISSN:0020-7608. (John Wiley & Sons, Inc.)The problem of finding an effective minimal at. basis that spans the exact occupied wavefunctions of a mean-field theory at a given mol. geometry, which has a no. of special properties, is studied and a new general procedure is developed that (1) solves for a raw minimal set of strongly atom-centered functions-products of spherical harmonics and mol.-optimized radial parts-that approx. span the occupied mol. wavefunctions and minimize the sum of their energies, (2) uses projection operators to get a new set of deformed atom-centered functions that exactly span the occupied space and fall into core and valence subsets, (3) applies a new zero-bond-dipole orthogonalization scheme to the core-orthogonalized valence subset so that for each two-center product of these functions the projection of its dipole moment along the line going through the two centers is zero. The resulting effective minimal at. basis is intrinsic to the mol. problem and does not need a free-atoms input. Some interesting features of the zero-bond-dipole orthogonalization are showing up in the at. population anal. of a diverse set of mols. The new procedure may be useful for the interpretation of electronic structure, for the construction of model Hamiltonians in terms of transferable mol. integrals, and for the definition of active valence space in the treatment of electron correlation. © 2010 Wiley Periodicals, Inc. Int J Quantum Chem, 2011.
- 16Knizia, G. Intrinsic Atomic Orbitals: An Unbiased Bridge between Quantum Theory and Chemical Concepts. J. Chem. Theory Comput. 2013, 9, 4834– 4843, DOI: 10.1021/ct400687b16Intrinsic Atomic Orbitals: An Unbiased Bridge between Quantum Theory and Chemical ConceptsKnizia, GeraldJournal of Chemical Theory and Computation (2013), 9 (11), 4834-4843CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)Modern quantum chem. can make quant. predictions on an immense array of chem. systems. However, the interpretation of those predictions is often complicated by the complex wave function expansions used. Here we show that an exceptionally simple algebraic construction allows for defining at. core and valence orbitals, polarized by the mol. environment, which can exactly represent SCF wave functions. This construction provides an unbiased and direct connection between quantum chem. and empirical chem. concepts, and can be used, for example, to calc. the nature of bonding in mols., in chem. terms, from first principles. In particular, we find consistency with electronegativities (χ), C 1s core-level shifts, resonance substituent parameters (σR), Lewis structures, and oxidn. states of transition-metal complexes.
- 17Adams, W. H. On the Solution of the Hartree-Fock Equation in Terms of Localized Orbitals. J. Chem. Phys. 1961, 34, 89– 102, DOI: 10.1063/1.173162217The solution of the Hartree-Fock equation in terms of localized orbitalsAdams, William H.Journal of Chemical Physics (1961), 34 (), 89-102CODEN: JCPSA6; ISSN:0021-9606.The Hartree-Fock method was discussed, with emphasis on the transformation properties of the Hartree-Fock equation. This equation can be solved in terms of nonorthogonal one-electron functions. In some cases it may be more convenient to choose such solns. Equations were developed which define the localized one-electron functions. For a system of closedshell atoms or ions, it was suggested that the localized orbitals of each atom or ion can be expanded in terms of functions centered on its nucleus. This was based on the success of the ionic theory of crystals. Because of the symmetry of a crystal, use of the localized orbitals could lead to expressions for the 1st-order, Hartree-Fock d. matrix and the Hartree-Fock energy of a crystal.
- 18Adams, W. H. Orbital Theories of Electronic Structure. J. Chem. Phys. 1962, 37, 2009– 2018, DOI: 10.1063/1.1733420There is no corresponding record for this reference.
- 19Adams, W. Distortion of interacting atoms and ions. Chem. Phys. Lett. 1971, 12, 295– 298, DOI: 10.1016/0009-2614(71)85068-6There is no corresponding record for this reference.
- 20Lee, M. S.; Head-Gordon, M. Polarized atomic orbitals for self-consistent field electronic structure calculations. J. Chem. Phys. 1997, 107, 9085– 9095, DOI: 10.1063/1.47519920Polarized atomic orbitals for self-consistent field electronic structure calculationsLee, Michael S.; Head-Gordon, MartinJournal of Chemical Physics (1997), 107 (21), 9085-9095CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)We present a new SCF approach which, given a large "secondary" basis set of AOs, variationally optimizes MOs in terms of a small "primary" basis set of distorted AOs, which are simultaneously optimized. If the primary basis is taken as a minimal basis, the resulting functions are termed polarized AOs (PAO's) because they are valence (or core) AOs which have distorted or polarized in an optimal way for their mol. environment. The PAO's derive their flexibility from the fact that they are formed from atom-centered linear-combinations of the larger set of secondary AOs. The variational conditions satisfied by PAO's are defined, and an iterative method for performing a PAO-SCF calcn. is introduced. We compare the PAO-SCF approach against full SCF calcns. for the energies, dipoles, and mol. geometries of various mols. The PAO's are potentially useful for studying large systems that are currently intractable with larger than minimal basis sets, as well as offering potential interpretative benefits relative to calcns. in extended basis sets.
- 21Lee, M. S.; Head-Gordon, M. Absolute and relative energies from polarized atomic orbital self-consistent field calculations and a second order correction.: Convergence with size and composition of the secondary basis. Comput. Chem. 2000, 24, 295– 301, DOI: 10.1016/S0097-8485(99)00086-821Absolute and relative energies from polarized atomic orbital self-consistent field calculations and a second order correction. Convergence with size and composition of the secondary basisLee, Michael S.; Head-Gordon, MartinComputers & Chemistry (Oxford) (2000), 24 (3,4), 295-301CODEN: COCHDK; ISSN:0097-8485. (Elsevier Science Ltd.)Polarized AOs (PAO's) are mol.-adapted minimal basis functions that are variationally obtained as an atom-blocked transformation from a conventional extended basis set, as a Hartree-Fock calcn. is performed in the PAO basis. This approxn. yields a higher energy than a HF calcn. performed in the extended basis, although the two results converge to the same limit as the extended basis approaches completeness on each atom. To test the rate of convergence, PAO-HF calcns. were performed using cc-pVXZ and aug-cc-pVXZ basis sets for the water monomer and dimer, and six substituted ethylenes. The results show that the quality of PAO calcns. converges smoothly with X. The use of augmented functions is recommended. To correct a PAO-HF calcn. for residual deficiencies, a noniterative second order correction is introduced. This correction corresponds to an energy-weighted steepest descent step, and substantially improves the quality of PAO energies.
- 22Berghold, G.; Parrinello, M.; Hutter, J. Polarized atomic orbitals for linear scaling methods. J. Chem. Phys. 2002, 116, 1800– 1810, DOI: 10.1063/1.1431270There is no corresponding record for this reference.
- 23Bowler, D. R.; Miyazaki, T.; Gillan, M. J. Recent progress in linear scaling ab initio electronic structure techniques. J. Phys.: Condens. Matter 2002, 14, 2781, DOI: 10.1088/0953-8984/14/11/30323Recent progress in linear scaling ab initio electronic structure techniquesBowler, D. R.; Miyazaki, T.; Gillan, M. J.Journal of Physics: Condensed Matter (2002), 14 (11), 2781-2798CODEN: JCOMEL; ISSN:0953-8984. (Institute of Physics Publishing)A review. We describe recent progress in developing linear scaling ab initio electronic structure methods, referring in particular to our highly parallel code CONQUEST. After reviewing the state of the field, we present the basic ideas underlying almost all linear scaling methods, and discuss specific practical details of the implementation. We also note the connection between linear scaling methods and embedding techniques.
- 24Torralba, A. S.; Todorović, M.; Brázdová, V.; Choudhury, R.; Miyazaki, T.; Gillan, M. J.; Bowler, D. R. Pseudo-atomic orbitals as basis sets for the
DFT code CONQUEST. J. Phys.: Condens. Matter 2008, 20 (29), 294206, DOI: 10.1088/0953-8984/20/29/294206
24Pseudo-atomic orbitals as basis sets for the O(N) DFT code CONQUESTTorralba, A. S.; Todorovic, M.; Brazdova, V.; Choudhury, R.; Miyazaki, T.; Gillan, M. J.; Bowler, D. R.Journal of Physics: Condensed Matter (2008), 20 (29), 294206/1-294206/8CODEN: JCOMEL; ISSN:0953-8984. (Institute of Physics Publishing)Various aspects of the implementation of pseudo-AOs (PAOs) as basis functions for the linear scaling CONQUEST code are presented. Preliminary results for the assignment of a large set of PAOs to a smaller space of support functions are encouraging, and an important related proof on the necessary symmetry of the support functions is shown. Details of the generation and integration schemes for the PAOs are also given. - 25Skylaris, C.-K.; Haynes, P. D.; Mostofi, A. A.; Payne, M. C. Introducing ONETEP: Linear-scaling density functional simulations on parallel computers. J. Chem. Phys. 2005, 122, 084119, DOI: 10.1063/1.183985225Introducing ONETEP: Linear-scaling density functional simulations on parallel computersSkylaris, Chris-Kriton; Haynes, Peter D.; Mostofi, Arash A.; Payne, Mike C.Journal of Chemical Physics (2005), 122 (8), 084119/1-084119/10CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)We present ONETEP (order-N electronic total energy package), a d. functional program for parallel computers whose computational cost scales linearly with the no. of atoms and the no. of processors. ONETEP is based on our reformulation of the plane wave pseudopotential method which exploits the electronic localization that is inherent in systems with a nonvanishing band gap. We summarize the theor. developments that enable the direct optimization of strictly localized quantities expressed in terms of a delocalized plane wave basis. These same localized quantities lead us to a phys. way of dividing the computational effort among many processors to allow calcns. to be performed efficiently on parallel supercomputers. We show with examples that ONETEP achieves excellent speedups with increasing nos. of processors and confirm that the time taken by ONETEP as a function of increasing no. of atoms for a given no. of processors is indeed linear. What distinguishes our approach is that the localization is achieved in a controlled and math. consistent manner so that ONETEP obtains the same accuracy as conventional cubic-scaling plane wave approaches and offers fast and stable convergence. We expect that calcns. with ONETEP have the potential to provide quant. theor. predictions for problems involving thousands of atoms such as those often encountered in nanoscience and biophysics.
- 26Skylaris, C.-K.; Mostofi, A. A.; Haynes, P. D.; Diéguez, O.; Payne, M. C. Nonorthogonal generalized Wannier function pseudopotential plane-wave method. Phys. Rev. B: Condens. Matter Mater. Phys. 2002, 66, 035119, DOI: 10.1103/PhysRevB.66.03511926Nonorthogonal generalized Wannier function pseudopotential plane-wave methodSkylaris, Chris-Kriton; Mostofi, Arash A.; Haynes, Peter D.; Dieguez, Oswaldo; Payne, Mike C.Physical Review B: Condensed Matter and Materials Physics (2002), 66 (3), 035119/1-035119/12CODEN: PRBMDO; ISSN:0163-1829. (American Physical Society)We present a reformulation of the plane-wave pseudopotential method for insulators. This new approach allows us to perform d.-functional calcns. by solving directly for "nonorthogonal generalized Wannier functions" rather than extended Bloch states. We outline the theory on which our method is based and present test calcns. on a variety of systems. Comparison of our results with a std. plane-wave code shows that they are equiv. Apart from the usual advantages of the plane-wave approach such as the applicability to any lattice symmetry and the high accuracy, our method also benefits from the localization properties of our functions in real space. The localization of all our functions greatly facilitates the future extension of our method to linear-scaling schemes or calcns. of the elec. polarization of cryst. insulators.
- 27Mohr, S.; Ratcliff, L. E.; Genovese, L.; Caliste, D.; Boulanger, P.; Goedecker, S.; Deutsch, T. Accurate and efficient linear scaling DFT calculations with universal applicability. Phys. Chem. Chem. Phys. 2015, 17, 31360– 31370, DOI: 10.1039/C5CP00437C27Accurate and efficient linear scaling DFT calculations with universal applicabilityMohr, Stephan; Ratcliff, Laura E.; Genovese, Luigi; Caliste, Damien; Boulanger, Paul; Goedecker, Stefan; Deutsch, ThierryPhysical Chemistry Chemical Physics (2015), 17 (47), 31360-31370CODEN: PPCPFQ; ISSN:1463-9076. (Royal Society of Chemistry)D. functional theory calcns. are computationally extremely expensive for systems contg. many atoms due to their intrinsic cubic scaling. This fact has led to the development of so-called linear scaling algorithms during the last few decades. In this way it becomes possible to perform ab initio calcns. for several tens of thousands of atoms within reasonable walltimes. However, even though the use of linear scaling algorithms is phys. well justified, their implementation often introduces some small errors. Consequently most implementations offering such a linear complexity either yield only a limited accuracy or, if one wants to go beyond this restriction, require a tedious fine tuning of many parameters. In our linear scaling approach within the BigDFT package, we were able to overcome this restriction. Using an ansatz based on localized support functions expressed in an underlying Daubechies wavelet basis - which offers ideal properties for accurate linear scaling calcns. - we obtain an amazingly high accuracy and a universal applicability while still keeping the possibility of simulating large system with linear scaling walltimes requiring only a moderate demand of computing resources. We prove the effectiveness of our method on a wide variety of systems with different boundary conditions, for single-point calcns. as well as for geometry optimizations and mol. dynamics.
- 28Mohr, S.; Ratcliff, L. E.; Boulanger, P.; Genovese, L.; Caliste, D.; Deutsch, T.; Goedecker, S. Daubechies wavelets for linear scaling density functional theory. J. Chem. Phys. 2014, 140, 204110, DOI: 10.1063/1.487187628Daubechies wavelets for linear scaling density functional theoryMohr, Stephan; Ratcliff, Laura E.; Boulanger, Paul; Genovese, Luigi; Caliste, Damien; Deutsch, Thierry; Goedecker, StefanJournal of Chemical Physics (2014), 140 (20), 204110/1-204110/16CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)We demonstrate that Daubechies wavelets can be used to construct a minimal set of optimized localized adaptively contracted basis functions in which the Kohn-Sham orbitals can be represented with an arbitrarily high, controllable precision. Ground state energies and the forces acting on the ions can be calcd. in this basis with the same accuracy as if they were calcd. directly in a Daubechies wavelets basis, provided that the amplitude of these adaptively contracted basis functions is sufficiently small on the surface of the localization region, which is guaranteed by the optimization procedure described in this work. This approach reduces the computational costs of d. functional theory calcns., and can be combined with sparse matrix algebra to obtain linear scaling with respect to the no. of electrons in the system. Calcns. on systems of 10 000 atoms or more thus become feasible in a systematic basis set with moderate computational resources. Further computational savings can be achieved by exploiting the similarity of the adaptively contracted basis functions for closely related environments, e.g., in geometry optimizations or combined calcns. of neutral and charged systems. (c) 2014 American Institute of Physics.
- 29Ozaki, T. Variationally optimized atomic orbitals for large-scale electronic structures. Phys. Rev. B: Condens. Matter Mater. Phys. 2003, 67, 155108, DOI: 10.1103/PhysRevB.67.15510829Variationally optimized atomic orbitals for large-scale electronic structuresOzaki, T.Physical Review B: Condensed Matter and Materials Physics (2003), 67 (15), 155108/1-155108/5CODEN: PRBMDO; ISSN:0163-1829. (American Physical Society)A simple and practical method for variationally optimizing numerical AOs used in d. functional calcns. is presented based on the force theorem. The derived equation provides the same procedure for the optimization of AOs as that for the geometry optimization. The optimized orbitals well reproduce convergent results calcd. by a larger no. of unoptimized orbitals. In addn., we demonstrate that the optimized orbitals significantly reduce the computational effort in the geometry optimization, while keeping a high degree of accuracy.
- 30Ozaki, T.; Kino, H. Numerical atomic basis orbitals from H to Kr. Phys. Rev. B: Condens. Matter Mater. Phys. 2004, 69, 195113, DOI: 10.1103/PhysRevB.69.19511330Numerical atomic basis orbitals from H to KrOzaki, T.; Kino, H.Physical Review B: Condensed Matter and Materials Physics (2004), 69 (19), 195113/1-195113/19CODEN: PRBMDO; ISSN:0163-1829. (American Physical Society)We present a systematic study for numerical at. basis orbitals ranging from H to Kr, which could be used in large scale O(N) electronic structure calcns. based on d.-functional theories (DFT). The comprehensive investigation of convergence properties with respect to our primitive basis orbitals provides a practical guideline in an optimum choice of basis sets for each element, which well balances the computational efficiency and accuracy. Moreover, starting from the primitive basis orbitals, a simple and practical method for variationally optimizing basis orbitals is presented based on the force theorem, which enables us to maximize both the computational efficiency and accuracy. The optimized orbitals well reproduce convergent results calcd. by a larger no. of primitive orbitals. As illustrations of the orbital optimization, we demonstrate two examples: the geometry optimization coupled with the orbital optimization of a C60 mol. and the preorbital optimization for a specific group such as proteins. They clearly show that the optimized orbitals significantly reduce the computational efforts, while keeping a high degree of accuracy, thus indicating that the optimized orbitals are quite suitable for large scale DFT calcns.
- 31Junquera, J.; Paz, O.; Sánchez-Portal, D.; Artacho, E. Numerical atomic orbitals for linear-scaling calculations. Phys. Rev. B: Condens. Matter Mater. Phys. 2001, 64, 235111, DOI: 10.1103/PhysRevB.64.23511131Numerical atomic orbitals for linear-scaling calculationsJunquera, Javier; Paz, Oscar; Sanchez-Portal, Daniel; Artacho, EmilioPhysical Review B: Condensed Matter and Materials Physics (2001), 64 (23), 235111/1-235111/9CODEN: PRBMDO; ISSN:0163-1829. (American Physical Society)The performance of basis sets made of numerical AOs is explored in d.-functional calcns. of solids and mols. With the aim of optimizing basis quality while maintaining strict localization of the orbitals, as needed for linear-scaling calcns., several schemes have been tried. The best performance is obtained for the basis sets generated according to a new scheme presented here, a flexibilization of previous proposals. Strict localization is maintained while ensuring the continuity of the basis-function deriv. at the cutoff radius. The basis sets are tested vs. converged plane-wave calcns. on a significant variety of systems, including covalent, ionic, and metallic. Satisfactory convergence is obtained for reasonably small basis sizes, with a clear improvement over previous schemes. The transferability of the obtained basis sets is tested in several cases and it is found to be satisfactory as well.
- 32Basanta, M.; Dappe, Y.; Jelínek, P.; Ortega, J. Optimized atomic-like orbitals for first-principles tight-binding molecular dynamics. Comput. Mater. Sci. 2007, 39, 759– 766, DOI: 10.1016/j.commatsci.2006.09.00332Optimized atomic-like orbitals for first-principles tight-binding molecular dynamicsBasanta, M. A.; Dappe, Y. J.; Jelinek, P.; Ortega, J.Computational Materials Science (2007), 39 (4), 759-766CODEN: CMMSEM; ISSN:0927-0256. (Elsevier B.V.)We analyze the optimization of at.-like minimal basis sets for the hydrocarbons and for materials made up only of C atoms, e.g. C-nanotubes. In our approach the optimized numerical AOs (NAOs) are obtained as a linear combination of only two primitive NAOs. We find that the optimized basis sets yield an important lowering of the total energy, and bond lengths in very good agreement with the exptl. evidence. Also, we find that a good "universal" minimal basis set for the hydrocarbons and C-only materials can be obtained using this simple method. The approach discussed is a promising tool for the simulation of complex org. materials, beyond the hydrocarbons, using optimized minimal basis sets.
- 33Rayson, M. J.; Briddon, P. R. Highly efficient method for Kohn-Sham density functional calculations of 500–10000 atom systems. Phys. Rev. B: Condens. Matter Mater. Phys. 2009, 80, 205104, DOI: 10.1103/PhysRevB.80.20510433Highly efficient method for Kohn-Sham density functional calculations of 500-10 000 atom systemsRayson, M. J.; Briddon, P. R.Physical Review B: Condensed Matter and Materials Physics (2009), 80 (20), 205104/1-205104/11CODEN: PRBMDO; ISSN:1098-0121. (American Physical Society)A method for the soln. of the self-consistent Kohn-Sham equations using Gaussian-type orbitals is presented. Accurate relative energies and forces are demonstrated to be achievable at a fraction of the computational expense for large systems. With this approach calcns. involving around 1000 atoms can easily be performed with a serial desktop computer and ∼10 000 atom systems are within reach of relatively modest parallel computational resources. The method is applicable to arbitrary systems including metals. The approach generates a minimal basis on the fly while retaining the accuracy of the large underpinning basis set. Convergence of energies and forces are given for clusters as well as cubic cells of silicon and aluminum, for which the formation energies of defects are calcd. in systems up to 8000 and 4000 atoms, resp. For these systems the method exhibits linear scaling with the no. of atoms in the presently important size range of ∼500-3000 atoms.
- 34Rayson, M. Rapid filtration algorithm to construct a minimal basis on the fly from a primitive Gaussian basis. Comput. Phys. Commun. 2010, 181, 1051– 1056, DOI: 10.1016/j.cpc.2010.02.01234Rapid filtration algorithm to construct a minimal basis on the fly from a primitive Gaussian basisRayson, M. J.Computer Physics Communications (2010), 181 (6), 1051-1056CODEN: CPHCBZ; ISSN:0010-4655. (Elsevier B.V.)In a recent work an implementation of filter diagonalization with localization constraints was shown to provide an accurate and highly efficient method to solve the Kohn-Sham equations using a primitive Gaussian basis for systems contg. several thousand electrons. In this work, an alternative filtration algorithm is proposed, based on a rational approxn. to the filtration function, to replace the kernel of this algorithm. This approach is considerably faster than the diagonalization approach used previously and also its performance is largely independent of the filtration temp., aiding a more flexible approach to the construction of filtered basis sets.
- 35Nakata, A.; Bowler, D. R.; Miyazaki, T. Efficient Calculations with Multisite Local Orbitals in a Large-Scale DFT Code CONQUEST. J. Chem. Theory Comput. 2014, 10, 4813– 4822, DOI: 10.1021/ct500493435Efficient Calculations with Multisite Local Orbitals in a Large-Scale DFT Code CONQUESTNakata, Ayako; Bowler, David R.; Miyazaki, TsuyoshiJournal of Chemical Theory and Computation (2014), 10 (11), 4813-4822CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)Multisite local orbitals, which are formed from linear combinations of pseudoat. orbitals from a target atom and its neighbor atoms, have been introduced in the large-scale d. functional theory calcn. code CONQUEST. Multisite local orbitals correspond to local MOs so that the no. of required local orbitals can be minimal. The multisite support functions are detd. by using the localized filter diagonalization (LFD) method. Two new methods, the double cutoff method and the smoothing method, are introduced to the LFD method to improve efficiency and stability. The Hamiltonian and overlap matrixes with multisite local orbitals are constructed by efficient sparse-matrix multiplications in CONQUEST. The investigation of the calcd. energetic and geometrical properties and band structures of bulk Si, Al, and DNA systems demonstrate the accuracy and the computational efficiency of the present method. The representability of both occupied and unoccupied band structures with the present method has been also confirmed.
- 36Lin, L.; Lu, J.; Ying, L.; E, W. Adaptive local basis set for Kohn-Sham density functional theory in a discontinuous Galerkin framework I: Total energy calculation. J. Comput. Phys. 2012, 231, 2140– 2154, DOI: 10.1016/j.jcp.2011.11.032There is no corresponding record for this reference.
- 37Lin, L.; Lu, J.; Ying, L.; E, W. Optimized local basis set for Kohn-Sham density functional theory. J. Comput. Phys. 2012, 231, 4515– 4529, DOI: 10.1016/j.jcp.2012.03.009There is no corresponding record for this reference.
- 38Mao, Y.; Horn, P. R.; Mardirossian, N.; Head-Gordon, T.; Skylaris, C.-K.; Head-Gordon, M. Approaching the basis set limit for DFT calculations using an environment-adapted minimal basis with perturbation theory: Formulation, proof of concept, and a pilot implementation. J. Chem. Phys. 2016, 145, 044109, DOI: 10.1063/1.495912538Approaching the basis set limit for DFT calculations using an environment-adapted minimal basis with perturbation theory: Formulation, proof of concept, and a pilot implementationMao, Yuezhi; Horn, Paul R.; Mardirossian, Narbe; Head-Gordon, Teresa; Skylaris, Chris-Kriton; Head-Gordon, MartinJournal of Chemical Physics (2016), 145 (4), 044109/1-044109/17CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)Recently developed d. functionals have good accuracy for both thermochem. (TC) and non-covalent interactions (NC) if very large AO basis sets are used. To approach the basis set limit with potentially lower computational cost, a new SCF scheme is presented that employs minimal adaptive basis (MAB) functions. The MAB functions are optimized on each at. site by minimizing a surrogate function. High accuracy is obtained by applying a perturbative correction (PC) to the MAB calcn., similar to dual basis approaches. Compared to exact SCF results, using this MAB-SCF (PC) approach with the same large target basis set produces <0.15 kcal/mol root-mean-square deviations for most of the tested TC datasets, and <0.1 kcal/mol for most of the NC datasets. The performance of d. functionals near the basis set limit can be even better reproduced. With further improvement to its implementation, MAB-SCF (PC) is a promising lower-cost substitute for conventional large-basis calcns. as a method to approach the basis set limit of modern d. functionals. (c) 2016 American Institute of Physics.
- 39Ramakrishnan, R.; von Lilienfeld, O. A. Rev. Comput. Chem.; John Wiley & Sons, Inc., 2017; pp 225– 256.There is no corresponding record for this reference.
- 40Hansen, K.; Montavon, G.; Biegler, F.; Fazli, S.; Rupp, M.; Scheffler, M.; von Lilienfeld, O. A.; Tkatchenko, A.; Müller, K.-R. Assessment and Validation of Machine Learning Methods for Predicting Molecular Atomization Energies. J. Chem. Theory Comput. 2013, 9, 3404– 3419, DOI: 10.1021/ct400195d40Assessment and Validation of Machine Learning Methods for Predicting Molecular Atomization EnergiesHansen, Katja; Montavon, Gregoire; Biegler, Franziska; Fazli, Siamac; Rupp, Matthias; Scheffler, Matthias; von Lilienfeld, O. Anatole; Tkatchenko, Alexandre; Mueller, Klaus-RobertJournal of Chemical Theory and Computation (2013), 9 (8), 3404-3419CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)The accurate and reliable prediction of properties of mols. typically requires computationally intensive quantum-chem. calcns. Recently, machine learning techniques applied to ab initio calcns. have been proposed as an efficient approach for describing the energies of mols. in their given ground-state structure throughout chem. compd. space. In this paper we outline a no. of established machine learning techniques and investigate the influence of the mol. representation on the methods performance. The best methods achieve prediction errors of 3 kcal/mol for the atomization energies of a wide variety of mols. Rationales for this performance improvement are given together with pitfalls and challenges when applying machine learning approaches to the prediction of quantum-mech. observables.
- 41Handley, C. M.; Popelier, P. L. A. Potential Energy Surfaces Fitted by Artificial Neural Networks. J. Phys. Chem. A 2010, 114, 3371– 3383, DOI: 10.1021/jp910558541Potential Energy Surfaces Fitted by Artificial Neural NetworksHandley, Chris M.; Popelier, Paul L. A.Journal of Physical Chemistry A (2010), 114 (10), 3371-3383CODEN: JPCAFH; ISSN:1089-5639. (American Chemical Society)A review. Mol. mechanics is the tool of choice for the modeling of systems that are so large or complex that it is impractical or impossible to model them by ab initio methods. For this reason there is a need for accurate potentials that are able to quickly reproduce ab initio quality results at the fraction of the cost. The interactions within force fields are represented by a no. of functions. Some interactions are well understood and can be represented by simple math. functions while others are not so well understood and their functional form is represented in a simplistic manner or not even known. In the last 20 years there have been the first examples of a new design ethic, where novel and contemporary methods using machine learning, in particular, artificial neural networks, have been used to find the nature of the underlying functions of a force field. Here we appraise what has been achieved over this time and what requires further improvements, while offering some insight and guidance for the development of future force fields.
- 42Behler, J. Neural network potential-energy surfaces in chemistry: a tool for large-scale simulations. Phys. Chem. Chem. Phys. 2011, 13, 17930– 17955, DOI: 10.1039/c1cp21668f42Neural network potential-energy surfaces in chemistry: a tool for large-scale simulationsBehler, JoergPhysical Chemistry Chemical Physics (2011), 13 (40), 17930-17955CODEN: PPCPFQ; ISSN:1463-9076. (Royal Society of Chemistry)A review. The accuracy of the results obtained in mol. dynamics or Monte Carlo simulations crucially depends on a reliable description of the at. interactions. A large variety of efficient potentials has been proposed in the literature, but often the optimum functional form is difficult to find and strongly depends on the particular system. In recent years, artificial neural networks (NN) have become a promising new method to construct potentials for a wide range of systems. They offer a no. of advantages: they are very general and applicable to systems as different as small mols., semiconductors and metals; they are numerically very accurate and fast to evaluate; and they can be constructed using any electronic structure method. Significant progress has been made in recent years and a no. of successful applications demonstrate the capabilities of neural network potentials. In this Perspective, the current status of NN potentials is reviewed, and their advantages and limitations are discussed.
- 43Morawietz, T.; Singraber, A.; Dellago, C.; Behler, J. How van der Waals interactions determine the unique properties of water. Proc. Natl. Acad. Sci. U. S. A. 2016, 113, 8368– 8373, DOI: 10.1073/pnas.160237511343How van der Waals interactions determine the unique properties of waterMorawietz, Tobias; Singraber, Andreas; Dellago, Christoph; Behler, JoergProceedings of the National Academy of Sciences of the United States of America (2016), 113 (30), 8368-8373CODEN: PNASA6; ISSN:0027-8424. (National Academy of Sciences)Whereas the interactions between water mols. are dominated by strongly directional hydrogen bonds (HBs), it was recently proposed that relatively weak, isotropic van der Waals (vdW) forces are essential for understanding the properties of liq. water and ice. This insight was derived from ab initio computer simulations, which provide an unbiased description of water at the at. level and yield information on the underlying mol. forces. However, the high computational cost of such simulations prevents the systematic investigation of the influence of vdW forces on the thermodn. anomalies of water. Here, we develop efficient ab initio-quality neural network potentials and use them to demonstrate that vdW interactions are crucial for the formation of water's d. max. and its neg. vol. of melting. Both phenomena can be explained by the flexibility of the HB network, which is the result of a delicate balance of weak vdW forces, causing, e.g., a pronounced expansion of the second solvation shell upon cooling that induces the d. max.
- 44Snyder, J. C.; Rupp, M.; Hansen, K.; Blooston, L.; Mueller, K.-R.; Burke, K. Orbital-free bond breaking via machine learning. J. Chem. Phys. 2013, 139, 224104, DOI: 10.1063/1.483407544Orbital-free bond breaking via machine learningSnyder, John C.; Rupp, Matthias; Hansen, Katja; Blooston, Leo; Mueller, Klaus-Robert; Burke, KieronJournal of Chemical Physics (2013), 139 (22), 224104/1-224104/10CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)Using a one-dimensional model, we explore the ability of machine learning to approx. the non-interacting kinetic energy d. functional of diatomics. This nonlinear interpolation between Kohn-Sham ref. calcns. can (i) accurately dissoc. a diat., (ii) be systematically improved with increased ref. data and (iii) generate accurate self-consistent densities via a projection method that avoids directions with no data. With relatively few densities, the error due to the interpolation is smaller than typical errors in std. exchange-correlation functionals. (c) 2013 American Institute of Physics.
- 45Schütt, K. T.; Glawe, H.; Brockherde, F.; Sanna, A.; Müller, K. R.; Gross, E. K. U. How to represent crystal structures for machine learning: Towards fast prediction of electronic properties. Phys. Rev. B: Condens. Matter Mater. Phys. 2014, 89, 205118, DOI: 10.1103/PhysRevB.89.20511845How to represent crystal structures for machine learning: towards fast prediction of electronic propertiesSchuett, K. T.; Glawe, H.; Brockherde, F.; Sanna, A.; Mueller, K. R.; Gross, E. K. U.Physical Review B: Condensed Matter and Materials Physics (2014), 89 (20), 205118/1-205118/5, 5 pp.CODEN: PRBMDO; ISSN:1098-0121. (American Physical Society)High-throughput d. functional calcns. of solids are highly time-consuming. As an alternative, we propose a machine learning approach for the fast prediction of solid-state properties. To achieve this, local spin-d. approxn. calcns. are used as a training set. We focus on predicting the value of the d. of electronic states at the Fermi energy. We find that conventional representations of the input data, such as the Coulomb matrix, are not suitable for the training of learning machines in the case of periodic solids. We propose a novel crystal structure representation for which learning and competitive prediction accuracies become possible within an unrestricted class of spd systems of arbitrary unit-cell size.
- 46Dral, P. O.; von Lilienfeld, O. A.; Thiel, W. Machine Learning of Parameters for Accurate Semiempirical Quantum Chemical Calculations. J. Chem. Theory Comput. 2015, 11, 2120– 2125, DOI: 10.1021/acs.jctc.5b0014146Machine Learning of Parameters for Accurate Semiempirical Quantum Chemical CalculationsDral, Pavlo O.; von Lilienfeld, O. Anatole; Thiel, WalterJournal of Chemical Theory and Computation (2015), 11 (5), 2120-2125CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)We investigate possible improvements in the accuracy of semiempirical quantum chem. (SQC) methods through the use of machine learning (ML) models for the parameters. For a given class of compds., ML techniques require sufficiently large training sets to develop ML models that can be used for adapting SQC parameters to reflect changes in mol. compn. and geometry. The ML-SQC approach allows the automatic tuning of SQC parameters for individual mols., thereby improving the accuracy without deteriorating transferability to mols. with mol. descriptors very different from those in the training set. The performance of this approach is demonstrated for the semiempirical OM2 method using a set of 6095 constitutional isomers C7H10O2, for which accurate ab initio atomization enthalpies are available. The ML-OM2 results show improved av. accuracy and a much reduced error range compared with those of std. OM2 results, with mean abs. errors in atomization enthalpies dropping from 6.3 to 1.7 kcal/mol. They are also found to be superior to the results from specific OM2 reparameterizations (rOM2) for the same set of isomers. The ML-SQC approach thus holds promise for fast and reasonably accurate high-throughput screening of materials and mols.
- 47Kranz, J. J.; Kubillus, M.; Ramakrishnan, R.; von Lilienfeld, O. A.; Elstner, M. Generalized Density-Functional Tight-Binding Repulsive Potentials from Unsupervised Machine Learning. J. Chem. Theory Comput. 2018, 14, 2341– 2352, DOI: 10.1021/acs.jctc.7b0093347Generalized Density-Functional Tight-Binding Repulsive Potentials from Unsupervised Machine LearningKranz, Julian J.; Kubillus, Maximilian; Ramakrishnan, Raghunathan; von Lilienfeld, O. Anatole; Elstner, MarcusJournal of Chemical Theory and Computation (2018), 14 (5), 2341-2352CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)We combine the approx. d.-functional tight-binding (DFTB) method with unsupervised machine learning. This allows us to improve transferability and accuracy, make use of large quantum chem. data sets for the parametrization, and efficiently automatize the parametrization process of DFTB. For this purpose, generalized pair-potentials are introduced, where the chem. environment is included during the learning process, leading to more specific effective two-body potentials. We train on energies and forces of equil. and nonequil. structures of 2100 mols., and test on ∼130,000 org. mols. contg. O, N, C, H, and F atoms. Atomization energies of the ref. method can be reproduced within an error of ∼2.6 kcal/mol, indicating drastic improvement over std. DFTB.
- 48White, C. A.; Maslen, P.; Lee, M. S.; Head-Gordon, M. The tensor properties of energy gradients within a non-orthogonal basis. Chem. Phys. Lett. 1997, 276, 133– 138, DOI: 10.1016/S0009-2614(97)88046-3There is no corresponding record for this reference.
- 49Bartók, A. P.; Kondor, R.; Csányi, G. On representing chemical environments. Phys. Rev. B: Condens. Matter Mater. Phys. 2013, 87, 184115, DOI: 10.1103/PhysRevB.87.18411549On representing chemical environmentsBartok, Albert P.; Kondor, Risi; Csanyi, GaborPhysical Review B: Condensed Matter and Materials Physics (2013), 87 (18), 184115/1-184115/16CODEN: PRBMDO; ISSN:1098-0121. (American Physical Society)We review some recently published methods to represent at. neighborhood environments, and analyze their relative merits in terms of their faithfulness and suitability for fitting potential energy surfaces. The crucial properties that such representations (sometimes called descriptors) must have are differentiability with respect to moving the atoms and invariance to the basic symmetries of physics: rotation, reflection, translation, and permutation of atoms of the same species. We demonstrate that certain widely used descriptors that initially look quite different are specific cases of a general approach, in which a finite set of basis functions with increasing angular wave nos. are used to expand the at. neighborhood d. function. Using the example system of small clusters, we quant. show that this expansion needs to be carried to higher and higher wave nos. as the no. of neighbors increases in order to obtain a faithful representation, and that variants of the descriptors converge at very different rates. We also propose an altogether different approach, called Smooth Overlap of Atomic Positions, that sidesteps these difficulties by directly defining the similarity between any two neighborhood environments, and show that it is still closely connected to the invariant descriptors. We test the performance of the various representations by fitting models to the potential energy surface of small silicon clusters and the bulk crystal.
- 50De, S.; Bartok, A. P.; Csanyi, G.; Ceriotti, M. Comparing molecules and solids across structural and alchemical space. Phys. Chem. Chem. Phys. 2016, 18, 13754– 13769, DOI: 10.1039/C6CP00415F50Comparing molecules and solids across structural and alchemical spaceDe, Sandip; Bartok, Albert P.; Csanyi, Gabor; Ceriotti, MichelePhysical Chemistry Chemical Physics (2016), 18 (20), 13754-13769CODEN: PPCPFQ; ISSN:1463-9076. (Royal Society of Chemistry)Evaluating the (dis)similarity of cryst., disordered and mol. compds. is a crit. step in the development of algorithms to navigate automatically the configuration space of complex materials. For instance, a structural similarity metric is crucial for classifying structures, searching chem. space for better compds. and materials, and driving the next generation of machine-learning techniques for predicting the stability and properties of mols. and materials. In the last few years several strategies have been designed to compare at. coordination environments. In particular, the smooth overlap of at. positions (SOAPs) has emerged as an elegant framework to obtain translation, rotation and permutation-invariant descriptors of groups of atoms, underlying the development of various classes of machine-learned inter-at. potentials. Here we discuss how one can combine such local descriptors using a regularized entropy match (REMatch) approach to describe the similarity of both whole mol. and bulk periodic structures, introducing powerful metrics that enable the navigation of alchem. and structural complexities within a unified framework. Furthermore, using this kernel and a ridge regression method we can predict atomization energies for a database of small org. mols. with a mean abs. error below 1 kcal mol-1, reaching an important milestone in the application of machine-learning techniques for the evaluation of mol. properties.
- 51Zhu, L.; Amsler, M.; Fuhrer, T.; Schäfer, B.; Faraji, S.; Rostami, S.; Ghasemi, S. A.; Sadeghi, A.; Grauzinyte, M.; Wolverton, C.; Goedecker, S. A fingerprint based metric for measuring similarities of crystalline structures. J. Chem. Phys. 2016, 144, 034203, DOI: 10.1063/1.494002651A fingerprint based metric for measuring similarities of crystalline structuresZhu, Li; Amsler, Maximilian; Fuhrer, Tobias; Schaefer, Bastian; Faraji, Somayeh; Rostami, Samare; Ghasemi, S. Alireza; Sadeghi, Ali; Grauzinyte, Migle; Wolverton, Chris; Goedecker, StefanJournal of Chemical Physics (2016), 144 (3), 034203/1-034203/9CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)Measuring similarities/dissimilarities between at. structures is important for the exploration of potential energy landscapes. However, the cell vectors together with the coordinates of the atoms, which are generally used to describe periodic systems, are quantities not directly suitable as fingerprints to distinguish structures. Based on a characterization of the local environment of all atoms in a cell, we introduce crystal fingerprints that can be calcd. easily and define configurational distances between cryst. structures that satisfy the math. properties of a metric. This distance between two configurations is a measure of their similarity/dissimilarity and it allows in particular to distinguish structures. The new method can be a useful tool within various energy landscape exploration schemes, such as min. hopping, random search, swarm intelligence algorithms, and high-throughput screenings. (c) 2016 American Institute of Physics.
- 52Sadeghi, A.; Ghasemi, S. A.; Schäfer, B.; Mohr, S.; Lill, M. A.; Goedecker, S. Metrics for measuring distances in configuration spaces. J. Chem. Phys. 2013, 139, 184118, DOI: 10.1063/1.482870452Metrics for measuring distances in configuration spacesSadeghi, Ali; Ghasemi, S. Alireza; Schaefer, Bastian; Mohr, Stephan; Lill, Markus A.; Goedecker, StefanJournal of Chemical Physics (2013), 139 (18), 184118/1-184118/11CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)In order to characterize mol. structures we introduce configurational fingerprint vectors which are counterparts of quantities used exptl. to identify structures. The Euclidean distance between the configurational fingerprint vectors satisfies the properties of a metric and can therefore safely be used to measure dissimilarities between configurations in the high dimensional configuration space. In particular we show that these metrics are a perfect and computationally cheap replacement for the root-mean-square distance (RMSD) when one has to decide whether two noise contaminated configurations are identical or not. We introduce a Monte Carlo approach to obtain the global min. of the RMSD between configurations, which is obtained from a global minimization over all translations, rotations, and permutations of at. indexes. (c) 2013 American Institute of Physics.
- 53Rupp, M.; Tkatchenko, A.; Müller, K.-R.; von Lilienfeld, O. A. Fast and Accurate Modeling of Molecular Atomization Energies with Machine Learning. Phys. Rev. Lett. 2012, 108, 058301, DOI: 10.1103/PhysRevLett.108.05830153Fast and accurate modeling of molecular atomization energies with machine learningRupp, Matthias; Tkatchenko, Alexandre; Mueller, Klaus-Robert; von Lilienfeld, O. AnatolePhysical Review Letters (2012), 108 (5), 058301/1-058301/5CODEN: PRLTAO; ISSN:0031-9007. (American Physical Society)The authors introduce a machine learning model to predict atomization energies of a diverse set of org. mols., based on nuclear charges and at. positions only. The problem of solving the mol. Schrodinger equation is mapped onto a nonlinear statistical regression problem of reduced complexity. Regression models are trained on and compared to atomization energies computed with hybrid d.-functional theory. Cross validation over more than seven thousand org. mols. yields a mean abs. error of ∼10 kcal/mol. Applicability is demonstrated for the prediction of mol. atomization potential energy curves.
- 54Behler, J. Constructing high-dimensional neural network potentials: A tutorial review. Int. J. Quantum Chem. 2015, 115, 1032– 1050, DOI: 10.1002/qua.2489054Constructing high-dimensional neural network potentials: A tutorial reviewBehler, JoergInternational Journal of Quantum Chemistry (2015), 115 (16), 1032-1050CODEN: IJQCB2; ISSN:0020-7608. (John Wiley & Sons, Inc.)A lot of progress has been made in recent years in the development of atomistic potentials using machine learning (ML) techniques. In contrast to most conventional potentials, which are based on phys. approxns. and simplifications to derive an analytic functional relation between the at. configuration and the potential-energy, ML potentials rely on simple but very flexible math. terms without a direct phys. meaning. Instead, in case of ML potentials the topol. of the potential-energy surface is "learned" by adjusting a no. of parameters with the aim to reproduce a set of ref. electronic structure data as accurately as possible. Due to this bias-free construction, they are applicable to a wide range of systems without changes in their functional form, and a very high accuracy close to the underlying first-principles data can be obtained. Neural network potentials (NNPs), which have first been proposed about two decades ago, are an important class of ML potentials. Although the first NNPs have been restricted to small mols. with only a few degrees of freedom, they are now applicable to high-dimensional systems contg. thousands of atoms, which enables addressing a variety of problems in chem., physics, and materials science. In this tutorial review, the basic ideas of NNPs are presented with a special focus on developing NNPs for high-dimensional condensed systems. A recipe for the construction of these potentials is given and remaining limitations of the method are discussed. © 2015 Wiley Periodicals, Inc.
- 55Rasmussen, C. E.; Williams, C. K. I. Gaussian Processes for Machine Learning; The MIT Press, 2006; http://www.gaussianprocess.org/gpml/ (accessed Apr. 20, 2018).There is no corresponding record for this reference.
- 56Tikhonov, A. N.; Goncharsky, A. V.; Stepanov, V. V.; Yagola, A. G. Numerical Methods for the Solution of Ill-Posed Problems; Kluwer Academic, 1995.There is no corresponding record for this reference.
- 57Gillan, M. J.; Alfé, D.; Michaelides, A. Perspective: How good is DFT for water. J. Chem. Phys. 2016, 144, 130901, DOI: 10.1063/1.494463357Perspective: How good is DFT for water?Gillan, Michael J.; Alfe, Dario; Michaelides, AngelosJournal of Chemical Physics (2016), 144 (13), 130901/1-130901/33CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)A review. Kohn-Sham d. functional theory (DFT) has become established as an indispensable tool for investigating aq. systems of all kinds, including those important in chem., surface science, biol., and the earth sciences. Nevertheless, many widely used approxns. for the exchange-correlation (XC) functional describe the properties of pure water systems with an accuracy that is not fully satisfactory. The explicit inclusion of dispersion interactions generally improves the description, but there remain large disagreements between the predictions of different dispersion-inclusive methods. We present here a review of DFT work on water clusters, ice structures, and liq. water, with the aim of elucidating how the strengths and weaknesses of different XC approxns. manifest themselves across this variety of water systems. Our review highlights the crucial role of dispersion in describing the delicate balance between compact and extended structures of many different water systems, including the liq. By referring to a wide range of published work, we argue that the correct description of exchange-overlap interactions is also extremely important, so that the choice of semi-local or hybrid functional employed in dispersion-inclusive methods is crucial. The origins and consequences of beyond-2-body errors of approx. XC functionals are noted, and we also discuss the substantial differences between different representations of dispersion. We propose a simple numerical scoring system that rates the performance of different XC functionals in describing water systems, and we suggest possible future developments. (c) 2016 American Institute of Physics.
- 58Del Ben, M.; Schönherr, M.; Hutter, J.; VandeVondele, J. Bulk Liquid Water at Ambient Temperature and Pressure from MP2 Theory. J. Phys. Chem. Lett. 2013, 4, 3753– 3759, DOI: 10.1021/jz401931f58Bulk Liquid Water at Ambient Temperature and Pressure from MP2 TheoryDel Ben, Mauro; Schonherr, Mandes; Hutter, Jurg; Vande Vondele, JoostJournal of Physical Chemistry Letters (2013), 4 (21), 3753-3759CODEN: JPCLCD; ISSN:1948-7185. (American Chemical Society)MP2 provides a good description of hydrogen bonding in water clusters and includes long-range dispersion interactions without the need to introduce empirical elements in the description of the interat. potential. To assess its performance for bulk liq. water under ambient conditions, an isobaric-isothermal (NpT) Monte Carlo simulation at the second-order Moller-Plesset perturbation theory level (MP2) has been performed. The obtained value of the water d. is excellent (1.02 g/mL), and the calcd. radial distribution functions are in fair agreement with exptl. data. The MP2 results are compared to a few d. functional approxns., including semilocal functionals, hybrid functionals, and functionals including empirical dispersion corrections. These results demonstrate the feasibility of directly sampling the potential energy surface of condensed-phase systems using correlated wave function theory, and their quality paves the way for further applications.
- 59Del Ben, M.; Hutter, J.; VandeVondele, J. Probing the structural and dynamical properties of liquid water with models including non-local electron correlation. J. Chem. Phys. 2015, 143, 054506, DOI: 10.1063/1.492732559Probing the structural and dynamical properties of liquid water with models including non-local electron correlationDel Ben, Mauro; Hutter, Jurg; Vande Vondele, JoostJournal of Chemical Physics (2015), 143 (5), 054506/1-054506/12CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)Water is a ubiquitous liq. that displays a wide range of anomalous properties and has a delicate structure that challenges expt. and simulation alike. The various intermol. interactions that play an important role, such as repulsion, polarization, hydrogen bonding, and van der Waals interactions, are often difficult to reproduce faithfully in atomistic models. Here, electronic structure theories including all these interactions at equal footing, which requires the inclusion of non-local electron correlation, are used to describe structure and dynamics of bulk liq. water. Isobaric-isothermal (NpT) ensemble simulations based on the RPA yield excellent d. (0.994 g/mL) and fair radial distribution functions, while various other d. functional approxns. produce scattered results (0.8-1.2 g/mL). Mol. dynamics simulation in the microcanonical (NVE) ensemble based on Moller-Plesset perturbation theory (MP2) yields dynamical properties in the condensed phase, namely, the IR spectrum and diffusion const. At the MP2 and RPA levels of theory, ice is correctly predicted to float on water, resolving one of the anomalies as resulting from a delicate balance between van der Waals and hydrogen bonding interactions. For several properties, obtaining quant. agreement with expt. requires correction for nuclear quantum effects (NQEs), highlighting their importance, for structure, dynamics, and electronic properties. A computed NQE shift of 0.6 eV for the band gap and absorption spectrum illustrates the latter. Giving access to both structure and dynamics of condensed phase systems, non-local electron correlation will increasingly be used to study systems where weak interactions are of paramount importance. (c) 2015 American Institute of Physics.
- 60Verlet, L. Computer ”Experiments” on Classical Fluids. I. Thermodynamical Properties of Lennard-Jones Molecules. Phys. Rev. 1967, 159, 98– 103, DOI: 10.1103/PhysRev.159.9860Computer "experiments" on classical fluids. I. Thermodynamical properties of Lennard-Jones moleculesVerlet, LoupPhysical Review (1967), 159 (1), 98-103CODEN: PHRVAO; ISSN:0031-899X.The equation of motion of a system of 864 particles interacting through a Lennard-Jones potential was integrated for various values of the temp. and d., relative, generally, to a fluid state. The equil. properties agree with the corresponding properties of Ar. The equil. state of Ar can be described through a 2-body potential.
- 61Elstner, M.; Porezag, D.; Jungnickel, G.; Elsner, J.; Haugk, M.; Frauenheim, T.; Suhai, S.; Seifert, G. Self-consistent-charge density-functional tight-binding method for simulations of complex materials properties. Phys. Rev. B: Condens. Matter Mater. Phys. 1998, 58, 7260– 7268, DOI: 10.1103/PhysRevB.58.726061Self-consistent-charge density-functional tight-binding method for simulations of complex materials propertiesElstner, M.; Porezag, D.; Jungnickel, G.; Elsner, J.; Haugk, M.; Frauenheim, Th.; Suhai, S.; Seifert, G.Physical Review B: Condensed Matter and Materials Physics (1998), 58 (11), 7260-7268CODEN: PRBMDO; ISSN:0163-1829. (American Physical Society)We outline details about an extension of the tight-binding (TB) approach to improve total energies, forces, and transferability. The method is based on a second-order expansion of the Kohn-Sham total energy in d.-functional theory (DFT) with respect to charge-d. fluctuations. The zeroth-order approach is equiv. to a common std. non-self-consistent (TB) scheme, while at second-order a transparent, parameter-free, and readily calculable expression for generalized Hamiltonian matrix elements may be derived. These are modified by a self-consistent redistribution of Mulliken charges (SCC). Besides the usual "band structure" and short-range repulsive terms the final approx. Kohn-Sham energy addnl. includes a Coulomb interaction between charge fluctuations. At large distances this accounts for long-range electrostatic forces between two point charges and approx. includes self-interaction contributions of a given atom if the charges are located at one and the same atom. We apply the new SCC scheme to problems where deficiencies within the non-SCC std. TB approach become obvious. We thus considerably improve transferability.
- 62Hu, H.; Lu, Z.; Elstner, M.; Hermans, J.; Yang, W. Simulating Water with the Self-Consistent-Charge Density Functional Tight Binding Method: From Molecular Clusters to the Liquid State. J. Phys. Chem. A 2007, 111, 5685– 5691, DOI: 10.1021/jp070308d62Simulating Water with the Self-Consistent-Charge Density Functional Tight Binding Method: From Molecular Clusters to the Liquid StateHu, Hao; Lu, Zhenyu; Elstner, Marcus; Hermans, Jan; Yang, WeitaoJournal of Physical Chemistry A (2007), 111 (26), 5685-5691CODEN: JPCAFH; ISSN:1089-5639. (American Chemical Society)The recently developed self-consistent-charge d. functional tight binding (SCCDFTB) method provides an accurate and inexpensive quantum mech. soln. to many mol. systems of interests. To examine the performance of the SCCDFTB method on (liq.) water, the most fundamental yet indispensable mol. in biol. systems, we report here the simulation results of water in sizes ranging from mol. clusters to the liq. state. The latter simulation was achieved through the use of the linear scaling divide-and-conquer approach. The results of liq. water simulation indicate that the SCCDFTB method can describe the structural and energetics of liq. water in qual. agreement with expts., and the results for water clusters suggest potential future improvements of the SCCDFTB method.
- 63Skinner, L. B.; Huang, C.; Schlesinger, D.; Pettersson, L. G. M.; Nilsson, A.; Benmore, C. J. Benchmark oxygen-oxygen pair-distribution function of ambient water from x-ray diffraction measurements with a wide Q-range. J. Chem. Phys. 2013, 138, 074506, DOI: 10.1063/1.479086163Benchmark oxygen-oxygen pair-distribution function of ambient water from x-ray diffraction measurements with a wide Q-rangeSkinner, Lawrie B.; Huang, Congcong; Schlesinger, Daniel; Pettersson, Lars G. M.; Nilsson, Anders; Benmore, Chris J.Journal of Chemical Physics (2013), 138 (7), 074506/1-074506/12CODEN: JCPSA6; ISSN:0021-9606. (American Institute of Physics)Four recent x-ray diffraction measurements of ambient liq. water are reviewed here. Each of these measurements represents a significant development of the x-ray diffraction technique applied to the study of liq. water. Sources of uncertainty from statistical noise, Q-range, Compton scattering, and self-scattering are discussed. The oxygen-hydrogen contribution to the measured x-ray scattering pattern was subtracted using literature data to yield an exptl. detn., with error bars, of the oxygen-oxygen pair-distribution function, gOO(r), which essentially describes the distribution of mol. centers. The extended Q-range and low statistical noise of these measurements has significantly reduced truncation effects and related errors in the gOO(r) functions obtained. From these measurements and error anal., the position and height of the nearest neighbor max. in gOO(r) were found to be 2.80(1) Å and 2.57(5) resp. Numerical data for the coherent differential x-ray scattering cross-section IX(Q), the oxygen-oxygen structure factor SOO(Q), and the derived gOO(r) are provided as benchmarks for calibrating force-fields for water. (c) 2013 American Institute of Physics.
- 64The CP2K developers group. CP2K , 2018; https://www.cp2k.org, (accessed Apr. 20, 2018).There is no corresponding record for this reference.
- 65Hutter, J.; Iannuzzi, M.; Schiffmann, F.; VandeVondele, J. CP2K: atomistic simulations of condensed matter systems. Wiley Interdiscip. Rev.: Comput. Mol. Sci. 2014, 4, 15– 25, DOI: 10.1002/wcms.115965cp2k: atomistic simulations of condensed matter systemsHutter, Juerg; Iannuzzi, Marcella; Schiffmann, Florian; VandeVondele, JoostWiley Interdisciplinary Reviews: Computational Molecular Science (2014), 4 (1), 15-25CODEN: WIRCAH; ISSN:1759-0884. (Wiley-Blackwell)A review. Cp2k has become a versatile open-source tool for the simulation of complex systems on the nanometer scale. It allows for sampling and exploring potential energy surfaces that can be computed using a variety of empirical and first principles models. Excellent performance for electronic structure calcns. is achieved using novel algorithms implemented for modern and massively parallel hardware. This review briefly summarizes the main capabilities and illustrates with recent applications the science cp2k has enabled in the field of atomistic simulation. WIREs Comput Mol Sci 2014, 4:15-25. doi: 10.1002/wcms.1159 The authors have declared no conflicts of interest in relation to this article. For further resources related to this article, please visit the WIREs website.
- 66VandeVondele, J.; Krack, M.; Mohamed, F.; Parrinello, M.; Chassaing, T.; Hutter, J. Quickstep: Fast and accurate density functional calculations using a mixed Gaussian and plane waves approach. Comput. Phys. Commun. 2005, 167, 103– 128, DOI: 10.1016/j.cpc.2004.12.01466QUICKSTEP: fast and accurate density functional calculations using a mixed Gaussian and plane waves approachVandeVondele, Joost; Krack, Matthias; Mohamed, Fawzi; Parrinello, Michele; Chassaing, Thomas; Hutter, JuergComputer Physics Communications (2005), 167 (2), 103-128CODEN: CPHCBZ; ISSN:0010-4655. (Elsevier B.V.)We present the Gaussian and plane waves (GPW) method and its implementation in which is part of the freely available program package CP2K. The GPW method allows for accurate d. functional calcns. in gas and condensed phases and can be effectively used for mol. dynamics simulations. We show how derivs. of the GPW energy functional, namely ionic forces and the Kohn-Sham matrix, can be computed in a consistent way. The computational cost of computing the total energy and the Kohn-Sham matrix is scaling linearly with the system size, even for condensed phase systems of just a few tens of atoms. The efficiency of the method allows for the use of large Gaussian basis sets for systems up to 3000 atoms, and we illustrate the accuracy of the method for various basis sets in gas and condensed phases. Agreement with basis set free calcns. for single mols. and plane wave based calcns. in the condensed phase is excellent. Wave function optimization with the orbital transformation technique leads to good parallel performance, and outperforms traditional diagonalisation methods. Energy conserving Born-Oppenheimer dynamics can be performed, and a highly efficient scheme is obtained using an extrapolation of the d. matrix. We illustrate these findings with calcns. using commodity PCs as well as supercomputers.
- 67Lippert, G.; Hutter, J.; Parrinello, M. A hybrid Gaussian and plane wave density functional scheme. Mol. Phys. 1997, 92, 477– 488, DOI: 10.1080/0026897970948211967A hybrid Gaussian and plane wave density functional schemeLippert, Gerald; Hutter, Juerg; Parrinello, MicheleMolecular Physics (1997), 92 (3), 477-487CODEN: MOPHAM; ISSN:0026-8976. (Taylor & Francis)A d.-functional theory-based algorithm for periodic and nonperiodic ab initio calcns. is presented. This scheme uses pseudopotentials in order to integrate out the core electrons from the problem. The valence pseudo wave functions are expanded in Gaussian-type orbitals and the d. is represented in a plane wave auxiliary basis. The Gaussian basis functions make it possible to use the efficient anal. integration schemes and screening algorithms of quantum chem. Novel recursion relations are developed for the calcn. of the matrix elements of the d.-dependent Kohn-Sham self-consistent potential. At the same time the use of a plane wave basis for the electron d. permits efficient calcn. of the Hartree energy using fast Fourier transforms, thus circumventing one of the major bottlenecks of std. Gaussian based calcns. Furthermore, this algorithm avoids the fitting procedures that go along with intermediate basis sets for the charge d. The performance and accuracy of this new scheme are discussed and selected examples are given.
- 68Perdew, J. P.; Burke, K.; Ernzerhof, M. Generalized Gradient Approximation Made Simple. Phys. Rev. Lett. 1996, 77, 3865– 3868, DOI: 10.1103/PhysRevLett.77.386568Generalized gradient approximation made simplePerdew, John P.; Burke, Kieron; Ernzerhof, MatthiasPhysical Review Letters (1996), 77 (18), 3865-3868CODEN: PRLTAO; ISSN:0031-9007. (American Physical Society)Generalized gradient approxns. (GGA's) for the exchange-correlation energy improve upon the local spin d. (LSD) description of atoms, mols., and solids. We present a simple derivation of a simple GGA, in which all parameters (other than those in LSD) are fundamental consts. Only general features of the detailed construction underlying the Perdew-Wang 1991 (PW91) GGA are invoked. Improvements over PW91 include an accurate description of the linear response of the uniform electron gas, correct behavior under uniform scaling, and a smoother potential.
- 69Goedecker, S.; Teter, M.; Hutter, J. Separable dual-space Gaussian pseudopotentials. Phys. Rev. B: Condens. Matter Mater. Phys. 1996, 54, 1703– 1710, DOI: 10.1103/PhysRevB.54.170369Separable dual-space Gaussian pseudopotentialsGoedecker, S.; Teter, M.; Hutter, J.Physical Review B: Condensed Matter (1996), 54 (3), 1703-1710CODEN: PRBMDO; ISSN:0163-1829. (American Physical Society)We present pseudopotential coeffs. for the first two rows of the Periodic Table. The pseudopotential is of an analytic form that gives optimal efficiency in numerical calculations using plane waves as a basis set. At most, even coeffs. are necessary to specify its analytic form. It is separable and has optimal decay properties in both real and Fourier space. Because of this property, the application of the nonlocal part of the pseudopotential to a wave function can be done efficiently on a grid in real space. Real space integration is much faster for large systems than ordinary multiplication in Fourier space, since it shows only quadratic scaling with respect to the size of the system. We systematically verify the high accuracy of these pseudopotentials by extensive at. and mol. test calcns.
- 70Niklasson, A. M.; Tymczak, C.; Challacombe, M. Trace resetting density matrix purification in
self-consistent-field theory. J. Chem. Phys. 2003, 118, 8611– 8620, DOI: 10.1063/1.1559913
There is no corresponding record for this reference. - 71Schütt, O. Enabling Large Scale DFT Simulation with GPU Acceleration and Machine Learning. Ph.D. Thesis, ETH Zürich, 2017.There is no corresponding record for this reference.
Supporting Information
Supporting Information
The Supporting Information is available free of charge on the ACS Publications website at DOI: 10.1021/acs.jctc.8b00378.
Representative input files for most simulations (ZIP)
Terms & Conditions
Most electronic Supporting Information files are available without a subscription to ACS Web Editions. Such files may be downloaded by article for research use (if there is a public use license linked to the relevant article, that license may permit other uses). Permission may be obtained from ACS for other uses through requests via the RightsLink permission system: http://pubs.acs.org/page/copyright/permissions.html.