Download - Fully Analytic Energy Gradient in the Fragment …...Fully Analytic Energy Gradient in the Fragment Molecular Orbital Method Abstract TheZ-vector equations are derived and implemented

Ames Laboratory Publications Ames Laboratory

2011

Fully Analytic Energy Gradient in the FragmentMolecular Orbital MethodTakeshi NagataNational Institute of Advanced Industrial Science and Technology

Kurt Ryan BrorsenIowa State University

Dmitri G. FedorovNational Institute of Advanced Industrial Science and Technology

Kazuo KitauraNational Institute of Advanced Industrial Science and Technology

Mark S. GordonIowa State University, [email protected]

Follow this and additional works at: http://lib.dr.iastate.edu/ameslab_pubs

Part of the Chemistry Commons

The complete bibliographic information for this item can be found at http://lib.dr.iastate.edu/ameslab_pubs/345. For information on how to cite this item, please visit http://lib.dr.iastate.edu/howtocite.html.

This Article is brought to you for free and open access by the Ames Laboratory at Iowa State University Digital Repository. It has been accepted forinclusion in Ames Laboratory Publications by an authorized administrator of Iowa State University Digital Repository. For more information, pleasecontact [email protected].

http://lib.dr.iastate.edu/?utm_source=lib.dr.iastate.edu%2Fameslab_pubs%2F345&utm_medium=PDF&utm_campaign=PDFCoverPages

http://lib.dr.iastate.edu/?utm_source=lib.dr.iastate.edu%2Fameslab_pubs%2F345&utm_medium=PDF&utm_campaign=PDFCoverPages

http://lib.dr.iastate.edu/ameslab_pubs?utm_source=lib.dr.iastate.edu%2Fameslab_pubs%2F345&utm_medium=PDF&utm_campaign=PDFCoverPages

http://lib.dr.iastate.edu/ameslab?utm_source=lib.dr.iastate.edu%2Fameslab_pubs%2F345&utm_medium=PDF&utm_campaign=PDFCoverPages

http://lib.dr.iastate.edu/ameslab_pubs?utm_source=lib.dr.iastate.edu%2Fameslab_pubs%2F345&utm_medium=PDF&utm_campaign=PDFCoverPages

http://network.bepress.com/hgg/discipline/131?utm_source=lib.dr.iastate.edu%2Fameslab_pubs%2F345&utm_medium=PDF&utm_campaign=PDFCoverPages

http://lib.dr.iastate.edu/ameslab_pubs/345

http://lib.dr.iastate.edu/ameslab_pubs/345

http://lib.dr.iastate.edu/howtocite.html

http://lib.dr.iastate.edu/howtocite.html

mailto:[email protected]

Fully Analytic Energy Gradient in the Fragment Molecular OrbitalMethod

AbstractThe Z-vector equations are derived and implemented for solving the response term due to the externalelectrostatic potentials, and the corresponding contribution is added to the energy gradients in the frameworkof the fragment molecular orbital (FMO) method. To practically solve the equations for large molecules likeproteins, the equations are decoupled by taking advantage of the local nature of fragments in the FMOmethod and establishing the self-consistent Z-vector method. The resulting gradients are compared withnumerical gradients for the test molecular systems: (H2O)64, alanine decamer, hydrated chignolin with theprotein data bank (PDB) ID of 1UAO, and a Trp-cage miniprotein construct (PDB ID: 1L2Y). Thecomputation time for calculating the response contribution is comparable to or less than that of the FMO self-consistent charge calculation. It is also shown that the energy gradients for the electrostatic dimerapproximation are fully analytic, which significantly reduces the computational costs. The fully analytic FMOgradient is parallelized with an efficiency of about 98% on 32 nodes.

KeywordsPolymers, Electrostatics, Chemical bonds, Proteins, Density functional theory

DisciplinesChemistry

CommentsThe following article appeared in Journal of Chemical Physics 134 (2011): 124115, and may be found atdoi:10.1063/1.3568010.

RightsCopyright 2011 American Institute of Physics. This article may be downloaded for personal use only. Anyother use requires prior permission of the author and the American Institute of Physics.

This article is available at Iowa State University Digital Repository: http://lib.dr.iastate.edu/ameslab_pubs/345

http://dx.doi.org/10.1063/1.3568010

http://lib.dr.iastate.edu/ameslab_pubs/345?utm_source=lib.dr.iastate.edu%2Fameslab_pubs%2F345&utm_medium=PDF&utm_campaign=PDFCoverPages

Fully analytic energy gradient in the fragment molecular orbital methodTakeshi Nagata, Kurt Brorsen, Dmitri G. Fedorov, Kazuo Kitaura, and Mark S. Gordon Citation: The Journal of Chemical Physics 134, 124115 (2011); doi: 10.1063/1.3568010 View online: http://dx.doi.org/10.1063/1.3568010 View Table of Contents: http://scitation.aip.org/content/aip/journal/jcp/134/12?ver=pdfcov Published by the AIP Publishing Articles you may be interested in Nonorthogonal orbital based N-body reduced density matrices and their applications to valence bond theory. II.An efficient algorithm for matrix elements and analytical energy gradients in VBSCF method J. Chem. Phys. 138, 164120 (2013); 10.1063/1.4801632 Unrestricted Hartree-Fock based on the fragment molecular orbital method: Energy and its analytic gradient J. Chem. Phys. 137, 044110 (2012); 10.1063/1.4737860 Analytic energy gradient for second-order Møller-Plesset perturbation theory based on the fragment molecularorbital method J. Chem. Phys. 135, 044110 (2011); 10.1063/1.3611020 Molecular applications of analytical gradient approach for the improved virtual orbital-complete active spaceconfiguration interaction method J. Chem. Phys. 132, 034105 (2010); 10.1063/1.3290203 Theoretical study of intramolecular interaction energies during dynamics simulations of oligopeptides by thefragment molecular orbital-Hamiltonian algorithm method J. Chem. Phys. 122, 094905 (2005); 10.1063/1.1857481

This article is copyrighted as indicated in the article. Reuse of AIP content is subject to the terms at: http://scitation.aip.org/termsconditions. Downloaded to IP:

129.186.176.217 On: Wed, 02 Dec 2015 16:21:14

http://scitation.aip.org/content/aip/journal/jcp?ver=pdfcov

http://oasc12039.247realmedia.com/RealMedia/ads/click_lx.ads/www.aip.org/pt/adcenter/pdfcover_test/L-37/701402136/x01/AIP-PT/JCP_ArticleDL_092315/AIP-2639_EIC_APL_Photonics_1640x440r2.jpg/6c527a6a713149424c326b414477302f?x

http://scitation.aip.org/search?value1=Takeshi+Nagata&option1=author

http://scitation.aip.org/search?value1=Kurt+Brorsen&option1=author

http://scitation.aip.org/search?value1=Dmitri+G.+Fedorov&option1=author

http://scitation.aip.org/search?value1=Kazuo+Kitaura&option1=author

http://scitation.aip.org/search?value1=Mark+S.+Gordon&option1=author

http://scitation.aip.org/content/aip/journal/jcp?ver=pdfcov

http://dx.doi.org/10.1063/1.3568010

http://scitation.aip.org/content/aip/journal/jcp/134/12?ver=pdfcov

http://scitation.aip.org/content/aip?ver=pdfcov

http://scitation.aip.org/content/aip/journal/jcp/138/16/10.1063/1.4801632?ver=pdfcov









THE JOURNAL OF CHEMICAL PHYSICS 134, 124115 (2011)

Fully analytic energy gradient in the fragment molecular orbital methodTakeshi Nagata (��),1,a) Kurt Brorsen,2 Dmitri G. Fedorov,1

Kazuo Kitaura (��),1,3 and Mark S. Gordon (��)2

1NRI, National Institute of Advanced Industrial Science and Technology (AIST), 1–1-1 Umezono, Tsukuba,Ibaraki 305–8568, Japan2Ames Laboratory, US-DOE and Department of Chemistry, Iowa State University, Ames, Iowa 50011, USA3Graduate School of Pharmaceutical Sciences, Kyoto University, 46–29 Yoshidashimo Adachi, Sakyo-ku,Kyoto 606–8501, Japan

(Received 11 November 2010; accepted 28 February 2011; published online 29 March 2011)

The Z-vector equations are derived and implemented for solving the response term due to the exter-nal electrostatic potentials, and the corresponding contribution is added to the energy gradients in theframework of the fragment molecular orbital (FMO) method. To practically solve the equations forlarge molecules like proteins, the equations are decoupled by taking advantage of the local nature offragments in the FMO method and establishing the self-consistent Z-vector method. The resultinggradients are compared with numerical gradients for the test molecular systems: (H2O)64, alaninedecamer, hydrated chignolin with the protein data bank (PDB) ID of 1UAO, and a Trp-cage minipro-tein construct (PDB ID: 1L2Y). The computation time for calculating the response contribution iscomparable to or less than that of the FMO self-consistent charge calculation. It is also shown thatthe energy gradients for the electrostatic dimer approximation are fully analytic, which significantlyreduces the computational costs. The fully analytic FMO gradient is parallelized with an efficiencyof about 98% on 32 nodes. © 2011 American Institute of Physics. [doi:10.1063/1.3568010]

I. INTRODUCTION

Dynamics simulations of large molecules have becomevery important in the field of computational chemistry.1–7 Forexample, thermodynamic data such as the free energy and en-tropy can be extracted from such simulations. Since chem-ical processes occur at nonzero temperatures, an increasingnumber of molecular dynamics (MD) simulations have beenperformed using model potentials based on molecular me-chanics (MM). Another common use of computational sim-ulations is the sampling of potential energy surfaces for localand global minima using for example, Monte Carlo simula-tions. For instance, the structural data of proteins measured byX-ray or nuclear magnetic resonance (NMR) can be refinedwith the help of a computational simulation and the result-ing structure stored in the protein data bank (PDB). ClassicalMM schemes can, in principle, be replaced by more accuratemethods that consider the electronic structure explicitly.8–13

The benefit of this replacement is that by using first princi-ples methods, more reliable and sophisticated thermodynamicproperties, dynamics, and structures can be provided.

The cost of conventional quantum mechanical (QM)electronic structure approaches formally scales at least as N m ,where N measures the size of the molecular system and mranges from 3 for simple methods [e.g., the local densityapproximation of density functional theory (DFT)] to muchhigher values for correlated methods. Considerable efforts areinvested in developing linear scaling or O(N ) methods.14, 15

The fragmentation methods have a fairly long history16–22 andnow it is an active area of research.13, 23–32

a)Author to whom correspondence should be addressed. Electronic mail:[email protected]. Fax: +81 29 851 5426.

One such method is the fragment molecular orbital(FMO) method.33–36 The FMO method enables the calcula-tion of electronic states for large molecules by performingMO calculations in parallel on fragments of a large molecularsystem using external electrostatic potentials (ESPs). TheseESPs describe the electrostatic field due to charge distribu-tions of all of the other fragments. MOs of individual frag-ments are local in nature, because they are expanded in termsof basis functions on the atoms in the fragments, reducing thecomputation time significantly. Additionally, in order to re-alize linear scaling, approximations have been proposed forthe time-consuming ESP calculation in FMO34, 37 and othermethods.10 The fragment self-consistent procedure employedin FMO has been used in somewhat different forms in othermethods,18, 19,9,38, 39 while the many-body expansion of theenergy is conceptually related to the theory of intermolecu-lar interactions16 or the density expansion method.20

Over the past decade, the FMO method has becomeone of the most extensively developed fragmentation meth-ods for the calculation of accurate chemical properties, suchas the total energy, the dipole moment, and the interac-tion energy between fragments in large systems. The FMOmethod has been interfaced with many QM methods: Sec-ond order Møller–Plesset (MP2) perturbation theory,40, 41

coupled-cluster theory,42 DFT,43, 44 Hatree-Fock, the mul-ticonfiguration self-consistent field method,45 configurationinteraction,46, 47 time-dependent DFT,48–50 open-shells51 andnuclear-electronic orbitals.52 For these methods, the total en-ergies are in good agreement with the corresponding con-ventional QM total energies. The FMO method has been ap-plied to a number of large systems.53–66 FMO–MD has beenused67–78 to study the dynamics of various processes such as

0021-9606/2011/134(12)/124115/13/$30.00 © 2011 American Institute of Physics134, 124115-1


129.186.176.217 On: Wed, 02 Dec 2015 16:21:14

http://dx.doi.org/10.1063/1.3568010

http://dx.doi.org/10.1063/1.3568010

mailto: [email protected].

124115-2 Nagata et al. J. Chem. Phys. 134, 124115 (2011)

chemical reactions. However, the FMO–MD method suffersfrom an accumulation of errors67 with the time evolution dueto an incomplete analytic FMO energy gradient.

The analytic FMO energy gradient reported by Kitaura etal. in 2001 (Ref. 79) was incomplete since the response termdue to the ESPs was neglected assuming that it is a small con-tribution for small basis sets. It was subsequently illustratedthat when using larger basis sets, such as 6–31G(d),80 the re-sponse contribution to the gradient is substantial. Evaluatingthe response contribution requires the solution of the coupled-perturbed Hartree–Fock (CPHF) equations. These equationsare time consuming to solve for large molecules, since theydepend not on the fragment size but on the size of the entiresystem.

Since the original report of a partially analytic FMOgradient,79 several improvements regarding previously miss-ing terms in the FMO gradient have been implemented inthe GAMESS program package:81, 82 (a) The ESP derivativeterms including the Mulliken point charge (PTC) approxi-mation (ESP–PTC),80 (b) the derivative of the dimer energyof two distant fragments approximated as the electrostaticinteraction energy (the electrostatic dimer approximation,ES-DIM),83 and (c) the hybrid orbital projection derivatives.84

The last and most complicated missing response term isthe subject of the present paper in which the exact responseterm for the FMO ESP gradient contribution is introduced,and the fully analytic FMO energy gradient is implemented,with an additional computation time requirement that is com-parable to the first step of the FMO calculation (the self-consistent charge, SCC, iterations). The gradient implementa-tion is also extended to the ES-DIM approximation but not yetto the ESP–PTC approximation. Consequently, the usefulnessof the FMO method is improved by establishing a completeexpression of the analytic energy gradient.

Similarly to FMO, the response term arises in other meth-ods where the external charges are not fully self-consistent.For example, in the electronically embedded method85, 86 lay-ers are introduced in the system. This is different from frag-ments because layers are mutually inclusive, i.e., lower layersinclude higher layers, and thus the lowest layer calculationdeals with the whole system, whereas the fragment based ap-proach we take uses the locality of fragments. On the otherhand in some other fragment methods such as X-Pol87 theCPHF-related terms do not arise.

II. FULLY ANALYTIC FMO ENERGY GRADIENT

A. Overview of the FMO energy gradient

In the following equations, the FMO energyexpression33–36 and the corresponding gradient equa-tions are briefly summarized. The two-body FMO (FMO2)total energy is represented as

E =N∑I

E ′I +

N∑I>J

(E ′I J − E ′

I − E ′J ) +

N∑I>J

Tr(�DI J VI J ),

(1)

where E ′X is the internal fragment energy for fragment X (X

= I or IJ, for monomers and dimers, respectively). VI J is the

matrix form of the ESP for dimer IJ due to the electron den-sities and nuclei of the remaining fragments, i.e.,

V I Jμν =

∑K �=I J

(uK

μν + v Kμν

). (2)

The one-electron and two-electron integrals in V I Jμν are,

respectively,

uKμν =

∑A∈K

〈μ| −Z A

|r − RA| |ν〉, (3)

v Kμν =

∑λσ∈K

DKλσ (μν|λσ ) , (4)

where DKλσ is the density matrix element of fragment K and

(μν|λσ ) is a two-electron integral in the atomic orbital (AO)basis. Index A runs over atoms. The dimer density matrix dif-ference �DI J in Eq. (1) is defined by

�DI J = DI J − (DI ⊕ DJ ). (5)

We note that the SCC process ensures the self-consistency of monomer densities with respect to each otherbut not monomer and dimer densities. The direct sum inEq. (5) means blockwise addition of two monomer matricesinto the dimer supermatrix.

The internal fragment energies in Eq. (1) may be writtenin the form:

E ′X =

∑μν∈X

DXμνhX

μν

+ 1

2

∑μνλσ∈X

[DX

μν DXλσ − 1

2DX

μλ DXνσ

](μν|λσ )

+∑μν∈X

DXμν P X

μν + ENRX , (6)

where hXμν is the X-mer one-electron Hamiltonian and the nu-

clear repulsion (NR) energy is

ENRX =

∑B∈X

∑A(∈X )>B

Z A Z B

RAB. (7)

Note that monomer and dimer densities are determinedby the MO calculations in the presence of ESPs due to theremaining monomers. For fragmentation across a covalentbond, the hybrid orbital projection (HOP) contribution

occ∑i∈X

2 〈i | P̂ X |i〉 =∑μν∈X

DXμν P X

μν, (8)

must be considered in Eq. (6). The HOP operator is definedby

P̂ X =∑k∈X

Bk |θk〉〈θk | , (9)

where |θk〉 is a hybrid orbital and the universal constant Bk isusually set to 106.


129.186.176.217 On: Wed, 02 Dec 2015 16:21:14

124115-3 Fully analytic energy gradient in FMO J. Chem. Phys. 134, 124115 (2011)

The differentiation of E ′X with respect to nuclear coordi-

nate a leads to

∂ E ′X

∂a=

∑μν∈X

DXμν

∂hXμν

∂a

+ 1

2

∑μνλσ∈X

[DX

μν DXλσ − 1

2DX

μλ DXνσ

]∂ (μν|λσ )

∂a

+∑μν∈X

DXμν

∂ P Xμν

∂a− 2

occ∑i, j∈X

Sa,Xji F ′X

ji

− 4occ∑i∈X

vir∑r∈X

U a,Xri V X

ri + ∂ ENRX

∂a, (10)

where the overlap derivative matrix element is defined by

Sa,Xi j =

∑μν∈X

C X∗μi

∂SXμν

∂aC X

ν j . (11)

The internal fragment Fock matrix element F ′Xi j is given

in terms of MOs:

F ′Xi j = hX

i j +occ∑

k∈X

[2(i j |kk) − (ik| jk)] + P Xi j , (12)

where the MO-based projection operator matrix P Xi j is intro-

duced for convenience:

P Xi j =

∑μν∈X

C X∗μi P X

μνC Xν j . (13)

Throughout this study, the Roman (ijkl· · ·) and Greek(μνρσ · · ·) indices denote the molecular orbitals and AOs, re-spectively. For the response term in Eq. (10), the followingequation defined in the previous study80 is introduced:

Ua,X,Y = 4

occ∑i∈X

vir∑r∈X

U a,Xri V Y

ri . (14)

The response terms U a,Xri associated with ESP V Y

ri in theMO basis arise from the expansion of the MO coefficientderivative in terms of the MO coefficients,88 and Eq. (14)sums the U a,X

ri terms multiplied by the V Yri terms in order

to simplify the gradient derivations. There are two types of

Ua,X,Y

terms: (a) Ua,I,I

(i.e., X = Y) arising from the deriva-

tive of the monomer terms, and (b) Ua,X,I J

where X can be I,J, or IJ [related to the three D terms in Eq. (5)]. The deriva-tives of the MO coefficients can be written as

∂C Xμi

∂a=

occ+vir∑m∈X

U a,Xmi C X

μm . (15)

To obtain the occupied-virtual orbital response U a,Xri , one

must solve the CPHF equations. This will be discussed in sub-sequent subsections.

The differentiation of the ESP energy contribution inEq. (1) with respect to nuclear coordinate a leads to

∂

∂aTr(�DI J VI J )

= ∑μν∈I J

�DI Jμν

∑K �=I J

[∂uK

μν

∂a+

∑λσ∈K

DKλσ

∂ (μν|λσ )

∂a

]

− 2∑

μν∈I JW I J

μν

∂SI Jμν

∂a+2

∑μν∈I

W Iμν

∂SIμν

∂a+ 2

∑μν∈J

W Jμν

∂S Jμν

∂a

+ Ua,I J,I J − U

a,I,I J −Ua,J,I J −2

∑K �=I J

∑μν∈K

�X K (I J )μν Sa,K

μν

+ 4∑

K �=I J

∑μν∈I J

vir∑r∈K

occ∑i∈K

�DI JμνU a,K

ri (μν|ri) ,

(16)where

W Xμν = 1

4

∑λσ∈X

DXμλV I J

λσ DXσν (17)

and

�X K (I J )μν = 1

4

∑λσ∈K

DKμλ

⎡⎣ ∑

ζη∈I J

�DI Jζη (ζη|λσ )

⎤⎦ DK

σν.

(18)The collection of all the U

a,X,Yterms in Eqs. (10) and

(16) subject to the differentiation of Eq. (1) forms the follow-ing equations (N = number of fragments):

Ua = −

N∑I

Ua,I,I −

N∑I>J

(U

a,I J,I J − Ua,I,I − U

a,J,J )

+N∑

I>J

(U

a,I J,I J − Ua,I,I J − U

a,J,I J ). (19)

Ua

is zero when no ESP approximations are applied orwhen all the ESPs are approximated uniformly (e.g., withthe ESP–PTC approximation).80 Otherwise, U

ais only ap-

proximately equal to zero. Ua

is regarded as a compensationterm arising by defining the internal fragment energies andthe ESP contribution separately in Eq. (1). If U

a = 0, then in

Eq. (16) the dimer-related terms Ua,I J,I J − U

a,I,I J − Ua,J,I J

need not be evaluated, and the only terms that require the so-lution of CPHF equations come from the monomers, U a,I

ri .

B. Coupled-perturbed Hartree–Fock equations in FMO

Calculating the fully analytic FMO energy gradients(with the aforementioned conditions on the ESP approxima-tions that lead to U

a = 0) requires only the response termsdue to the monomers, i.e., U a,I

ri . These terms can be obtainedusing the diagonal nature of the monomer Fock matrices. Thepurpose of this subsection is to derive the CPHF equations forU a,I

ri in the framework of the FMO method.


129.186.176.217 On: Wed, 02 Dec 2015 16:21:14


For monomer I, the corresponding Fock matrix rewrittenin terms of MOs is

F Ii j = F

′ Ii j + V I

i j

= h̃ Ii j +

occ∑k∈I

[2(i j |kk) − (ik| jk)] + P Ii j ,

(20)

where the one-electron Hamiltonian in FMO is

h̃ Ii j = hI

i j + V Ii j . (21)

F ′Ii j is the internal Fock matrix (i.e., the full matrix F I

i j withESP subtracted), and P I

i j is the projection operator matrix.The differentiation of Eq. (20) with respect to nuclear co-

ordinate a is

∂ F Ii j

∂a= ∂

∂a

(h̃ I

i j +occ∑k∈I

[2(i j |kk) − (ik| jk)] + P Ii j

).

(22)After some algebra, Eq. (22) leads to

∂ F Ii j

∂a= Fa,I

i j +occ+vir∑

k∈I

(U a,I

ki F Ik j + U a,I

k j F Iik

)

+occ+vir∑

k∈I

occ∑l∈I

U a,Ikl A′

i j,kl +∑K �=I

occ+vir∑k∈K

occ∑l∈K

U a,Kkl 4(i j |kl).

(23)In Eq. (23) a superscript a indicates a derivative with re-

spect to coordinate a. The (molecular) orbital Hessian contri-bution is given by

A′i j,kl = 4(i j |kl) − (ik| jl) − (il| jk), (24)

and the Fock derivative is

Fa,Ii j = ha,I

i j + V a,Ii j +

occ∑k∈I

[2(i j |kk)a − (ik| jk)a] + Pa,Ii j .

(25)Equation (25) is derived from Eq. (10.7) of Ref. 88.

The derivative of the one-electron Hamiltonian, ha,Ii j and the

derivative two-electron integral, (i j |kl)a are defined as

ha,Ii j =

∑μν∈I

C I∗μi C I

ν j

∂hIμν

∂a, (26)

(i j |kl)a =∑

μνρσ∈I

C I∗μi C I

ν j CI∗ρk C I

σ l

∂(μν|ρσ )

∂a, (27)

(see Section 4.5 of Ref. 88 for details). The ESP derivativeV a,I

i j is defined as follows:

V a,Ii j =

∑K �=I

(ua,K

i j +occ∑

k∈K

2 (i j |kk)a

). (28)

F Ii j is zero by virtue of the SCF equations, if i �=j. The

HOP derivative Pa,Ii j and the one-electron contribution in the

ESP derivative ua,Ki j are defined analogously to ha,I

i j , as MO-transformed derivative integrals [see Eq. (26)].

After further algebra, Eq. (23) leads to

∂ F Ii j

∂a= Fa,I

i j − (ε I

j − ε Ii

)U a,I

i j − Sa,Ii j ε I

j

−occ∑

k,l∈I

Sa,Ikl [2(i j |kl) − (ik| jl)] +

vir∑k∈I

occ∑l∈I

U a,Ikl A′

i j,kl

−∑K �=I

occ∑k,l∈K

2Sa,Kkl (i j |kl) +

∑K �=I

vir∑k∈K

occ∑l∈K

U a,Kkl 4(i j |kl),

(29)where ε I

i is the orbital energy of MO i on fragment I and thefollowing relation88 is used

Sa,Xi j + U a,X

ji + U a,Xi j = 0. (30)

Equation (29) contains two types of unknown valuesU a,X

kl to be solved: U a,Ikl for the target fragment and U a,K

kl fromthe remaining fragments. Therefore, it is necessary to collect∂ F X

i j /∂a of all the monomer fragments to construct and solvethe complete CPHF equations. Consequently, a set of CPHFequations have the dimension of the whole system given inmatrix form as

AUa = Ba0, (31)

where the fragment diagonal and off-diagonal blocks of ma-trix A are given in Eqs. (32) and (33), respectively,

AI,Ii j,kl = δikδ jl

(ε I

j − ε Ii

) − [4(i j |kl) − (ik| jl) − (il| jk)]

(32)

and

AI,Ki j,kl = −4(i j |kl), (33)

where Eq. (32) comes from the MOs of only the target frag-ment I and Eq. (33) comes from the ESP acting upon I . Theij∈I element of vector Ba

0 is

Ba,I0,i j = Fa,I

i j − Sa,Ii j ε I

j −occ∑

kl∈I

Sa,Ikl [2(i j |kl) − (ik| jl)]

−∑K �=I

occ∑kl∈K

2Sa,Kkl (i j |kl). (34)

C. Z-vector method in FMO

It is impractical to solve the CPHF equations of Eq. (31)for all nuclear coordinates of a large system. To avoid thisdifficulty, the Z-vector method is applied to the FMO CPHFequations.88

In Eq. (16), the terms involving the unknowns U a,Kri are

collected for all of the dimer fragments IJ. The resulting con-tribution to the FMO energy gradient leads to

a = 4N∑

I>J

∑K �=I J

∑μν∈I J

vir∑r∈K

occ∑i∈K

�DI JμνU a,K

ri (μν|ri)

=∑

K

vir∑r∈K

occ∑i∈K

U a,Kri X K

ri , (35)


129.186.176.217 On: Wed, 02 Dec 2015 16:21:14


where X Kri is defined as

X Kri = 4

N∑(I>J )�=K

∑μν∈I J

�DI Jμν (μν|ri) . (36)

It is convenient to express Eq. (35) in vector form:

a =∑

K

vir∑r∈K

occ∑i∈K

U a,Kri X K

ri = XT Ua

= XT A−1Ba0

= ZT Ba0. (37)

This reduces the CPHF problem for solving Eq. (31) to aset of simultaneous equations:

AT Z = X. (38)

It is still time consuming to solve Eq. (38) directly, be-cause it has the dimension of the entire system. However, tak-ing advantage of the decoupled nature of the FMO method, amore clever approach can be considered by separating the di-agonal (I, I ) and off-diagonal (K , I ) blocks of A in Eq. (38):

vir∑k∈I

occ∑l∈I

AI,Ikl,ri Z I

kl = X Iri −

all∑K �=I

vir∑k∈K

occ∑l∈K

AK ,Ikl,ri Z K

kl , (39)

or in matrix form: (AI,I

)TZI = X′I , (40)

where

X′I = XI −∑K �=I

(AK ,I

)TZK . (41)

Equation (37), using definitions in Eqs. (34), (38), (40), and(41) gives the final formulation of the terms that are requiredto complete the FMO analytic gradient. The solution of theseequations is accomplished as follows, illustrated in Fig. 1.By taking the I diagonal blocks of matrix A and solving(AI,I

)TZI = XI , one finds the initial ZI . X′I is computed

to solve Eq. (40). The external unknowns Z Kkl in the last term

of Eq. (39) are frozen, and this equation is decoupled for eachfragment I. Z is obtained by solving Eq. (40) for all fragmentsindependently, and then X′I is updated with these new valuesof ZK for the next step; the calculations are then repeated untilall elements in the Z-vector are self-consistent. This is simi-lar in both procedure and computational cost to the SCC pro-cedure in the monomer energy calculation (and it also bearsa similarity to SCF), so it will be called the self-consistentZ-vector (SCZV) method. The preconditioned conjugate gra-dient method is applied to solve Eq. (40). The convergencetest is made for all Z-vector elements. If the root-mean-squaredeviation of ZI,new−ZI,old is larger than a threshold, the pro-cedure returns to step 2 in Fig. 1.

The SCZV method is parallelized using the generalizeddistributed data interface (GDDI).89 Because of its iterative

FIG. 1. Schematic diagram of the SCZV procedure.

decoupled nature, the computation time of SCZV is compara-ble to that of SCC.

D. Application to the electrostaticdimer approximation

The previous subsection presented a derivation in whichthe analytic FMO gradient was derived with no approxima-tions. In this subsection, the electrostatic dimer (ES-DIM) ap-proximation for the fully analytic energy gradients is intro-duced, and it is shown that the response terms arising fromthe ES-DIM approximation need not be considered, becausethey cancel out with the response term from Eq. (19).

For separated fragments I and J, if the distance RI J (de-fined as the distance between the closest atoms in I and J di-vided by the sum of their van der Waals atomic radii) is largerthan the threshold value LES−DIM in the ES-DIM approxima-tion, the internal pair interaction energy, i.e., the correspond-ing summand in the second sum on the right-hand side of Eq.(1) can be replaced by

E ′I J − E ′

I − E ′J ≈ Tr(DI uJ ) + Tr(DJ uI )

+∑μν∈I

∑λσ∈J

DIμν D J

λσ (μν|λσ ) + �ENRI J ,

(42)

where the NR term �ENRI J = ENR

I J − ENRI − ENR

J . In addition,the corresponding IJ term in the third sum of Eq. (1) can-cels out because �DI J = 0. The differentiation of Eq. (42)with respect to nuclear coordinate a without considering theresponse term is given elsewhere.83 Therefore, it is sufficienthere to discuss how the unknown orbital response terms in theFMO gradient are formulated. The collection of the response


129.186.176.217 On: Wed, 02 Dec 2015 16:21:14


terms in the FMO gradient yields

Ua + a = −

N∑I

Ua,I,I −

N∑I>J

(U


a,J,J )

+N∑

I>J

(U


a,J,I J )

+ 4N∑

I>J

∑K �=I J

∑μν∈I J

vir∑r∈K

occ∑i∈K

�DI JμνU a,K

ri (μν|ri) .

(43)

As mentioned before, Ua = 0 only if there are no approx-

imations to the ESPs. In the case of the ES-DIM approxima-tion, Eq. (43) should be reformulated.

When the ES-DIM approximation is applied to avoidSCF calculations for dimers separated by more than LES−DIM,U

ain Eq. (19) can be rewritten as

Ua = −

N∑I

Ua,I,I

−N∑

I>J (RI J ≤LES−DIM)

(U


a,J,J )

+N∑


(U


a,J,I J )

= −N∑I

Ua,I,I +

N∑I>J (RI J ≤LES−DIM)

(U

a,I,I (J ) + Ua,J,J (I ))

�= 0, (44)

where the partial terms

Ua,X,X (Y ) = 4

occ∑i∈X

vir∑r∈X

U a,Xri

(uY

ri + vYri

), (45)

describe the contribution of a single fragment denoted Y to the

full sum over all Y in Ua,X,X

.Also, the following relation can be used:

Ua,I,I − U

a,I,I J = Ua,I,I (J )

, (46)

because [see Eq. (14)] the ESP for I J (V I J ) runs over allfragments excluding I and J, whereas the ESP for I excludesthe contribution from I. Therefore, in the difference expres-sion only J terms remain:

V Iri − V I J

ri = V I,I (J )ri ≡ u J

ri + v Jri . (47)

Ua

in Eq. (44) is in general nonzero and can be furthersimplified to

Ua = −

N∑I

Ua,I,I +

N∑I>J (RI J ≤LES−DIM)

(U

a,I,I (J ) + Ua,J,J (I ))

= −N∑

I>J (RI J >LES−DIM)

(U

a,I,I (J ) + Ua,J,J (I ))

, (48)

where the completeness relation is used:

N∑I

Ua,I,I =

N∑I

N∑J �=I

Ua,I,I (J ) =

N∑I>J

(U

a,I,I (J ) + Ua,J,J (I ))

.

(49)

For the derivatives of Eq. (42) the collection of responseterms for all IJ is

N∑I>J (RI J >LES−DIM)

[Tr

(∂DI

∂auJ

)+ Tr

(∂DJ

∂auI

)

+∑μν∈I

∑λσ∈J

∂ DIμν

∂aD J

λσ (μν|λσ )

+∑μν∈I

∑λσ∈J

DIμν

∂ D Jλσ

∂a(μν|λσ )

]

=N∑

I>J (RI J >LES−DIM)

(U

a,I,I (J ) + Ua,J,J (I )

)

−2N∑

I �=J (RI J >LES−DIM)

occ∑i j∈I

Sa,Ij i

(u J

ji + v Jji

). (50)

The first term on the right-hand side of Eq. (50) cancelsout with U

ain Eq. (48). Since the derivative terms of Eq. (42)

are already implemented,83 one can obtain the fully analyticenergy gradients for the ES-DIM approximation by calculat-ing the following response contribution to the gradient:

Ua + a

= 4N∑


∑K �=I J

∑μν∈I J

vir∑r∈K

occ∑i∈K

�DI JμνU a,K

ri (μν|ri) .

(51)

E. Implementation

To further facilitate solution of the SCZV equations,it is useful to reformulate them in the AO basis, therebyavoiding the expensive integral transformation.90 ExaminingEqs. (39), (32), and (33), the main computational effort isspent on two-electron integral terms like

∑kl(kl|ri)Z K

kl inEq. (39). By utilizing the MO i expansion over AOs μ,

|i〉 =∑

μ

Cμi |μ〉 . (52)

The two-electron integral terms in Eq. (39) can be trans-formed according to

vir∑k∈K

occ∑l∈K

(kl|ri)Z Kkl =

vir∑k∈K

occ∑l∈K

C K∗μk C K

νl (μν|ri)Z Kkl

=∑μν

(μν|ri)Z̃ Kμν, (53)


129.186.176.217 On: Wed, 02 Dec 2015 16:21:14


where

Z̃ Kμν =

vir∑k∈K

occ∑l∈K

C K∗μk Z K

kl CKνl . (54)

Using Eq. (53) avoids the full transformation of the two-electron integrals to the MO basis, which leads to a significantreduction of the computation time and memory. However, Z̃ I

μν

is not symmetric; this is inconvenient and can be further im-proved by symmetrizing it:

ZIμν = 1

2

(Z̃ I

μν + Z̃ Iνμ

). (55)

It is possible to rewrite the SCZV equations [Eq. (40)]

using the symmetrized local Z-vector element ZKμν . Z

Kμν cor-

responds to the density matrix element Dμν in typical inte-gral programs, and therefore the SCZV method can be imple-mented using standard CPHF codes.

The Cauchy–Schwarz inequality, which estimates thevalue of (μν|ρσ ) based on the values of a smaller set ofintegrals with repeated indices, can be used in the SCZVto obtain a further reduction of computation time. TheCauchy–Schwarz inequality screening is usually applied to

the maximum element of the factor by which the integralsof interest are multiplied. The SCZV procedure is more ef-ficient because the Z-vector elements normally have smallervalues than the density matrix elements, and therefore, theCauchy–Schwarz integral screening can skip more terms. Inthe implementation of the response contribution, Eq. (37),the direct (AO basis) algorithm and the Cauchy–Schwarzinequality for the calculation of the two-electron integralsare included. As a result, the calculations do not need a largeamount of disk space or memory.

III. COMPUTATIONAL DETAILS

To verify that the response contribution that has beenderived and implemented in this study makes the FMO en-ergy gradients fully analytic, analytic gradients are comparedwith numerical gradients for molecular systems taken fromprevious studies:80, 91, 92 (H2O)64, the α-helix conformationof the alanine decamer (ALA)10 capped with –OCH3 and–NHCH3 groups, chignolin (PDB ID: 1UAO) solvated by157 water molecules, and the Trp-cage miniprotein construct(PDB ID: 1L2Y). The structures of (ALA)10 and 1L2Y were

FIG. 2. Geometric structures of (a) (H2O)64, (b) (ALA)10 capped with CH3CO– and –NHCH3 groups, (c) chignolin solvated by 157 water molecules, and (d)1L2Y [colored by chemical elements as light grey (H), dark grey (C), blue (N), and red (O)].


129.186.176.217 On: Wed, 02 Dec 2015 16:21:14


taken from earlier works.83, 92 Numerical gradients werecomputed with double differencing and a coordinate step of0.005 Å [except for (H2O)64 with 6–311G(d), where a 0.0005Å step was used].

For the fragmentation of a system that does not requirefragmenting covalent bonds, such as a water cluster, the HOPoperator is not needed. For the first test calculation, a watermolecule in (H2O)64 is assigned to a fragment. The geomet-rical structure of (H2O)64 was modeled by HYPERCHEM, op-timized by AMBER94,93 and then reoptimized at the FMO-RHF/6–31G level.80 The optimized structure of (H2O)64 isdepicted in Fig. 2(a).

For molecular systems in which fragmentation occursacross covalent bonds, the hybrid orbital operator contributesto the gradients both directly and via the response term fromthe Fock derivative, Eq. (25). The α-helix conformation of(ALA)10 with some intramolecular hydrogen bonds [shownin Fig. 2(b)] is chosen because in the case of incomplete ana-lytic gradients it is expected to have large errors in the gradi-ents compared to the β-strand or extended structures. Becausethis system is relatively small, the validity of the analytic gra-dients without approximations can also be accessed.

The FMO method has been interfaced with the ef-fective fragment potential (EFP) method, which explicitlytreats solvent molecules by adding a one-electron potentialto the Hamiltonian.91, 94 Chignolin is immersed in 157 watermolecules described by EFPs, and its structure is optimizedusing the combined FMO/EFP method at the RHF/cc-pVDZlevel.91 The optimized structure in Fig. 2(c) reproduces thePDB NMR structure well.80

Figure 2(d) depicts the structure of 1L2Y. Because 1L2Yis the largest system treated with the FMO method in thisstudy, it is an appropriate test case for the application of theelectrostatic dimer approximation which is necessary for effi-cient computations of large systems. In this calculation, thethreshold for the ES-DIM approximation LES−DIM was setto the default value, 2.0 and all other approximations wereturned off.34

For systems fragmented across covalent bonds, i.e.,(ALA)10, chignolin and 1L2Y, a one residue/one fragmentpartition was adopted. The gradient calculations for (H2O)64,(ALA)10, hydrated chignolin, and 1L2Y were performed atthe RHF/6–31G(d) level and diffuse functions were added tothe carboxyl groups of 1L2Y. Additionally, RHF/6–311G(d)was also used to calculate the gradients for (H2O)64.

The computation time of the SCZV procedure is ex-pected to be comparable to that of the SCC calculation. Thus,the timings of the calculations of fully analytic gradients inwhich both the SCC and SCZV steps were included weremeasured and compared. The timing calculations used the 32

TABLE I. The number of GDDI groups used in dividing 31 nodes.

Monomer step Dimer step

(H2O)64 16 16(ALA)10 5 15Hydrated chignolin 5 151L2Y 10 15

node SOROBAN cluster with Intel Pentium 4 central process-ing unit (CPU) 3.2 GHz nodes connected by Gigabit Ethernet.Table I lists separately for monomers and dimers the numberof GDDI groups in the gradient calculation for each system(in GDDI, in order to improve parallel efficiency, all computernodes are divided into groups and individual monomer anddimer calculations are distributed dynamically among thesegroups). Note that the number of GDDI groups in the SCZVstep is the same as in the SCC step. The parallel efficiency iscalculated for the gradient and SCZV calculations for (H2O)64

using 1, 2, 4, 8, 16, and 32 nodes of the SOROBAN cluster. Forthis calculation, the number of GDDI groups is equal to thenumber of nodes.

IV. RESULTS AND DISCUSSION

A. Fully analytic energy gradients withoutapproximations

In this subsection, the fully analytic gradients withoutapproximations are discussed. It is important to numericallyverify that the gradients are fully analytic using U

a = 0[Eq. (19)].

As mentioned previously, for (H2O)64, the energy gradi-ents do not involve the HOP term and only the response termcontributes to the analytic gradients. In Table II, the root-mean-square (RMS) value in the numerical gradients becameidentical with the new analytic gradients (0.005997 a.u.).However, the RMS value in the old (conventional) analyticgradients, in which only the response contribution is ne-glected, deviates by 0.0001 a.u. from the numerical and theanalytic ones. For the maximum absolute gradient values(MAX grad.), the new analytic gradient value is in goodagreement with the corresponding numerical value, while theconventional value differs by 0.00025 a.u. The latter is nota negligible error when aiming for fully analytic gradients.The RMS of the errors for the new analytic gradient relativeto the numerical gradient (the RMS error in the analyticgradients) is negligibly small (0.000011 a.u.), which is animprovement by a factor of 20 over the RMS error in theold analytic gradients (0.000231 a.u.). This improvementis visualized in Fig. 3(a), which plots the errors of the newanalytic gradient and the old analytic gradient relative to thenumeric gradient against the total 576 gradient elements. Thenew analytic gradient values converge to zero, while the oldanalytic gradient values have large deviations.

For systems fragmented across covalent bonds, theHOP contribution to the response term must be considered.(ALA)10 is such a system and due to its size is a good test forcalculating numerical gradients without approximations at arelatively moderate computational cost. Note that the old an-alytic gradient method that is illustrated here did include thedirect contribution of HOP derivatives to the gradients, sincethis contribution was introduced in a previous study.84 Table IIshows that the three types of gradients are in good agreement.Even though the RMS value in the old analytic gradients isvery close to that of numerical gradients, the RMS errors andmaximum absolute gradient error for the old analytic gradientdeviates by significant amounts (0.000160 and 0.000580a.u., respectively). On the other hand, the very small errors


129.186.176.217 On: Wed, 02 Dec 2015 16:21:14


TABLE II. The RMS and the maximum absolute values (MAX grad.) of the gradient elements for FMO2–RHF.RMS of the errors of the analytic gradients relative to the numeric gradients (RMS error) and the error in themaximum absolute gradient values (MAX error). All values are in a.u.

Gradient RMS MAX grad. RMS error MAX error

(H2O)64 without approximations, 6–31G(d)Numeric 0.005997 0.014090 . . . . . .Analytic 0.005997 0.014094 0.000011 0.000035Conventional 0.006132 0.014347 0.000231 0.000961

(H2O)64 with LES−DIM = 2.0, 6–31G(d)Numeric 0.005997 0.014090 . . . . . .Analytic 0.005997 0.014094 0.000011 0.000034Conventional 0.006132 0.014347 0.000232 0.000975

(H2O)64 with LES−DIM = 2.0, 6–311G(d)Numeric 0.007467 0.018020 . . . . . .Analytic 0.007469 0.018029 0.000003 0.000010Conventional 0.007745 0.018507 0.000433 0.002280

(ALA)10 without approximations, 6–31G(d)Numeric 0.011658 0.043197 . . . . . .Analytic 0.011664 0.043222 0.000009 0.000039Conventional 0.011655 0.043169 0.000160 0.000580

Hydrated (EFP) chignolin without approximations, 6–31G(d)Numeric 0.000284 0.002010 . . .Fully analytic 0.000280 0.001995 0.000017 0.000092Conventional 0.000215 0.001542 0.000191 0.001501

1L2Y with LES−DIM = 2.0, 6–31G(d)Numeric 0.019779 0.047540 . . . . . .Analytic 0.019780 0.047543 0.000015 0.000037Conventional 0.019822 0.047418 0.000335 0.001001

in the new analytic gradient values indicate that they arefully analytic, and that the HOP contribution to the responseterm is properly included. In Fig. 3(b), as in Fig. 3(a), onecan see that the new analytic gradients are fully analytic forevery gradient element. Comparing Fig. 3(b) ((ALA)10) withFig. 3(a) (the water cluster), the former has smaller gradienterrors for the old analytic gradients. Since the response termis related to the ESPs, the environmental potential affects thewater clusters more significantly than the polypeptide.

Most biological processes occur in aqueous solution. Tomodel one of these processes, one may consider a protein ina water droplet. For molecular dynamics and geometry opti-mization processes, an accurate solvent model must be com-bined with the application of FMO to the solute. As the firsttest calculation toward this goal, hydrated chignolin is chosenfor the FMO/EFP framework. In order to obtain the fully ana-lytic energy gradients in this framework, it is necessary to addthe EFP derivatives to the Fock derivative term, Eq. (25) andprobably to modify the A matrix in Eq. (38). In this study,however, the EFP-related many-body polarization contribu-tion to the FMO ESP response term was neglected. There-fore, the FMO/EFP energy gradients are not fully analytic. So,the accuracy of the FMO/EFP new energy gradients is shownonly for the solute (FMO) molecule (chignolin). As seen inTable II, the accuracy of the FMO/EFP new energy gradientsis similar to that for the full treatment of FMO, but the max-imum absolute gradient error is larger (0.000092 a.u.) thanthose for the other test systems, and there are slight deviations

in the FMO/EFP new analytic gradients in Fig. 3(c). Never-theless, the maximum absolute new analytic gradient error isimproved by a factor of 16 compared to the old analytic gradi-ent value. This encouraging result provides motivation to de-velop the fully analytic energy gradients for FMO/EFP as wellas FMO combined with the polarizable continuum model.92, 95

For the chosen systems, the wall clock times requiredfor the SCC, the SCZV, and the total FMO calculation weremeasured. Table III shows that for the gradient calculationwithout approximations for (H2O)64, the SCC calculationtook 24.3 s, which is comparable to 29.5 s. in the correspond-ing SCZV calculation. These times are small relative to thetotal computational time of over 400 s. For (ALA)10, theSCZV calculation takes only 64% of the computation time ofthe corresponding SCC calculation. For hydrated chignolin,the SCZV calculation requires a similar percentage of thecomputation time. These results illustrate why the Z-vectorsconverge more rapidly; the calculation of two-electronintegrals is faster because the Cauchy–Schwarz inequalityis more efficient when applied to the Z-vectors rather thanto the monomer densities (used in ESPs or two-electronintegrals for monomers).

B. Fully analytic energy gradients in theES dimer approximation

The FMO energy gradients have been shown to be fullyanalytic by introducing the response contribution. The com-putation time in the calculation of the response term is com-


129.186.176.217 On: Wed, 02 Dec 2015 16:21:14


FIG. 3. Errors of the analytic gradient elements relative to the numeric gradient elements for (a) (H2O)64, (b) (ALA)10 capped with CH3O– and –NHCH3groups, (c) chignolin solvated by 157 water molecules, (d) 1L2Y, all calculated at the RHF/6–31G(d) level, and (e) (H2O)64 at the RHF/6–311G(d) [all thegradient elements for (a)–(c), (e), and the representative elements for (d)]. Black diamond: fully analytic energy gradients. Yellow square: the conventionalgradients.

parable to or less than that required for the SCC calculation.This implies that although it is practical to calculate the re-sponse term itself, it is still expensive to calculate the fullyanalytic energy gradients for a larger molecule because of thedifficulty in performing a large number of dimer SCF calcu-lations without approximations.

As discussed above, the fully analytic energy gradientswith the ES-DIM approximation have been derived. The pur-

pose of this subsection is to check numerically that the FMOenergy gradients are fully analytic within the ES-DIM approx-imation and to discuss the timings.

The ES-DIM approximation was first applied to (H2O)64

using both the 6–31G(d) and 6–311(d) basis sets. Table IIshows that with LES−DIM = 2.0, the RMS value, the MAX gra-dient value, and the RMS error for the new analytic gradientsare identical to those without approximations. Additionally,


129.186.176.217 On: Wed, 02 Dec 2015 16:21:14


TABLE III. Wall clock time in the SCC, SCZV, and the total computa-tion using a 31 single 3.2 GHz Pentium 4 cluster (all in seconds), FMO2-RHF/6–31G(d).

SCC SCZV TOTAL

(H2O)64 without approximationsAnalytic 24.3 29.5 467.2Conventional 23.0 . . . 431.4

(H2O)64 with LES−DIM = 2.0Analytic 29.4 28.1 261.0Conventional 24.3 . . . 227.8

(ALA)10 without approximationsAnalytic 359.7 230.4 1240.3Conventional 358.4 . . . 1016.3

hydrated chignolin without approximationsAnalytic 1383.6 883.2 7299.8Conventional 1382.4 . . . 6419.2

1L2Y with LES−DIM = 2.0Analytic 3429.1 2320.7 12942.1Conventional 3380.4 . . . 10597.1

the maximum gradient error is negligibly small. These resultsimply that the ES-DIM approximation is a suitable choice forthis system.

The errors for the old analytic gradient increase withthe basis set, as can be seen in Table II. For example,the maximum gradient error increases from 0.000975 to0.002280 when going from 6–31G(d) to 6–311(d). Theerror in the new analytic versus numeric gradient is about15 times smaller and is probably due to the error in thenumerical gradient itself. There is also no basis set effect onthe error when the new analytic gradient is employed. Thepictorial representations of the errors in Figs. 3(a) and 3(e)demonstrate the quality of the gradients.

For another system requiring fragmentation across cova-lent bonds, consider 1L2Y, which consists of 20 amino acidresidues. Since the calculation of numerical gradients for sucha relatively large molecule is time consuming, a subsystemis chosen, i.e., the 38 atoms defining 19 detached bonds be-tween 20 fragments (for which atoms the error in the old an-alytic gradient is the largest as found in the earlier study84).With LES−DIM = 2.0, there are 92 SCF and 98 approximated(ES-DIM) dimers. In Table II [see also Fig. 3(d)], the cor-responding RMS value and the maximum absolute gradientvalue show that the new analytic gradients are fully analytic incomparison to the numerical values. The corresponding RMSand MAX gradient errors are similar in accuracy to the fullyanalytic gradient values of the other systems. The errors aremuch improved compared to those from the old analytic gra-dient calculation, especially for the MAX gradient which ismore accurate by a factor of 27, indicating a significant con-tribution from the response term in this system.

As shown in Table III, for (H2O)64, the total wall clocktime for the fully analytic gradient calculation with the ap-proximation (261.0 s) is less than the timing without approx-imations (467.2 s). In comparing the SCZV and the SCC cal-culation for this water cluster, it is reasonable that the formertook less computation time. When using the approximation,

FIG. 4. The parallel efficiency using the personal computer cluster of 1, 2,4, 8, 16 and, 32 CPUs (Intel Pentium 4 3.2 GHz). Solid line: for the totalgradient calculation. Dashed line: for the SCZV calculation.

the ratio of the computation time in SCZV to SCC is normallyless than 1. This implies that the SCZV calculation is faster;this trend is independent of the ES-DIM approximation. For1L2Y, the SCZV calculations took approximately 70% of thetime required for the SCC calculations, and it is expected thatthis ratio will decrease for larger systems.

For (H2O)64, the parallel efficiency S(n), shown in Fig. 4,was calculated using the following expression:

S(n) =T1Tn

n× 100, (56)

where T1 and Tn represent the computation time using 1node and n nodes, respectively. Hundred percent efficiencymeans that the calculation is n times faster on n nodes. Theresults show that for (H2O)64 the parallel efficiency is over97.8% for all numbers of nodes (n = 1, 2, 4, 8, 16, and 32)both in the total gradient calculation (solid line) and SCZVcalculation (dashed line). For n = 16, the parallel efficiencyin the SCZV calculation drops slightly from that for n = 8.For (H2O)64, 64 SCZV calculations should be distributedinto 16 nodes (divided into 16 GDDI groups). However, eachSCZV calculation takes a different amount of time, becausethe number of iterations and the integral screening dependupon the fragment; the GDDI calculations are dynamicallydivided over groups, and there is some granularity (i.e.,some groups finish ahead of others). This is why the parallelefficiency drops at n = 16. The superlinear scaling over 100%for a small number of nodes is also observed in FMO–RHFcalculations89 and is thought to originate from externalfactors like CPU cache efficiency.

V. CONCLUSIONS

The CPHF equations and the Z-vector equations in theFMO framework have been derived to compute the fully an-alytic energy gradients. One outcome of this study is thederivation and implementation of the SCZV equations, bywhich the time-consuming Z-vector equations of the wholesystem reduce to those of the monomer fragments. Addi-tionally, the SCZV procedure is parallelized by the use of


129.186.176.217 On: Wed, 02 Dec 2015 16:21:14


GDDI.89 It was shown that the FMO energy gradients arefully analytic in the electrostatic dimer approximation. Thisleads to a significant reduction of the total computation time.Nearly fully analytic energy gradients have been successfullyimplemented for the combined FMO/EFP method as well.

The use of the SCZV procedure is not limited to thegradient calculation. The calculation of the second derivative(Hessian) matrix is necessary to calculate important prop-erties such as the IR spectrum, Raman spectrum, and NMRchemical shifts. To calculate the fully analytic Hessian, onemust solve the CPHF equations, and the SCZV procedurewould play an essential role. The SCZV procedure could alsobe found useful in other fragmentation methods that requirethe response term (some methods87,96 do not need it withrespect to the field, although the Mulliken charge derivativesdo require it97,80).

The analytic gradient equations have been derived atthe Hartree–Fock level of theory. For proteins, dispersionis crucial for determining the folded structure58 and forprotein-ligand binding. Therefore, the fully analytic energygradient should be extended to electron correlation methodssuch as MP2. For practical calculations, such as FMO–MDto study protein folding, it will be necessary to develop thefully analytic gradient with the point charge (ESP–PTC)approximation.

As mentioned earlier, MD simulations are an importantapplication of FMO; however, FMO–MD has been limitedmainly to molecular clusters.70–72, 75–78 As found in this study,with the introduction of fully analytic energy gradients, re-liable MD simulation with perfect energy conservation willnow be possible.

ACKNOWLEDGMENTS

This work has been supported by the Next GenerationSuper Computing Project, Nanoscience Program (MEXT,Japan), and by a US National Science Foundation PetascaleApplications grant. K.B. is supported by a US Department ofEnergy Computational Science Graduate Fellowship.

1D. A. Pearlman, D. A. Case, J. W. Caldwell, W. S. Ross, T. E. Cheatham,S. Debolt, D. Ferguson, G. Seibel, and P. Kollman, Comput. Phys.Commun. 91, 1 (1995).

2P. Kollman, I. Massova, C. Reyes, B. Kuhn, S. H. Huo, L. Chong, M. Lee,T. Lee, Y. Duan, W. Wang, O. Donini, P. Cieplak, J. Srinivasan, D. A. Case,and T. E. Cheatham, Acc. Chem. Res. 33, 889 (2000).

3D. Bashford and D. A. Case, Annu. Rev. Phys. Chem. 51, 129 (2000).4J. Gao and D. Truhlar, Annu. Rev. Phys. Chem. 53, 467 (2002).5A. D. Mackerell, J. Comput. Chem. 25, 1584 (2004).6W. Wang, O. Donini, C. M. Reyes, and P. A. Kollman, Annu. Rev. Biophys.Biomol. Struct. 30, 211 (2001).

7A. Warshel, Annu. Rev. Biophys. Biomol. Struct. 32, 425 (2003).8R. Car and M. Parrinello, Phys. Rev. Lett. 55, 2471 (1985).9J. L. Gao, J. Phys. Chem. B 101, 657 (1997).

10J. L. Gao, J. Chem. Phys. 109, 2346 (1998).11H. B. Schlegel, J. M. Millam, S. S. Iyengar, G. A. Voth, A. D. Daniels, G.

E. Scuseria, and M. J. Frisch, J. Chem. Phys. 114, 9758 (2001).12W. Xie and J. Gao, J. Chem. Theory Comput. 3, 1890 (2007).13W. Xie, M. Orozco, D. G. Truhlar, and J. Gao, J. Chem. Theory Comput.

5, 459 (2009).14S. Y. Wu and C. S. Jayanthi, Phys. Rep. 358, 1 (2002).15S. Goedecker and G. E. Scuseria, Comput. Sci. Eng. 5, 14 (2003).

16D. Hankins, J. W. Moskowitz, and F. H. Stillinger, J. Chem. Phys. 53, 4544(1970).

17K. Morokuma, J. Chem. Phys. 55, 1236 (1971).18K. Ohno and H. Inokuchi, Theor. Chim. Acta 26, 331 (1972).19P. Otto and J. Ladik, Chem. Phys. 8, 192 (1975).20H. Stoll and H. Preuß, Theor. Chim. Acta 46, 11 (1977).21Z. Barandiaran and L. Seijo, J. Chem. Phys. 89, 5739 (1988).22H. Kubota, Y. Aoki, and A. Imamura, Bull. Chem. Soc. Jpn 67, 13

(1994).23H. R. Leverentz and D. G. Truhlar, J. Chem. Theory Comput. 5, 1573

(2009).24M. S. Gordon, J. M. Mullin, S. R. Pruitt, L. B. Roskop, L. V. Slipchenko,

and J. A. Boatz, J. Phys. Chem. B 113, 9646 (2009).25R. A. Mata, H. Stoll, and B. J. C. Cabral, J. Chem. Theory Comput. 5, 1829

(2009).26L. Huang, L. Massa, I. Karle, and J. Karle, Proc. Natl. Acad. Sci. U.S.A.

106, 3664 (2009).27Y. Tong, Y. Mei, J. Z. H. Zhang, L. L. Duan, and Q. G. Zhang, J. Theor.

Comput. Chem. 8, 1265 (2009).28P. Söderhjelm and U. Ryde, J. Phys. Chem. A 113, 617 (2009).29S. D. Yeole and S. Gadre, J. Chem. Phys. 132, 094102 (2010).30A. Pomogaeva, F. L. Gu, A. Imamura, and Y. Aoki, Theor. Chem. Acc. 125,

453 (2010).31T. Touma, M. Kobayashi, and H. Nakai, Chem. Phys. Lett. 485, 247

(2010).32X. He and K. M. Merz, J. Chem. Theory Comput. 6, 405 (2010).33K. Kitaura, E. Ikeo, T. Asada, T. Nakano, and M. Uebayasi, Chem. Phys.

Lett. 313, 701 (1999).34T. Nakano, T. Kaminuma, T. Sato, K. Fukuzawa, Y. Akiyama, M. Uebayasi,

and K. Kitaura, Chem. Phys. Lett. 351, 475 (2002).35D. G. Fedorov and K. Kitaura, J. Phys. Chem. A 111, 6904 (2007).36The Fragment Molecular Orbital Method: Practical Applications to Large

Molecular Systems, edited by D. G. Fedorov and K. Kitaura (CRC Press,Boca Raton, FL, 2009).

37D. G. Fedorov and K. Kitaura, Chem. Phys. Lett. 433, 182 (2006).38J. L. Pascual and L. Seijo, J. Chem. Phys. 102, 5368 (1995).39Q. Cui, M. Elstner, E. Kaxiras, T. Frauenheim, and M. Karplus, J. Phys.

Chem. B 105, 569 (2001).40D. G. Fedorov and K. Kitaura, J. Chem. Phys. 121, 2483 (2004).41Y. Mochizuki, K. Yamashita, K. Fukuzawa, K. Takematsu, H. Watanabe, N.

Taguchi, Y. Okiyama, M. Tsuboi, T. Nakano, and S. Tanaka, Chem. Phys.Lett. 493, 346 (2010).

42D. G. Fedorov and K. Kitaura, J. Chem. Phys. 123, 134103/1 (2005).43S.-I. Sugiki, N. Kurita, Y. Sengoku, and H. Sekino, Chem. Phys. Lett. 382,

611 (2003).44D. G. Fedorov and K. Kitaura, Chem. Phys. Lett. 389, 129 (2004).45D. G. Fedorov and K. Kitaura, J. Chem. Phys. 122, 054108/1 (2005).46Y. Mochizuki, S. Koikegami, S. Amari, K. Segawa, K. Kitaura, and T.

Nakano, Chem. Phys. Lett. 406, 283 (2005).47Y. Mochizuki, K. Tanaka, K. Yamashita, T. Ishikawa, T. Nakano, S. Amari,

K. Segawa, T. Murase, H. Tokiwa, and M. Sakurai, Theor. Chem. Acc. 117,541 (2007).

48M. Chiba, D. G. Fedorov, and K. Kitaura, Chem. Phys. Lett. 444, 346(2007).

49M. Chiba, D. G. Fedorov, and K. Kitaura, J. Chem. Phys. 127, 104108(2007).

50M. Chiba, D. G. Fedorov, and K. Kitaura, J. Comput. Chem. 29, 2667(2008).

51S. R. Pruitt, D. G. Fedorov, K. Kitaura, and M. S. Gordon, J. Chem. TheoryComput. 6, 1 (2010).

52B. Auer, M. V. Pak, and S. Hammes-Schiffer, J. Phys. Chem. C 114, 5582(2010).

53I. Nakanishi, D. G. Fedorov, and K. Kitaura, Proteins: Struct., Funct.,Bioinf. 68, 145 (2007).

54T. Sawada, T. Hashimoto, H. Tokiwa, T. Suzuki, H. Nakano, H. Ishida, M.Kiso, and Y. Suzuki, Glycoconjugate J. 25, 805 (2008).

55T. Watanabe, Y. Inadomi, K. Fukuzawa, T. Nakano, S. Tanaka, L. Nilsson,and U. Nagashima, J. Phys. Chem. B 111, 9621 (2007).

56T. Sawada, D. G. Fedorov, and K. Kitaura, Int. J. Quantum Chem. 109,2033 (2009).

57T. Ishida, D. G. Fedorov, and K. Kitaura, J. Phys. Chem. B 110, 1457(2006).

58X. He, L. Fusti-Molnar, G. Cui, and K. M. J. Merz, J. Phys. Chem. B 113,5290 (2009).


129.186.176.217 On: Wed, 02 Dec 2015 16:21:14

http://dx.doi.org/10.1016/0010-4655(95)00041-D

http://dx.doi.org/10.1016/0010-4655(95)00041-D

http://dx.doi.org/10.1021/ar000033j

http://dx.doi.org/10.1146/annurev.physchem.51.1.129

http://dx.doi.org/10.1146/annurev.physchem.53.091301.150114

http://dx.doi.org/10.1002/jcc.20082

http://dx.doi.org/10.1146/annurev.biophys.30.1.211



http://dx.doi.org/10.1103/PhysRevLett.55.2471

http://dx.doi.org/10.1021/jp962833a

http://dx.doi.org/10.1063/1.476802

http://dx.doi.org/10.1063/1.1372182

http://dx.doi.org/10.1021/ct700167b

http://dx.doi.org/10.1021/ct800239q

http://dx.doi.org/10.1016/S0370-1573(01)00035-7

http://dx.doi.org/10.1109/MCISE.2003.1208637

http://dx.doi.org/10.1063/1.1673986

http://dx.doi.org/10.1063/1.1676210

http://dx.doi.org/10.1007/BF01036246

http://dx.doi.org/10.1016/0301-0104(75)80107-8

http://dx.doi.org/10.1007/BF02401407

http://dx.doi.org/10.1063/1.455549

http://dx.doi.org/10.1246/bcsj.67.13

http://dx.doi.org/10.1021/ct900095d

http://dx.doi.org/10.1021/jp811519x

http://dx.doi.org/10.1021/ct9001653

http://dx.doi.org/10.1073/pnas.0900403106

http://dx.doi.org/10.1142/S0219633609005313

http://dx.doi.org/10.1142/S0219633609005313

http://dx.doi.org/10.1021/jp8073514

http://dx.doi.org/10.1063/1.3324702

http://dx.doi.org/10.1007/s00214-009-0576-2

http://dx.doi.org/10.1016/j.cplett.2009.12.043

http://dx.doi.org/10.1021/ct9006635

http://dx.doi.org/10.1016/S0009-2614(99)00874-X

http://dx.doi.org/10.1016/S0009-2614(99)00874-X

http://dx.doi.org/10.1016/S0009-2614(01)01416-6



http://dx.doi.org/10.1063/1.469264



http://dx.doi.org/10.1063/1.1769362



http://dx.doi.org/10.1063/1.2007588



http://dx.doi.org/10.1063/1.1835954


http://dx.doi.org/10.1007/s00214-006-0181-6


http://dx.doi.org/10.1063/1.2772850




http://dx.doi.org/10.1021/jp907193g

http://dx.doi.org/10.1002/prot.21389

http://dx.doi.org/10.1002/prot.21389

http://dx.doi.org/10.1007/s10719-008-9141-9

http://dx.doi.org/10.1021/jp071710v

http://dx.doi.org/10.1002/qua.22051




59D. G. Fedorov, P. V. Avramov, J. H. Jensen, and K. Kitaura, Chem. Phys.Lett. 477, 169 (2009).

60N. Taguchi, Y. Mochizuki, T. Nakano, S. Amari, K. Fukuzawa, T. Ishikawa,M. Sakurai, and S. Tanaka, J. Phys. Chem. B 113, 1153 (2009).

61T. Ikegami, T. Ishida, D. G. Fedorov, K. Kitaura, Y. Inadomi, H. Umeda,M. Yokokawa, and S. Sekiguchi, J. Comput. Chem. 31, 447 (2010).

62K. A. Kistler and S. J. Matsika, J. Phys. Chem. A 113, 12396(2009).

63S. Amari, M. Aizawa, J. Zang, K. Fukuzawa, Y. Mochizuki, Y. Iwasawa,H. Nakata, K. Chuman, and T. Nakano, J. Chem. Inf. Comput. Sci. 46, 221(2006).

64T. Yoshida, T. Fujita, and H. Chuman, Current Computer - Aided DrugDesign 5, 38 (2009).

65T. Yoshida, Y. Munei, S. Hitaoka, and H. Chuman, J. Chem. Inf. Model.50, 850 (2010).

66Q. Gao, S. Yokojima, D. G. Fedorov, K. Kitaura, M. Sakurai, and S. Naka-mura, J. Chem. Theory Comput. 6, 1428 (2010).

67Y. Komeiji, T. Nakano, K. Fukuzawa, Y. Ueno, Y. Inadomi, T. Nemoto,M. Uebayasi, D. G. Fedorov, and K. Kitaura, Chem. Phys. Lett. 372, 342(2003).

68T. Ishimoto, H. Tokiwa, H. Teramae, and U. Nagashima, Chem. Phys. Lett.387, 460 (2004).

69T. Ishimoto, H. Tokiwa, H. Teramae, and U. Nagashima, J. Chem. Phys.122, 094905 (2005).

70Y. Mochizuki, Y. Komeiji, T. Ishikawa, T. Nakano, and H. Yamataka,Chem. Phys. Lett. 437, 66 (2007).

71M. Sato, H. Yamataka, Y. Komeiji, Y. Mochizuki, T. Ishikawa, and T.Nakano, J. Am. Chem. Soc. 130, 2396 (2008).

72Y. Komeiji, T. Ishikawa, Y. Mochizuki, H. Yamataka, and T. Nakano, J.Comput. Chem. 30, 40 (2009).

73Y. Komeiji, Y. Mochizuki, T. Nakano, and D. G. Fedorov , J. Mol. Struct.:THEOCHEM 898, 2 (2009).

74T. Fujita, H. Watanabe, and S. Tanaka, J. Phys. Soc. Jpn. 78, 104723(2009).

75T. Fujiwara, Y. Mochizuki, Y. Komeiji, Y. Okiyama, H. Mori, T. Nakano,and E. Miyoshi, Chem. Phys. Lett. 490, 41 (2010).

76T. Fujiwara, H. Mori, Y. Mochizuki, H. Tatewaki, and E. Miyoshi, J. Mol.Struct.: THEOCHEM 949, 28 (2010).

77M. Sato, H. Yamataka, Y. Komeiji, Y. Mochizuki, and T. Nakano,Chem.-Eur. J. 16, 6430 (2010).

78Y. Komeiji, Y. Mochizuki, and T. Nakano, Chem. Phys. Lett. 484, 380(2010).

79K. Kitaura, S. I. Sugiki, T. Nakano, Y. Komeiji, and M. Uebayasi, Chem.Phys. Lett. 336, 163 (2001).

80T. Nagata, D. G. Fedorov, and K. Kitaura, Chem. Phys. Lett. 475, 124(2009).

81M. W. Schmidt, K. K. Baldridge, J. A. Boatz, S. T. Elbert, M. S.Gordon, J. J. Jensen, S. Koseki, N. Matsunaga, K. A. Nguyen, S. Su,T. L. Windus, M. Dupuis, and J. A. Montgomery, J. Comput. Chem. 14,1347 (1993).

82M. S. Gordon and M. W. Schmidt, Theory and Applications of Computa-tional Chemistry: The First Forty Years (Elsevier, Amsterdam, 2005).

83D. G. Fedorov, T. Ishida, M. Uebayasi, and K. Kitaura, J. Phys. Chem. A111, 2722 (2007).

84T. Nagata, D. G. Fedorov, and K. Kitaura, Chem. Phys. Lett. 492, 302(2010).

85H. P. Hratchian, P. V. Parandekar, K. Raghavachari, M. J. Frisch, and T.Vreven, J. Chem. Phys. 128, 034107 (2008).

86N. J. Mayhall, K. Raghavachari, and H. P. Hratchian, J. Chem. Phys. 132,114107 (2010).

87W. Xie, L. Song, D. G. Truhlar, and J. Gao, J. Chem. Phys. 128, 234108(2008).

88Y. Yamaguchi, H. F. Schaefer III, Y. Osamura, and J. Goddard, A NewDimension to Quantum Chemistry: Analytical Derivative Methods inAb Initio Molecular Electronic Structure Theory (Oxford University Press,New York, 1994).

89D. G. Fedorov, R. M. Olson, K. Kitaura, M. S. Gordon, and S. Koseki, J.Comput. Chem. 25, 872 (2004).

90Y. Alexeev, M. W. Schmidt, T. L. Windus, and M. S. Gordon, J. Comput.Chem. 28, 1685 (2007).

91T. Nagata, D. G. Fedorov, K. Kitaura, and M. S. Gordon, J. Chem. Phys.131, 024101 (2009).

92D. G. Fedorov, K. Kitaura, H. Li, J. H. Jensen, and M. S. Gordon, J. Com-put. Chem. 27, 976 (2006).

93W. D. Cornell, P. Cieplak, C. I. Bayly, I. R. Gould, K. M. Merz, D. M.Ferguson, D. C. Spellmeyer, T. Fox, J. W. Caldwell, and P. A. Kollman, J.Am. Chem. Soc. 117, 5179 (1995).

94T. Nagata, D. G. Fedorov, T. Sawada, K. Kitaura, and M. S. Gordon, J.Chem. Phys. 134, 034110 (2011).

95H. Li, D. G. Fedorov, T. Nagata, K. Kitaura, J. H. Jensen, and M. S. Gordon,J. Comput. Chem. 31, 778 (2010).

96C. Steinmann, D. G. Fedorov, and J. H. Jensen, J. Phys. Chem. A 114, 8705(2010).

97K. Sakata, J. Phys. Chem. A 104, 10001 (2000).


129.186.176.217 On: Wed, 02 Dec 2015 16:21:14



http://dx.doi.org/10.1021/jp808151c


http://dx.doi.org/10.1021/jp901601u

http://dx.doi.org/10.1021/ci050262q

http://dx.doi.org/10.2174/157340909787580845

http://dx.doi.org/10.2174/157340909787580845

http://dx.doi.org/10.1021/ci100068w

http://dx.doi.org/10.1021/ct100006n

http://dx.doi.org/10.1016/S0009-2614(03)00430-5


http://dx.doi.org/10.1063/1.1857481


http://dx.doi.org/10.1021/ja710038c



http://dx.doi.org/10.1016/j.theochem.2008.07.001


http://dx.doi.org/10.1143/JPSJ.78.104723




http://dx.doi.org/10.1002/chem.201000442


http://dx.doi.org/10.1016/S0009-2614(01)00099-9

http://dx.doi.org/10.1016/S0009-2614(01)00099-9





http://dx.doi.org/10.1063/1.2814164

http://dx.doi.org/10.1063/1.3315417

http://dx.doi.org/10.1063/1.2936122





http://dx.doi.org/10.1063/1.3156313



http://dx.doi.org/10.1021/ja00124a002

http://dx.doi.org/10.1021/ja00124a002

http://dx.doi.org/10.1063/1.3517110

http://dx.doi.org/10.1063/1.3517110


http://dx.doi.org/10.1021/jp101498m