(I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I)...

56
Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions i. Ground-state wave function of Bosons Consider a one dimensional Schrödinger equation for the ground state, which may be taken real. We will ask: is this wave function of the same sign every- where? What we mean is: is it all non-negative or all non positive? Of course, if it is all non-positive then we can multiply it by -1 and ask again: is it all non- negative? Saying it is of a constant sign is the same as saying it has no nodes, i.e. it does not go through the x- axis anywhere. Exercise: Prove the following theorem: Theorem: The ground-state wave function of a 1D wave non-degenerate Schrödinger equation has no nodes. Proof. Consider the functional: [] = [] + [] = 2 () +∫ () () (I.b.1) The ground-state wave function is the minimize of this (I.b.1)functional over all normalized real wave functions. Note that [] = [||] and [] = [||]. Exercise (the kinetic energy functional appearing in Eq. (I.b.1)): Let [] = () () (the usual definition of kinetic energy) and [] =

Transcript of (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I)...

Page 1: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 1

copy Roi Baer

(I) Introductory background topics

a) The Schroumldinger equation

XXX

b) Bosons and Fermions

i Ground-state wave function of Bosons

Consider a one dimensional Schroumldinger equation for the ground state which

may be taken real We will ask is this wave function of the same sign every-

where What we mean is is it all non-negative or all non positive Of course

if it is all non-positive then we can multiply it by -1 and ask again is it all non-

negative Saying it is of a constant sign is the same as saying it has no nodes

ie it does not go through the x- axis anywhere

Exercise Prove the following theorem

Theorem The ground-state wave function of a 1D wave non-degenerate

Schroumldinger equation has no nodes

Proof Consider the functional

119864[120595] = 119879[120595] + 119881[120595] =ℏ

2119898int 120595 (119909) 119889119909

+ int 120595(119909) 119881(119909)119889119909

(Ib1)

The ground-state wave function is the minimize of this lrm(Ib1)functional over

all normalized real wave functions Note that 119879[120595] = 119879[|120595|] and 119881[120595] =

119881[|120595|]

Exercise (the kinetic energy functional appearing in Eq (Ib1)) Let 119879 [120595] =

minusℏ

int 120595(119909) 120595 (119909)119889119909

(the usual definition of kinetic energy) and 119879 [120595] =

Electron Density Functional Theory Page 2

copy Roi Baer

int |120595 (119909)| 119889119909

Show that

1) 119879 [120595] = 119879 [120595] for any smooth 120595 that decays to zero as |119909| rarr infin

2) 119879 [120595 ] = minus119879 [120595 ] for 120595 (119909) = radic120572119890 | | Which of the two values is

physically unacceptable as kinetic energy

3) What do you think will happen when the two definitions of kinetic en-

ergies are applied to the family of functions 120595 (119909) = radic120572119890 radic Us-

ing numerical integration determine what is the limit lim rarr 119879 [120595 ] Is

it 119879 [120595 ] or 119879 [120595 ]

Based on your findings explain why 119879 is the more robust way to define

kinetic energy in the presence of discontinuities

We now show that if 120595(119909) has a node then one can build out of it function

having lower energy From this we will be able to deduce that the minimizing

wave function cannot have nodes For simplicity assume the node of 120595(119909) ap-

pears at the origin 119909 = 0

Figure lrmI-1 The absolute value of the wave function and the parabola in it

X1 X2

g(x)

Electron Density Functional Theory Page 3

copy Roi Baer

We will augment slightly the function 120595 by some 120575120595 and show that this

leads to reduction in energy Note that the energy change in an arbitrary per-

turbation is

120575119864[|120595|] = 120575119879[|120595|] + 2int |120595(119909)|(119881(119909) minus 119864[|120595|])|120575120595(119909)|119889119909

Exercise Verify this and show we explicitly assumed that the norm of 120595 is 1

Our augmentation smoothes the node as shown in Figure lrmI-1 determining a

large enough parameter 119886 such that the parabola

119901(119909) =1

2119886(119909 minus 119909

) + 119888

has the properties that it is tangent at some 119909 and 119909 to |120595(119909)| (where

119909 lt 0 lt 119909 ) The new wave function is nodeless

120594(119909) = |120595(119909)| 119909 lt 119909 119900119903 119909 gt 119909

119901(119909) 119909 lt 119909 lt 119909

Note that by decreasing 119886 the parabola becomes narrower and the following

limits are obtained

minus119909 rarr 119909 rarr |120595 (0)|119886 rarr 0

119888 rarr1

2 |120595 (0)| 119886 rarr 0

Note that because the function has a node at 119909 = 0 the potential energy

changes only to second order with 119886 119881[120594] minus 119881[120595] prop 119886 The kinetic energy on

the other hand has a linear dependence 119879[120594] minus 119879[120595] asymp minusℏ

|120595 (0)| 119886 But

note that it is negative Thus as 119886 rarr 0 the total change in energy is dominated

by the change in the kinetic energy and is negative This shows that 119864[120594] lt

119864[|120595|] = 119864[120595] Hence the ground state cannot have nodes otherwise one can

remove them lowering the energy further

Electron Density Functional Theory Page 4

copy Roi Baer

This proof can be extended to any number of dimensions It can be used to

prove that the ground state of a many-boson wave function has no nodes

Exercise Prove the corollary the ground-state of boson systems is non-

degenerate

ii Ground-state wave function of Fermions

The ground-state wave function of 119873 gt 1 Fermions is a much more compli-

cated object than that of 119873 gt 1 Bosons This is because of the Pauli principle

which states 120595(119909 119909 ) = minus120595(119909 119909 ) ie 120595 is bound to have nodes in general

Thus the Fermionic ground-state is actually a highly excited eigenstate of the

Hamiltonian It can still be described by a minimum principle namely the

normalized many-body antisymmetric wave-function which minimizes

119864[120595] = ⟨120595||120595⟩ This formulation of the ground state problem is very conven-

ient We do not have to look for many eigenstates of we need only search for

minimum Finding the minimal energy and the expectation values of various

physically important operators is the main challenge of electronic structure

theory Doing the same for low-lying excited states is the grand challenge of

this field

c) Why electronic structure is an important dif-

ficult problem

In this course we will study methods to treat electronic structure This prob-

lem is often considerably more complicated than nuclear dynamics of mole-

cules The most obvious reason is that usually there are many more electrons

than nuclei in a molecule However there are more reasons electrons in mol-

ecules have extreme quantum mechanical character while nuclei are more

classical This last statement requires probably some explanation We will

Electron Density Functional Theory Page 5

copy Roi Baer

discuss in this course ways of mapping the electrons in a molecule onto a sys-

tem of non-interacting particles In such a picture each electron has its own

orbital or wave-function Generally the orbitals of different electrons in mole-

cules strongly overlap Furthermore most of the valence orbitals are widely

spread over space encapsulating many molecular nuclei Nuclei on the other

hand are massive so their wave functions are strongly localized and hardly

overlap once again they behave much more like classical particles

Describing electronic structure is therefore a quantum mechanical problem

Now the electrons interact via the attractive Coulomb force not only with

the stationary nuclei but also with each other each electron repels each other

electron via the repulsive Coulomb force This makes the electronic structure

problem a many-body problem

Quantum many-body problems are very difficult to track down much more

difficult than classical mechanical ones The reason is that for 119873 electrons the

many-body wave function is then a function of 3119873 variables 120595(119955 hellip 119955 )

where ir (i=1hellip eN ) is 3 dimensional the position vector Highly accurate nu-

merical representation of such functions is next to impossible for 119873 gt 1

Thus electronic structure methods invariably include approximate methods

Typically the wave function encapsulates much more information than we

care to know about For example int|120595(119955 hellip 119955 )| 119889 119903 sdotsdotsdot 119889 119903 gives the proba-

bility for finding an electron at 119955 and an electron at 119955 and an electron at 119955

etc Indeed the total probability is 1

int|120595(119955 hellip 119955 )|

119889 119903 hellip119889 119903

= 1 (Ic1)

Electron Density Functional Theory Page 6

copy Roi Baer

d) The Born-Oppenheimer theory

iii The adiabatic theorem

Suppose a Hamiltonian is dependent parametrically on 119877 = (119877 119877 hellip ) We

write this as [119877] Denote the eigenstates of this Hamiltonian by

[119877]120595 [119877] = 119864 [119877]120595 [119877]

Now suppose we change 119877 with time so that we have a trajectory 119877(119905) The

Hamiltonian will become time dependent (119905) = [119877(119905)] Suppose the sys-

tem is placed in its ground state at time = 0 120601(119905 = 0) = 120595 [119877(0)] It will

evolve according to the TDSE

119894ℏ(119905) = (119905)120601(119905)

Now the adiabatic theorem says that if 119877(119905) changes very slowly then

120601(119905) = 119890 ( )120595 [119877(119905)]

Namely except for a purely TD phase the evolving state stays the instanta-

neous ground state of the Hamiltonian (119905)

To show this we use the instantaneous eigenstates of the Hamiltonian (119905) to

expand 120601(119905)

120601(119905) = sum119886 (119905)119890

ℏint ( ) 120595 [119877(119905)]

We want to plug this expansion into the SE So let us calculate

119894ℏ(119905) = sum[119894ℏ + 119864 119886 ]119890

ℏint ( ) 120595 [119877(119905)]

+ sum119886 119890

ℏint ( ) 119894ℏ [119877(119905)]

And

Electron Density Functional Theory Page 7

copy Roi Baer

(119905)120601 = sum119886 119890

ℏint ( ) 119864 120595 [119877(119905)]

Equating the two expressions gives

0 = sum 119894ℏ 119890

ℏint ( ) 120595 [119877(119905)]

+ sum119886 119890

ℏint ( ) 119894ℏ [119877(119905)]

Now let us multiply by 120595 and integrate (remember that ⟨120595 |120595 ⟩ = 120575 )

0 = 119894ℏ 119890

ℏint ( ) + sum119886 119890

ℏint ( ) 119894ℏ⟨120595 | ⟩

Now we have

0 =119889

119889119905⟨120595 |120595 ⟩ = ⟨120595 | ⟩ + ⟨ |120595 ⟩

And since ⟨ |120595 ⟩ = ⟨120595 | ⟩ we find

120591 = ⟨120595 | ⟩ = minus⟨120595 | ⟩ = minus120591

The matrix 120591 = ⟨120595 | ⟩ is the matrix of ldquotime-dependent non-adiabatic

couplings (TDNACs)rdquo and it is an anti-Hermitean matrix Furthermore for

119899 = 119898 we see

119877119890⟨120595 | ⟩ = 0

Thus

= minus120591 119886 minus sum 119890 int ( ) 120591 119886

(Id1)

We see that the TDNACS 120591 create the non-adiabatic transitions ie the

transitions out of the ground state

Let us play a bit with 120591 Take the time derivative of the TISE

Electron Density Functional Theory Page 8

copy Roi Baer

(119905)120595 (119905) + (119905) (119905) = (119905)120595 (119905) + 119864 (119905) (119905)

Multiply by ⟨120595 | and get

⟨120595 || 120595 ⟩ + ⟨120595 || ⟩ = (119905)⟨120595 |120595 ⟩ + 119864 ⟨120595 | ⟩

Since ⟨120595 || ⟩ = 119864 ⟨120595 | ⟩ we have

⟨120595 || 120595 ⟩ = (119905)⟨120595 |120595 ⟩ + (119864 minus 119864 )⟨120595 | ⟩

If 119899 = 119898 then

(119905) = ⟨120595 || 120595 ⟩

This is called the Hellmann-Feynman theorem showing that the power is the

expectation value of the rate of change of the Hamiltonian If 119899 ne 119898

120591 = ⟨120595 | ⟩ =⟨120595 || 120595 ⟩

119864 minus 119864 (Id2)

This is called Epsteinrsquos theorem giving an expression for the TDNACs Re-

member that [119877(119905)] depends on 119905 through the positions of the nuclei Thus

[119877(119905)] =119889

119889119905[119877(119905)] =

part

partR [119877(119905)]

Thus

120591 = sum⟨120595 |

[119877(119905)]| 120595 ⟩

119864 [119877] minus 119864 [119877] (119905)

= sum120591 (119905)

Where the time-independent NACS are defined as

120591 [119877] = ⟨120595 |

120597

120597119877 120595 ⟩ =

⟨120595 [119877] |

[119877]| 120595 [119877]⟩

119864 [119877] minus 119864 [119877] (Id3)

Electron Density Functional Theory Page 9

copy Roi Baer

We see that the TDNACs depend on the velocity of the nuclei When nuclei

are small the couplings out of the ground state are small The 120591 are called

the ldquonon-adiabatic couplings (NACs)rdquo We find that as long as 119864 minus 119864 are not

zero on the trajectory one can always find small enough that makes the

TDNACs as small as we wish All we need for states 119899 and 119898 to stay decou-

pled is for the following conditions to be met

|120591 | = | (119905)120591 (119877 (119905))| ≪ 1

Thus we can make the NACs as small as we wish by taking the rate (119905) to

be small enough So all that is left in Eq lrm(Id1) is = minus120591 119886 Define the

non-dynamical phase as 120579 (119905) = 119894 int 120591 (119905 ) 119889119905

(note 120579 is real) and then

119886 (119905) = 119890 ( ) (119904119897119900119908 119901119903119900119888119890119904119904119890119904 = 119886119889119894119886119887119886119905119894119888 119901119903119900119888119890119904119904)

The total state is then

120601(119905) = 119886 (119905)119890

ℏint ( ) 120595 [119877(119905)] = 119890 ( ( ) ( ))120595 [119877(119905)]

where 120579 (119905) = minus

ℏint 119864 (120591)119889120591

is called the dynamical phase It is easy to prove

that if 119877(119905) traverses a closed loop the non-dynamical phase depends only on

that loop and not on the way it is traversed For example if we traverse the

same loop using different velocities the dynamic phase may change but the

geometric phase will not The closed loop geometric angle is called the Berry

phase The independence of path is a result of the fact that the non-dynamical

phase is a line integral and can be made with no reference to time

Electron Density Functional Theory Page 10

copy Roi Baer

120579 (119905) = 119894 int ⟨120595 (119877[120591])| (119877[120591])⟩119889120591

= 119894 int sum⟨120595 [119877]|120597

120597119877 120595 [119877]⟩

119889120591

= 119894 int sum120591 (119877)

119889119877

( )

( )

The dynamical phase for example is not a line integral and its value depends

not only on the path itself but also on the velocity taken along the way This

observation makes the non-dynamical phase a special quantity Berry has

shown that this quantity can give us information on the way the Hamiltonian

is dependent on its parameters For a real Hamiltonian for example 119890

around a closed path always equals either 1 or -1 If there is an even number

of degeneracies enclosed by the path it is 1 and if an odd number ndash it is -1

iv Motivation for the Born-Oppenheimer approximation classical nuclei

In a molecule we can think of the nuclei (having coordinates 119877) as heavy and

therefore classical particles which are slowly moving The electrons are light

and therefore quantum They have coordinates 119903 (119903 includes spin) and they

119877

119877

A path in parameter

ldquospacerdquo 119877 minus 119877

Electron Density Functional Theory Page 11

copy Roi Baer

feel the effect of slowly moving nuclei The electron dynamics is controlled by

the Hamiltonian

[119877] = + 119881(119903 119877)

The potential 119881(119903 119877) describes the Coulombic interactions between electrons

and nuclei (in atomic units)

119881(119903 119877) = minussum119885

| minus 119903 |

+1

2sum

1

119903

+1

2sum

119885 119885

119877

In general one wants to assume that the total energy of the molecule in this

classical approximation is

119864 = 119879 + ⟨120595 [119877]| [119877(119905)]|120595 [119877]⟩

Where 120595 [119877] is the ground state of the electrons at nuclear configuration 119877

The adiabatic theorem states that this is reasonable as long as nuclei move

slowly Thus the adiabatic theorem allows us to write

119864 = 119879 + 119881 (119877)

where 119881 (119877) is the ground state energy of the electrons at nuclear configura-

tion 119877 This energy is a usual classical mechanical energy and the Newton

equations of motion apply

119872 = 119865 = minus120597

120597119877 119881 (119877)

We see that the adiabatic theorem allows us to consider the nuclei as moving

in a potential well which is essentially the eigenvalues of the electrons This in

essence is the BO approximation when the nuclei are classical

Electron Density Functional Theory Page 12

copy Roi Baer

v The Born-Oppenheimer approximation in quantum nuclear case

The classical approach motivates a quantum treatment We are expecting that

nuclei will not excite electrons very efficiently That is the motivation for the

BO approximation

The Born and Oppenheimer development is similar to that of the adiabatic

theorem although there are no ldquoexternalrdquo fields Suppose we have a system of

fast particles with coordinates 119903 = (119903 119903 hellip ) and slow particles with coordi-

nates 119877 = (119877 119877 hellip ) The Hamiltonian can be written as

= + + 119881(119903 119877)

The Schroumldinger equation is

120595 (119903 119877) = 119864 120595 (119903 119877) (Id4)

Note that we can assume these wave functions are orthogonal

⟨120595 |120595 ⟩ = intint120595 (119903 119877)120595 (119903 119877)119889119903 119889119877 = 120575

Now to proceed let us first consider the fast r-Hamiltonian

[119877] = + 119881(119903 119877) (Id5)

In this ldquofast Hamiltonianrdquo [119877] the slow variables are simply parameters (it

contains no derivatives with respect to 119877) Thus it depends on 119877 parametrical-

ly The fast eigenvalue problem is

[119877]120601 (119903 119877) = 119882 (119877)120601 (119903 119877)

The eigenvalues are functions of the parameters 119877 They are called the adia-

batic (or Born-Oppenheimer) surfaces The notation with the semicolon be-

tween 119903 and 119877 is designed to emphasize that 120601 are wave functions in 119903 but

they depend only parametrically on 119877 This means for example that the over-

lap of the fast eigenstates (called adiabatic states) involves integration only

Electron Density Functional Theory Page 13

copy Roi Baer

over 119903 the dynamical coordinates while 119877 is held fixed since it is only a pa-

rameter

⟨120601 (119877)|120601 (119877)⟩ = int120601 (119903 119877)120601 (119903 119877)119889119903 = 120575

We cannot in fact say anything really meaningful about ⟨120601 (119877)|120601 (119877 )⟩ when

119877 ne 119877 (except when 119877 minus 119877 is infinitesimal but we will not go into this issue

here) Now we can expand the ldquorealrdquo wave-function as a linear combination

of the adiabatic functions

120595 (119903 119877) = sum120601 (119903 119877)Φ (119877)

(Id6)

We can do this because for any give 119877 120601 (119903 119877) span the space of wave func-

tions dependent on 119903 In fact the expansion coefficients Φ (119877) are given by

Φ (119877) = int120601 (119903 119877)120595 (119903 119877)119889119903

Now let us plug Eq lrm(Id6) into the SE lrm(Id4)

119864 120595 (119903 119877) = ( + )120595 (119903 119877)

= sum [120601 (119903 119877)Φ (119877)]

+ sum119882 (119877)120601 (119903 119877)Φ (119877)

(Id7)

Note that since = sum

where = minus119894ℏ

We have

[120601 (119903 119877)Φ (119877)]

= Φ (119877) [120601 (119903 119877)]

+ sum1

119872 ( 120601 (119903 119877)) ( Φ (119877))

+ 120601 (119903 119877) Φ (119877)

Multiplying Eq lrm(Id7) by 120601 (119903 119877) and integrating over 119903 gives

Electron Density Functional Theory Page 14

copy Roi Baer

sum(A + )Φ (119877)

+ ( + 119882 (119877))Φ (119877) = 119864 Φ (119877) (Id8)

Where

A = int120601 (119903 119877) [ 120601 (119903 119877)]119889119903 = ⟨120601 | 120601 ⟩

= sum

int120601 (119903 119877) [ 120601 (119903 119877)]119889119903

119872

= sum⟨120601 | 120601 ⟩

119872

= minussumℏ

119872 ⟨120601 |

120597

120597119877 120601 ⟩

120597

120597119877

= minussumℏ

119872 120591

120597

120597119877

The matrices 120591 = ⟨120601 |

120601 ⟩ are the non-adiabatic couplings in the ldquofastrdquo

system These are exactly the non-adiabatic coefficients in the adiabatic theory

(Eq lrm(Id3))

It is possible to show that

sum(A + )Φ (119877)

+ Φ (119877)

= sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

Thus Eq lrm(Id8) becomes

sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

This is a Schroumldinger ndashlike equation which determines the coefficients Φ (119877)

These are called the ldquoslow eigenfunctionsrdquo Once they are computed one has

an exact solution to the Schroumldinger equation However we do not really

want to solve this infinite set of coupled differential equations Thus we as-

sume that the quantities 120591 for 119895 ne 119896 can be neglected Note that here there is

no which can be taken as small as needed to make the effect of 120591 as mi-

Electron Density Functional Theory Page 15

copy Roi Baer

nute as we need Still we can hope that the fact that we chose 119877 to be slow de-

grees of freedom allow us to make just this approximation This was the idea

of Born and Oppenheimer who neglected 120591

summinusℏ

2119872

120597

120597119877 Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

The resulting equation is a Schroumldinger equation for the slow degrees of free-

dom which move under the force of electrons derived from a potential 119882 (119877)

When applied to molecules the slow degrees of freedom are usually the nuclei

and the fast - the electrons There is a problem with neglecting 120591 because of

their non-dynamical effect Taking them into account results in treating them

as a magnetic field

sumℏ

2119872 (119894

120597

120597119877 minus 120591 )

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877) (Id9)

The BO approximation breaks the molecular SE into two consecutive tasks

First the electronic SE Eq lrm(Id5) must be solved for any relevant clamped po-

sition of nuclei 119877 Then the nuclear equation lrm(Id9)

Further reading on this subject M Baer Beyond Born-Oppenheimer elec-

tronic non-adiabatic coupling terms and conical intersections (Wiley Hobo-

ken NJ 2006)

e) Electron correlation

vi The electronic wave function of two non-interacting electrons

We have stressed that the electronic structure problem is difficult because of

the complexity of the many-body wave function when electrons interact In

order to appreciate this complexity of the electronic wave function let us first

study a simple system of two non-interacting electrons in a 1D atomic well

Electron Density Functional Theory Page 16

copy Roi Baer

We consider an atomic well given by the potential 119907 (119909) and we place in it

an electron The Hamiltonian is

ℎ = minusℏ

2119898 nabla + 119907 (119909)

(Ie1)

Now consider a 2-electron problem Assume we have two electrons Fermi-

ons which are non-interacting in the well The Hamiltonian is

= ℎ(1) + ℎ(2) (Ie2)

The notation ℎ(119894) means the Hamiltonian of Eq lrm(Ie1) with the coordinate of

the i-th electron

What are the eigenstates in this case First since each electron can have a

spin we must decide on the spin of the state For now let us assume the state

is spin-polarized ie that the total spine is 1 both electrons are in spin-up or-

bitals We try the following form as a wave function

Ψ(119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]120572(1)120572(2) (Ie3)

Notice that the spatial part is anti-symmetric while the spin part is symmetric

This renders the entire wave-function anti-symmetric in accordance with the

Pauli principle The notation 120572(1)120572(2) means both electrons have spin projec-

tion up (120572) We do not yet know if and under what conditions this wave func-

tion can actually describe eigenstates of the two electrons the

an lrm(Ie2) We assume that 120595 (119909) and 120595 (119909) are orthonormal This makes the

state normalized since

Electron Density Functional Theory Page 17

copy Roi Baer

intΨ(119909 119909 ) 119889119909 119889119909 =

1

2int(120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 ))

119889119909 119889119909

=1

2int [(120595 (119909 )120595 (119909 ))

+ (120595 (119909 )120595 (119909 ))

minus 2120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 )] 119889119909 119889119909

(Ie4)

The first and second terms are both equal to

int(120595 (119909 )120595 (119909 )) 119889119909 119889119909 =int120595 (119909 )

119889119909 int120595 (119909 ) 119889119909 =1 times 1

= 1 (Ie5)

The third term in lrm(Ie4) is

int120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 ) 119889119909 119889119909

=int120595 (119909 )120595 (119909 ) 119889119909 int120595 (119909 )120595 (119909 ) 119889119909 = 0 times 0 = 0 (Ie6)

Indeed the entire wave function is orthonormal (thanks to the factor 1radic2

in lrm(Ie3))

Now let us see under what condition the wave function in Eq (Ie3) is an ei-

genstate of the Hamiltonian in lrm(Ie2)

Ψ(119909 119909 ) = (ℎ(1) + ℎ(2))Ψ(119909 119909 )

=1

radic2120595 (119909 )ℎ(1)120595 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

minus1

radic2120595 (119909 )ℎ(1)120601 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

(Ie7)

This should be equated to

119864Ψ(119909 119909 ) =1

radic2119864120595 (119909 )120595 (119909 ) minus 119864120595 (119909 )120595 (119909 ) (Ie8)

If we choose the orbitals 120595 (119909) and 120595 (119909) to be eigenstates of h (which are

orthogonal so that is compatible with our previous assumption)

Electron Density Functional Theory Page 18

copy Roi Baer

ℎ120595 (119909) = 120598 120595 (119909) 119894 = 01 hellip (Ie9)

Thus

Ψ(119909 119909 ) =1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

minus1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

= (120598 + 120598 )Ψ(119909 119909 )

(Ie10)

And we see that indeed Ψ(x x ) is an eigenstate with energy E = ϵ + ϵ

vii Correlation in action a wave function of 2 interacting particles in an

Harmonic trap

We now build a simple electronic structure model that will allow us to study

in some detail the most basic concepts For this we suppose that the electrons

are in a harmonic atom that is the potential well

119907 (119909) =1

2120583120596 119909 (Ie11)

The two lowest eigenstates of a Harmonic oscillator are

120595 (119909) = 119873 119890

120595 (119909) = 119873 radic120583120596

ℏ119909119890

(Ie12)

The normalization constants are

119873 = (120583120596

120587ℏ)

119873 = (4120583120596

120587ℏ)

(Ie13)

And the eigenvalues are

120598 = (119899 +1

2)ℏ120596 119899 = 01 hellip (Ie14)

The groundstate energy of the 2-electron system in the triplet state describes

one electron in 120595 (119909) and another in 120595 (119909)

Electron Density Functional Theory Page 19

copy Roi Baer

119864 = 120598 + 120598 = 2ℏ120596 (Ie15)

As discussed above singlet and triplet two-electron ground state wave func-

tions composed of two orbitals must be space-symmetric or antisymmetric

respectively We consider below 3 wave functions The first Ψ is the ground

state singlet where both electrons are in 120595 The second and third are a singlet

and a triplet made from one electron in 120595 and the other in 120595

Ψ (119909 119909 ) = 120595 (119909 )120595 (119909 ) = 119873 119890

ℏ(

)

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) + 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 + 119909 )

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 minus 119909 )

(Ie16)

The normalization factors are

119873 = 119873 = radic

120583120596

120587ℏ 119873 =

1

radic2radic120583120596

ℏ119873 119873 =

120583120596

120587ℏ

(Ie17)

Eq lrm(Ie16) describes the distribution of positions of both electrons in their

corresponding states We now want to ask how much are the electrons in this

state aware of each other Do they correlate their motion in some way One

way to measure correlation is to consider the quantity lang119909 119909 rang minus lang119909 ranglang119909 rang If

electrons are completely unaware of each other this quantity called the posi-

tion-autocorrelation function is zero because then the average of the product

of their position must decompose to the product of the average Any deviance

from zero indicates some degree of correlation For the triplet wave function

in Eq lrm(Ie16) we find

Electron Density Functional Theory Page 20

copy Roi Baer

lang119909 rang = ⟨Ψ |119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩)

=1

2(⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩) = 0

(Ie18)

Of-course the same result would be obtained if we calculate ⟨119909 ⟩ because the

electrons are equivalent Furthermore

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩)

= minus|⟨120595 |119909|120595 ⟩| = minus

1

2

120583120596

(Ie19)

We thus find that

lang119909 119909 rang minus lang119909 ranglang119909 rang = minus1

2

120583120596

(Ie20)

The position autocorrelation functional is non-zero even though the electrons

are non-interacting The correlation is due to the Pauli principle It is negative

because the Pauli principle pushes the electrons apart and since most of the

time they both want to be near the origin this gives preference to them being

placed on opposite sides (when one electron has positive x coordinate the

other has negative and vice versa) Letrsquos see what happens in the singlet wave

function Ψ Here too lang119909 rang = lang119909 rang = 0 Then

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ = ⟨120595 120595 |119909 119909 |120595 120595 ⟩

= ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ = 0 (Ie21)

Electron Density Functional Theory Page 21

copy Roi Baer

The singlet ground state shows no correlation because the Pauli principle is

applied here to the spins not the spatial part of the wave function Note how-

ever this does not mean that all singlet wave functions of 2 electrons have no

correlation Indeed this is the situation in Ψ where the development is

very similar to Eq lrm(Ie19) except that the minus sign is now a plus sign so

lang119909 119909 rang minus lang119909 ranglang119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ minus 0

=1

2⟨120595 120595 + 120595 120595 |119909 119909 |120595 120595 + 120595 120595 ⟩ = |⟨120595 |119909|120595 ⟩|

=1

2

120583120596

(Ie22)

Here we find positive autocorrelation indicative of the fact that spin-opposite

non-interacting pairs of electrons want to stick together (when one has posi-

tive x the other wants to have as well and when one has negative the other

one wants negative as well) like non-interacting bosons

Since there is no interaction between the electrons the correlation in these

wave functions arises only from the Pauli principle ie because we impose

the fact that electrons are Fermions This is called Fermi correlation Our les-

son is this

1) Wavefunctions that are mere products of singe-particle orbitals have no correlation 2) If the products are symmetrized like in the case of the excited singlet the correlation

lang119909 119909 rang is positive indicating that the particles ldquolike to be togetherrdquo ie both on the right or both on the left of the origin

3) If the products are anti-symmetrized like in the case of the triplet the correlation lang119909 119909 rang is negative indicating that the particles ldquolike to be separatedrdquo

Up to now we assumed no e-e interaction So now letrsquos include it and add to

the Hamiltonian an interaction term

= ℎ(1) + ℎ(2) + 119881(119909 119909 ) (Ie23)

Electron Density Functional Theory Page 22

copy Roi Baer

Let us take one case of coupling which is simple enough to yield to analytical

analysis

119881(119909 119909 ) = 120583120574 119909 119909 (Ie24)

With 120574 lt 120596 In this case the Hamiltonian is

=119901

2120583+

119901

2120583+

1

2120583120596 119909

+1

2120583120596 119909

+ 120583120574 119909 119909 (Ie25)

The interaction seems strange at first sight because it does not seem to depend

on the distance between the particles as we are used to from electrostatics

yet it does describe a repulsion We can see this by writing the Hamiltonian

as

=119901

2120583+

119901

2120583+

1

2120583(120596 + 120574 )119909

+1

2120583(120596 + 120574 )119909

minus1

2120583120574 (119909 minus 119909 )

(Ie26)

We now see that the Hamiltonian describes two particles in a Harmonic po-

tential 119881 (119909) =

120583(120596 + 120574 )119909 interacting by a Harmonic potential

minus

120583120574 (119909 minus 119909 )

The potential drops as |119909 minus 119909 | grows ie it is repulsive

Exercise lrmI-1

Find the eigenvalues and eigenfunctions of this Hamiltonian (You can define

new coordinates 119883 and 119909 by

radic = 119909 and

radic = 119909 )

Exercise lrmI-2

Write the ground-state energy and wave function for the triplet state of the

system in the previous exercise Determine the effect of interaction on the en-

ergy by calculating 119903 =

and on the correlation function 119888 = lang119909 119909 rang minus

lang119909 ranglang119909 rang

Solution The result is shown in Figure lrmI-2 (left panel)

Electron Density Functional Theory Page 23

copy Roi Baer

Figure lrmI-2 (Left panel) The ratio = (Right panel) the correlation ratio ( ) (

)

vs interaction strengths where = (

)

f) The electron density is a much simpler object

than the wave function

The complexity of the electronic wave function is overwhelming It includes

all the information we can have on the electrons in a molecule in a certain

state However all these intricacies and details are often uninteresting for us

in many cases we simply have no direct use for them Take for instance the

electronic energy - our primary target in the Born-Oppenheimer picture It

depends only on the relative distance of pairs of particles This is because the

electron-electron Coulomb repulsion is a pair-wise interaction

One interesting quantity besides energy that can be extracted from the elec-

tronic wave function is the electron density 119899(119955) This 3D function tells us the

expectation value of the density of electrons That is 119899(119955)119889 119903 is the expectation

value of the number of electrons at a small volume 119889 119903 around point r Thus

we can write

119899(119955) = ⟨120595|(119955)|120595⟩ (If1)

We use the notation

00 01 02 03 04 05

04

05

06

07

08

09

10

r

00 01 02 03 04 050

5

10

15

cc

2

Electron Density Functional Theory Page 24

copy Roi Baer

⟨120595|120601⟩ equiv int120595 (119955 119955 hellip 119955 )120601(119955 119955 hellip 119955

)119889 119903 119889 119903 hellip119889 119903

(If2)

Here (119955) is the operator corresponding to the electron number density Since

electrons are point particles and the position operator for the ith electron is 119955

this operator is defined by

(119955) = sum120575( minus 119955)

(If3)

We used the definition of a 120575-function according to which

int120575(119955 minus 119955)119891(119955 )119889 119903 = 119891(119955)

(If4)

The function 120575( minus 119955) is the density of electron 119894 at 119955

Exercise lrmI-3

Calculate the expectation value of the electron number density and show that

119899(119955) = 119873 int120595(119955 119955 hellip 119955 )120595(119955 119955 hellip 119955

)119889 119903 hellip119889 119903

(If5)

Looking at Eq lrm(If5) we see that 119899(119955) involves integrating out a huge amount

of wave-function details It is as if only the data concerning the density distri-

bution of a single electron remains This is multiplied by 119873 in Eq lrm(If5) so

119899(119955) accounts for the combined density of all electrons Indeed integrating

over the entire space one obtains from lrm(If5)

int119899(119955)119889 119903 = 119873 (If6)

Expressing the fact that the total number of electrons is 119873

Exercise lrmI-4

Calculate and plot the 1D electron density of the triplet ground state from Ex-

ercise lrmI-2 as a function of the interaction parameter

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 2: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 2

copy Roi Baer

int |120595 (119909)| 119889119909

Show that

1) 119879 [120595] = 119879 [120595] for any smooth 120595 that decays to zero as |119909| rarr infin

2) 119879 [120595 ] = minus119879 [120595 ] for 120595 (119909) = radic120572119890 | | Which of the two values is

physically unacceptable as kinetic energy

3) What do you think will happen when the two definitions of kinetic en-

ergies are applied to the family of functions 120595 (119909) = radic120572119890 radic Us-

ing numerical integration determine what is the limit lim rarr 119879 [120595 ] Is

it 119879 [120595 ] or 119879 [120595 ]

Based on your findings explain why 119879 is the more robust way to define

kinetic energy in the presence of discontinuities

We now show that if 120595(119909) has a node then one can build out of it function

having lower energy From this we will be able to deduce that the minimizing

wave function cannot have nodes For simplicity assume the node of 120595(119909) ap-

pears at the origin 119909 = 0

Figure lrmI-1 The absolute value of the wave function and the parabola in it

X1 X2

g(x)

Electron Density Functional Theory Page 3

copy Roi Baer

We will augment slightly the function 120595 by some 120575120595 and show that this

leads to reduction in energy Note that the energy change in an arbitrary per-

turbation is

120575119864[|120595|] = 120575119879[|120595|] + 2int |120595(119909)|(119881(119909) minus 119864[|120595|])|120575120595(119909)|119889119909

Exercise Verify this and show we explicitly assumed that the norm of 120595 is 1

Our augmentation smoothes the node as shown in Figure lrmI-1 determining a

large enough parameter 119886 such that the parabola

119901(119909) =1

2119886(119909 minus 119909

) + 119888

has the properties that it is tangent at some 119909 and 119909 to |120595(119909)| (where

119909 lt 0 lt 119909 ) The new wave function is nodeless

120594(119909) = |120595(119909)| 119909 lt 119909 119900119903 119909 gt 119909

119901(119909) 119909 lt 119909 lt 119909

Note that by decreasing 119886 the parabola becomes narrower and the following

limits are obtained

minus119909 rarr 119909 rarr |120595 (0)|119886 rarr 0

119888 rarr1

2 |120595 (0)| 119886 rarr 0

Note that because the function has a node at 119909 = 0 the potential energy

changes only to second order with 119886 119881[120594] minus 119881[120595] prop 119886 The kinetic energy on

the other hand has a linear dependence 119879[120594] minus 119879[120595] asymp minusℏ

|120595 (0)| 119886 But

note that it is negative Thus as 119886 rarr 0 the total change in energy is dominated

by the change in the kinetic energy and is negative This shows that 119864[120594] lt

119864[|120595|] = 119864[120595] Hence the ground state cannot have nodes otherwise one can

remove them lowering the energy further

Electron Density Functional Theory Page 4

copy Roi Baer

This proof can be extended to any number of dimensions It can be used to

prove that the ground state of a many-boson wave function has no nodes

Exercise Prove the corollary the ground-state of boson systems is non-

degenerate

ii Ground-state wave function of Fermions

The ground-state wave function of 119873 gt 1 Fermions is a much more compli-

cated object than that of 119873 gt 1 Bosons This is because of the Pauli principle

which states 120595(119909 119909 ) = minus120595(119909 119909 ) ie 120595 is bound to have nodes in general

Thus the Fermionic ground-state is actually a highly excited eigenstate of the

Hamiltonian It can still be described by a minimum principle namely the

normalized many-body antisymmetric wave-function which minimizes

119864[120595] = ⟨120595||120595⟩ This formulation of the ground state problem is very conven-

ient We do not have to look for many eigenstates of we need only search for

minimum Finding the minimal energy and the expectation values of various

physically important operators is the main challenge of electronic structure

theory Doing the same for low-lying excited states is the grand challenge of

this field

c) Why electronic structure is an important dif-

ficult problem

In this course we will study methods to treat electronic structure This prob-

lem is often considerably more complicated than nuclear dynamics of mole-

cules The most obvious reason is that usually there are many more electrons

than nuclei in a molecule However there are more reasons electrons in mol-

ecules have extreme quantum mechanical character while nuclei are more

classical This last statement requires probably some explanation We will

Electron Density Functional Theory Page 5

copy Roi Baer

discuss in this course ways of mapping the electrons in a molecule onto a sys-

tem of non-interacting particles In such a picture each electron has its own

orbital or wave-function Generally the orbitals of different electrons in mole-

cules strongly overlap Furthermore most of the valence orbitals are widely

spread over space encapsulating many molecular nuclei Nuclei on the other

hand are massive so their wave functions are strongly localized and hardly

overlap once again they behave much more like classical particles

Describing electronic structure is therefore a quantum mechanical problem

Now the electrons interact via the attractive Coulomb force not only with

the stationary nuclei but also with each other each electron repels each other

electron via the repulsive Coulomb force This makes the electronic structure

problem a many-body problem

Quantum many-body problems are very difficult to track down much more

difficult than classical mechanical ones The reason is that for 119873 electrons the

many-body wave function is then a function of 3119873 variables 120595(119955 hellip 119955 )

where ir (i=1hellip eN ) is 3 dimensional the position vector Highly accurate nu-

merical representation of such functions is next to impossible for 119873 gt 1

Thus electronic structure methods invariably include approximate methods

Typically the wave function encapsulates much more information than we

care to know about For example int|120595(119955 hellip 119955 )| 119889 119903 sdotsdotsdot 119889 119903 gives the proba-

bility for finding an electron at 119955 and an electron at 119955 and an electron at 119955

etc Indeed the total probability is 1

int|120595(119955 hellip 119955 )|

119889 119903 hellip119889 119903

= 1 (Ic1)

Electron Density Functional Theory Page 6

copy Roi Baer

d) The Born-Oppenheimer theory

iii The adiabatic theorem

Suppose a Hamiltonian is dependent parametrically on 119877 = (119877 119877 hellip ) We

write this as [119877] Denote the eigenstates of this Hamiltonian by

[119877]120595 [119877] = 119864 [119877]120595 [119877]

Now suppose we change 119877 with time so that we have a trajectory 119877(119905) The

Hamiltonian will become time dependent (119905) = [119877(119905)] Suppose the sys-

tem is placed in its ground state at time = 0 120601(119905 = 0) = 120595 [119877(0)] It will

evolve according to the TDSE

119894ℏ(119905) = (119905)120601(119905)

Now the adiabatic theorem says that if 119877(119905) changes very slowly then

120601(119905) = 119890 ( )120595 [119877(119905)]

Namely except for a purely TD phase the evolving state stays the instanta-

neous ground state of the Hamiltonian (119905)

To show this we use the instantaneous eigenstates of the Hamiltonian (119905) to

expand 120601(119905)

120601(119905) = sum119886 (119905)119890

ℏint ( ) 120595 [119877(119905)]

We want to plug this expansion into the SE So let us calculate

119894ℏ(119905) = sum[119894ℏ + 119864 119886 ]119890

ℏint ( ) 120595 [119877(119905)]

+ sum119886 119890

ℏint ( ) 119894ℏ [119877(119905)]

And

Electron Density Functional Theory Page 7

copy Roi Baer

(119905)120601 = sum119886 119890

ℏint ( ) 119864 120595 [119877(119905)]

Equating the two expressions gives

0 = sum 119894ℏ 119890

ℏint ( ) 120595 [119877(119905)]

+ sum119886 119890

ℏint ( ) 119894ℏ [119877(119905)]

Now let us multiply by 120595 and integrate (remember that ⟨120595 |120595 ⟩ = 120575 )

0 = 119894ℏ 119890

ℏint ( ) + sum119886 119890

ℏint ( ) 119894ℏ⟨120595 | ⟩

Now we have

0 =119889

119889119905⟨120595 |120595 ⟩ = ⟨120595 | ⟩ + ⟨ |120595 ⟩

And since ⟨ |120595 ⟩ = ⟨120595 | ⟩ we find

120591 = ⟨120595 | ⟩ = minus⟨120595 | ⟩ = minus120591

The matrix 120591 = ⟨120595 | ⟩ is the matrix of ldquotime-dependent non-adiabatic

couplings (TDNACs)rdquo and it is an anti-Hermitean matrix Furthermore for

119899 = 119898 we see

119877119890⟨120595 | ⟩ = 0

Thus

= minus120591 119886 minus sum 119890 int ( ) 120591 119886

(Id1)

We see that the TDNACS 120591 create the non-adiabatic transitions ie the

transitions out of the ground state

Let us play a bit with 120591 Take the time derivative of the TISE

Electron Density Functional Theory Page 8

copy Roi Baer

(119905)120595 (119905) + (119905) (119905) = (119905)120595 (119905) + 119864 (119905) (119905)

Multiply by ⟨120595 | and get

⟨120595 || 120595 ⟩ + ⟨120595 || ⟩ = (119905)⟨120595 |120595 ⟩ + 119864 ⟨120595 | ⟩

Since ⟨120595 || ⟩ = 119864 ⟨120595 | ⟩ we have

⟨120595 || 120595 ⟩ = (119905)⟨120595 |120595 ⟩ + (119864 minus 119864 )⟨120595 | ⟩

If 119899 = 119898 then

(119905) = ⟨120595 || 120595 ⟩

This is called the Hellmann-Feynman theorem showing that the power is the

expectation value of the rate of change of the Hamiltonian If 119899 ne 119898

120591 = ⟨120595 | ⟩ =⟨120595 || 120595 ⟩

119864 minus 119864 (Id2)

This is called Epsteinrsquos theorem giving an expression for the TDNACs Re-

member that [119877(119905)] depends on 119905 through the positions of the nuclei Thus

[119877(119905)] =119889

119889119905[119877(119905)] =

part

partR [119877(119905)]

Thus

120591 = sum⟨120595 |

[119877(119905)]| 120595 ⟩

119864 [119877] minus 119864 [119877] (119905)

= sum120591 (119905)

Where the time-independent NACS are defined as

120591 [119877] = ⟨120595 |

120597

120597119877 120595 ⟩ =

⟨120595 [119877] |

[119877]| 120595 [119877]⟩

119864 [119877] minus 119864 [119877] (Id3)

Electron Density Functional Theory Page 9

copy Roi Baer

We see that the TDNACs depend on the velocity of the nuclei When nuclei

are small the couplings out of the ground state are small The 120591 are called

the ldquonon-adiabatic couplings (NACs)rdquo We find that as long as 119864 minus 119864 are not

zero on the trajectory one can always find small enough that makes the

TDNACs as small as we wish All we need for states 119899 and 119898 to stay decou-

pled is for the following conditions to be met

|120591 | = | (119905)120591 (119877 (119905))| ≪ 1

Thus we can make the NACs as small as we wish by taking the rate (119905) to

be small enough So all that is left in Eq lrm(Id1) is = minus120591 119886 Define the

non-dynamical phase as 120579 (119905) = 119894 int 120591 (119905 ) 119889119905

(note 120579 is real) and then

119886 (119905) = 119890 ( ) (119904119897119900119908 119901119903119900119888119890119904119904119890119904 = 119886119889119894119886119887119886119905119894119888 119901119903119900119888119890119904119904)

The total state is then

120601(119905) = 119886 (119905)119890

ℏint ( ) 120595 [119877(119905)] = 119890 ( ( ) ( ))120595 [119877(119905)]

where 120579 (119905) = minus

ℏint 119864 (120591)119889120591

is called the dynamical phase It is easy to prove

that if 119877(119905) traverses a closed loop the non-dynamical phase depends only on

that loop and not on the way it is traversed For example if we traverse the

same loop using different velocities the dynamic phase may change but the

geometric phase will not The closed loop geometric angle is called the Berry

phase The independence of path is a result of the fact that the non-dynamical

phase is a line integral and can be made with no reference to time

Electron Density Functional Theory Page 10

copy Roi Baer

120579 (119905) = 119894 int ⟨120595 (119877[120591])| (119877[120591])⟩119889120591

= 119894 int sum⟨120595 [119877]|120597

120597119877 120595 [119877]⟩

119889120591

= 119894 int sum120591 (119877)

119889119877

( )

( )

The dynamical phase for example is not a line integral and its value depends

not only on the path itself but also on the velocity taken along the way This

observation makes the non-dynamical phase a special quantity Berry has

shown that this quantity can give us information on the way the Hamiltonian

is dependent on its parameters For a real Hamiltonian for example 119890

around a closed path always equals either 1 or -1 If there is an even number

of degeneracies enclosed by the path it is 1 and if an odd number ndash it is -1

iv Motivation for the Born-Oppenheimer approximation classical nuclei

In a molecule we can think of the nuclei (having coordinates 119877) as heavy and

therefore classical particles which are slowly moving The electrons are light

and therefore quantum They have coordinates 119903 (119903 includes spin) and they

119877

119877

A path in parameter

ldquospacerdquo 119877 minus 119877

Electron Density Functional Theory Page 11

copy Roi Baer

feel the effect of slowly moving nuclei The electron dynamics is controlled by

the Hamiltonian

[119877] = + 119881(119903 119877)

The potential 119881(119903 119877) describes the Coulombic interactions between electrons

and nuclei (in atomic units)

119881(119903 119877) = minussum119885

| minus 119903 |

+1

2sum

1

119903

+1

2sum

119885 119885

119877

In general one wants to assume that the total energy of the molecule in this

classical approximation is

119864 = 119879 + ⟨120595 [119877]| [119877(119905)]|120595 [119877]⟩

Where 120595 [119877] is the ground state of the electrons at nuclear configuration 119877

The adiabatic theorem states that this is reasonable as long as nuclei move

slowly Thus the adiabatic theorem allows us to write

119864 = 119879 + 119881 (119877)

where 119881 (119877) is the ground state energy of the electrons at nuclear configura-

tion 119877 This energy is a usual classical mechanical energy and the Newton

equations of motion apply

119872 = 119865 = minus120597

120597119877 119881 (119877)

We see that the adiabatic theorem allows us to consider the nuclei as moving

in a potential well which is essentially the eigenvalues of the electrons This in

essence is the BO approximation when the nuclei are classical

Electron Density Functional Theory Page 12

copy Roi Baer

v The Born-Oppenheimer approximation in quantum nuclear case

The classical approach motivates a quantum treatment We are expecting that

nuclei will not excite electrons very efficiently That is the motivation for the

BO approximation

The Born and Oppenheimer development is similar to that of the adiabatic

theorem although there are no ldquoexternalrdquo fields Suppose we have a system of

fast particles with coordinates 119903 = (119903 119903 hellip ) and slow particles with coordi-

nates 119877 = (119877 119877 hellip ) The Hamiltonian can be written as

= + + 119881(119903 119877)

The Schroumldinger equation is

120595 (119903 119877) = 119864 120595 (119903 119877) (Id4)

Note that we can assume these wave functions are orthogonal

⟨120595 |120595 ⟩ = intint120595 (119903 119877)120595 (119903 119877)119889119903 119889119877 = 120575

Now to proceed let us first consider the fast r-Hamiltonian

[119877] = + 119881(119903 119877) (Id5)

In this ldquofast Hamiltonianrdquo [119877] the slow variables are simply parameters (it

contains no derivatives with respect to 119877) Thus it depends on 119877 parametrical-

ly The fast eigenvalue problem is

[119877]120601 (119903 119877) = 119882 (119877)120601 (119903 119877)

The eigenvalues are functions of the parameters 119877 They are called the adia-

batic (or Born-Oppenheimer) surfaces The notation with the semicolon be-

tween 119903 and 119877 is designed to emphasize that 120601 are wave functions in 119903 but

they depend only parametrically on 119877 This means for example that the over-

lap of the fast eigenstates (called adiabatic states) involves integration only

Electron Density Functional Theory Page 13

copy Roi Baer

over 119903 the dynamical coordinates while 119877 is held fixed since it is only a pa-

rameter

⟨120601 (119877)|120601 (119877)⟩ = int120601 (119903 119877)120601 (119903 119877)119889119903 = 120575

We cannot in fact say anything really meaningful about ⟨120601 (119877)|120601 (119877 )⟩ when

119877 ne 119877 (except when 119877 minus 119877 is infinitesimal but we will not go into this issue

here) Now we can expand the ldquorealrdquo wave-function as a linear combination

of the adiabatic functions

120595 (119903 119877) = sum120601 (119903 119877)Φ (119877)

(Id6)

We can do this because for any give 119877 120601 (119903 119877) span the space of wave func-

tions dependent on 119903 In fact the expansion coefficients Φ (119877) are given by

Φ (119877) = int120601 (119903 119877)120595 (119903 119877)119889119903

Now let us plug Eq lrm(Id6) into the SE lrm(Id4)

119864 120595 (119903 119877) = ( + )120595 (119903 119877)

= sum [120601 (119903 119877)Φ (119877)]

+ sum119882 (119877)120601 (119903 119877)Φ (119877)

(Id7)

Note that since = sum

where = minus119894ℏ

We have

[120601 (119903 119877)Φ (119877)]

= Φ (119877) [120601 (119903 119877)]

+ sum1

119872 ( 120601 (119903 119877)) ( Φ (119877))

+ 120601 (119903 119877) Φ (119877)

Multiplying Eq lrm(Id7) by 120601 (119903 119877) and integrating over 119903 gives

Electron Density Functional Theory Page 14

copy Roi Baer

sum(A + )Φ (119877)

+ ( + 119882 (119877))Φ (119877) = 119864 Φ (119877) (Id8)

Where

A = int120601 (119903 119877) [ 120601 (119903 119877)]119889119903 = ⟨120601 | 120601 ⟩

= sum

int120601 (119903 119877) [ 120601 (119903 119877)]119889119903

119872

= sum⟨120601 | 120601 ⟩

119872

= minussumℏ

119872 ⟨120601 |

120597

120597119877 120601 ⟩

120597

120597119877

= minussumℏ

119872 120591

120597

120597119877

The matrices 120591 = ⟨120601 |

120601 ⟩ are the non-adiabatic couplings in the ldquofastrdquo

system These are exactly the non-adiabatic coefficients in the adiabatic theory

(Eq lrm(Id3))

It is possible to show that

sum(A + )Φ (119877)

+ Φ (119877)

= sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

Thus Eq lrm(Id8) becomes

sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

This is a Schroumldinger ndashlike equation which determines the coefficients Φ (119877)

These are called the ldquoslow eigenfunctionsrdquo Once they are computed one has

an exact solution to the Schroumldinger equation However we do not really

want to solve this infinite set of coupled differential equations Thus we as-

sume that the quantities 120591 for 119895 ne 119896 can be neglected Note that here there is

no which can be taken as small as needed to make the effect of 120591 as mi-

Electron Density Functional Theory Page 15

copy Roi Baer

nute as we need Still we can hope that the fact that we chose 119877 to be slow de-

grees of freedom allow us to make just this approximation This was the idea

of Born and Oppenheimer who neglected 120591

summinusℏ

2119872

120597

120597119877 Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

The resulting equation is a Schroumldinger equation for the slow degrees of free-

dom which move under the force of electrons derived from a potential 119882 (119877)

When applied to molecules the slow degrees of freedom are usually the nuclei

and the fast - the electrons There is a problem with neglecting 120591 because of

their non-dynamical effect Taking them into account results in treating them

as a magnetic field

sumℏ

2119872 (119894

120597

120597119877 minus 120591 )

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877) (Id9)

The BO approximation breaks the molecular SE into two consecutive tasks

First the electronic SE Eq lrm(Id5) must be solved for any relevant clamped po-

sition of nuclei 119877 Then the nuclear equation lrm(Id9)

Further reading on this subject M Baer Beyond Born-Oppenheimer elec-

tronic non-adiabatic coupling terms and conical intersections (Wiley Hobo-

ken NJ 2006)

e) Electron correlation

vi The electronic wave function of two non-interacting electrons

We have stressed that the electronic structure problem is difficult because of

the complexity of the many-body wave function when electrons interact In

order to appreciate this complexity of the electronic wave function let us first

study a simple system of two non-interacting electrons in a 1D atomic well

Electron Density Functional Theory Page 16

copy Roi Baer

We consider an atomic well given by the potential 119907 (119909) and we place in it

an electron The Hamiltonian is

ℎ = minusℏ

2119898 nabla + 119907 (119909)

(Ie1)

Now consider a 2-electron problem Assume we have two electrons Fermi-

ons which are non-interacting in the well The Hamiltonian is

= ℎ(1) + ℎ(2) (Ie2)

The notation ℎ(119894) means the Hamiltonian of Eq lrm(Ie1) with the coordinate of

the i-th electron

What are the eigenstates in this case First since each electron can have a

spin we must decide on the spin of the state For now let us assume the state

is spin-polarized ie that the total spine is 1 both electrons are in spin-up or-

bitals We try the following form as a wave function

Ψ(119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]120572(1)120572(2) (Ie3)

Notice that the spatial part is anti-symmetric while the spin part is symmetric

This renders the entire wave-function anti-symmetric in accordance with the

Pauli principle The notation 120572(1)120572(2) means both electrons have spin projec-

tion up (120572) We do not yet know if and under what conditions this wave func-

tion can actually describe eigenstates of the two electrons the

an lrm(Ie2) We assume that 120595 (119909) and 120595 (119909) are orthonormal This makes the

state normalized since

Electron Density Functional Theory Page 17

copy Roi Baer

intΨ(119909 119909 ) 119889119909 119889119909 =

1

2int(120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 ))

119889119909 119889119909

=1

2int [(120595 (119909 )120595 (119909 ))

+ (120595 (119909 )120595 (119909 ))

minus 2120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 )] 119889119909 119889119909

(Ie4)

The first and second terms are both equal to

int(120595 (119909 )120595 (119909 )) 119889119909 119889119909 =int120595 (119909 )

119889119909 int120595 (119909 ) 119889119909 =1 times 1

= 1 (Ie5)

The third term in lrm(Ie4) is

int120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 ) 119889119909 119889119909

=int120595 (119909 )120595 (119909 ) 119889119909 int120595 (119909 )120595 (119909 ) 119889119909 = 0 times 0 = 0 (Ie6)

Indeed the entire wave function is orthonormal (thanks to the factor 1radic2

in lrm(Ie3))

Now let us see under what condition the wave function in Eq (Ie3) is an ei-

genstate of the Hamiltonian in lrm(Ie2)

Ψ(119909 119909 ) = (ℎ(1) + ℎ(2))Ψ(119909 119909 )

=1

radic2120595 (119909 )ℎ(1)120595 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

minus1

radic2120595 (119909 )ℎ(1)120601 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

(Ie7)

This should be equated to

119864Ψ(119909 119909 ) =1

radic2119864120595 (119909 )120595 (119909 ) minus 119864120595 (119909 )120595 (119909 ) (Ie8)

If we choose the orbitals 120595 (119909) and 120595 (119909) to be eigenstates of h (which are

orthogonal so that is compatible with our previous assumption)

Electron Density Functional Theory Page 18

copy Roi Baer

ℎ120595 (119909) = 120598 120595 (119909) 119894 = 01 hellip (Ie9)

Thus

Ψ(119909 119909 ) =1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

minus1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

= (120598 + 120598 )Ψ(119909 119909 )

(Ie10)

And we see that indeed Ψ(x x ) is an eigenstate with energy E = ϵ + ϵ

vii Correlation in action a wave function of 2 interacting particles in an

Harmonic trap

We now build a simple electronic structure model that will allow us to study

in some detail the most basic concepts For this we suppose that the electrons

are in a harmonic atom that is the potential well

119907 (119909) =1

2120583120596 119909 (Ie11)

The two lowest eigenstates of a Harmonic oscillator are

120595 (119909) = 119873 119890

120595 (119909) = 119873 radic120583120596

ℏ119909119890

(Ie12)

The normalization constants are

119873 = (120583120596

120587ℏ)

119873 = (4120583120596

120587ℏ)

(Ie13)

And the eigenvalues are

120598 = (119899 +1

2)ℏ120596 119899 = 01 hellip (Ie14)

The groundstate energy of the 2-electron system in the triplet state describes

one electron in 120595 (119909) and another in 120595 (119909)

Electron Density Functional Theory Page 19

copy Roi Baer

119864 = 120598 + 120598 = 2ℏ120596 (Ie15)

As discussed above singlet and triplet two-electron ground state wave func-

tions composed of two orbitals must be space-symmetric or antisymmetric

respectively We consider below 3 wave functions The first Ψ is the ground

state singlet where both electrons are in 120595 The second and third are a singlet

and a triplet made from one electron in 120595 and the other in 120595

Ψ (119909 119909 ) = 120595 (119909 )120595 (119909 ) = 119873 119890

ℏ(

)

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) + 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 + 119909 )

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 minus 119909 )

(Ie16)

The normalization factors are

119873 = 119873 = radic

120583120596

120587ℏ 119873 =

1

radic2radic120583120596

ℏ119873 119873 =

120583120596

120587ℏ

(Ie17)

Eq lrm(Ie16) describes the distribution of positions of both electrons in their

corresponding states We now want to ask how much are the electrons in this

state aware of each other Do they correlate their motion in some way One

way to measure correlation is to consider the quantity lang119909 119909 rang minus lang119909 ranglang119909 rang If

electrons are completely unaware of each other this quantity called the posi-

tion-autocorrelation function is zero because then the average of the product

of their position must decompose to the product of the average Any deviance

from zero indicates some degree of correlation For the triplet wave function

in Eq lrm(Ie16) we find

Electron Density Functional Theory Page 20

copy Roi Baer

lang119909 rang = ⟨Ψ |119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩)

=1

2(⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩) = 0

(Ie18)

Of-course the same result would be obtained if we calculate ⟨119909 ⟩ because the

electrons are equivalent Furthermore

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩)

= minus|⟨120595 |119909|120595 ⟩| = minus

1

2

120583120596

(Ie19)

We thus find that

lang119909 119909 rang minus lang119909 ranglang119909 rang = minus1

2

120583120596

(Ie20)

The position autocorrelation functional is non-zero even though the electrons

are non-interacting The correlation is due to the Pauli principle It is negative

because the Pauli principle pushes the electrons apart and since most of the

time they both want to be near the origin this gives preference to them being

placed on opposite sides (when one electron has positive x coordinate the

other has negative and vice versa) Letrsquos see what happens in the singlet wave

function Ψ Here too lang119909 rang = lang119909 rang = 0 Then

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ = ⟨120595 120595 |119909 119909 |120595 120595 ⟩

= ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ = 0 (Ie21)

Electron Density Functional Theory Page 21

copy Roi Baer

The singlet ground state shows no correlation because the Pauli principle is

applied here to the spins not the spatial part of the wave function Note how-

ever this does not mean that all singlet wave functions of 2 electrons have no

correlation Indeed this is the situation in Ψ where the development is

very similar to Eq lrm(Ie19) except that the minus sign is now a plus sign so

lang119909 119909 rang minus lang119909 ranglang119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ minus 0

=1

2⟨120595 120595 + 120595 120595 |119909 119909 |120595 120595 + 120595 120595 ⟩ = |⟨120595 |119909|120595 ⟩|

=1

2

120583120596

(Ie22)

Here we find positive autocorrelation indicative of the fact that spin-opposite

non-interacting pairs of electrons want to stick together (when one has posi-

tive x the other wants to have as well and when one has negative the other

one wants negative as well) like non-interacting bosons

Since there is no interaction between the electrons the correlation in these

wave functions arises only from the Pauli principle ie because we impose

the fact that electrons are Fermions This is called Fermi correlation Our les-

son is this

1) Wavefunctions that are mere products of singe-particle orbitals have no correlation 2) If the products are symmetrized like in the case of the excited singlet the correlation

lang119909 119909 rang is positive indicating that the particles ldquolike to be togetherrdquo ie both on the right or both on the left of the origin

3) If the products are anti-symmetrized like in the case of the triplet the correlation lang119909 119909 rang is negative indicating that the particles ldquolike to be separatedrdquo

Up to now we assumed no e-e interaction So now letrsquos include it and add to

the Hamiltonian an interaction term

= ℎ(1) + ℎ(2) + 119881(119909 119909 ) (Ie23)

Electron Density Functional Theory Page 22

copy Roi Baer

Let us take one case of coupling which is simple enough to yield to analytical

analysis

119881(119909 119909 ) = 120583120574 119909 119909 (Ie24)

With 120574 lt 120596 In this case the Hamiltonian is

=119901

2120583+

119901

2120583+

1

2120583120596 119909

+1

2120583120596 119909

+ 120583120574 119909 119909 (Ie25)

The interaction seems strange at first sight because it does not seem to depend

on the distance between the particles as we are used to from electrostatics

yet it does describe a repulsion We can see this by writing the Hamiltonian

as

=119901

2120583+

119901

2120583+

1

2120583(120596 + 120574 )119909

+1

2120583(120596 + 120574 )119909

minus1

2120583120574 (119909 minus 119909 )

(Ie26)

We now see that the Hamiltonian describes two particles in a Harmonic po-

tential 119881 (119909) =

120583(120596 + 120574 )119909 interacting by a Harmonic potential

minus

120583120574 (119909 minus 119909 )

The potential drops as |119909 minus 119909 | grows ie it is repulsive

Exercise lrmI-1

Find the eigenvalues and eigenfunctions of this Hamiltonian (You can define

new coordinates 119883 and 119909 by

radic = 119909 and

radic = 119909 )

Exercise lrmI-2

Write the ground-state energy and wave function for the triplet state of the

system in the previous exercise Determine the effect of interaction on the en-

ergy by calculating 119903 =

and on the correlation function 119888 = lang119909 119909 rang minus

lang119909 ranglang119909 rang

Solution The result is shown in Figure lrmI-2 (left panel)

Electron Density Functional Theory Page 23

copy Roi Baer

Figure lrmI-2 (Left panel) The ratio = (Right panel) the correlation ratio ( ) (

)

vs interaction strengths where = (

)

f) The electron density is a much simpler object

than the wave function

The complexity of the electronic wave function is overwhelming It includes

all the information we can have on the electrons in a molecule in a certain

state However all these intricacies and details are often uninteresting for us

in many cases we simply have no direct use for them Take for instance the

electronic energy - our primary target in the Born-Oppenheimer picture It

depends only on the relative distance of pairs of particles This is because the

electron-electron Coulomb repulsion is a pair-wise interaction

One interesting quantity besides energy that can be extracted from the elec-

tronic wave function is the electron density 119899(119955) This 3D function tells us the

expectation value of the density of electrons That is 119899(119955)119889 119903 is the expectation

value of the number of electrons at a small volume 119889 119903 around point r Thus

we can write

119899(119955) = ⟨120595|(119955)|120595⟩ (If1)

We use the notation

00 01 02 03 04 05

04

05

06

07

08

09

10

r

00 01 02 03 04 050

5

10

15

cc

2

Electron Density Functional Theory Page 24

copy Roi Baer

⟨120595|120601⟩ equiv int120595 (119955 119955 hellip 119955 )120601(119955 119955 hellip 119955

)119889 119903 119889 119903 hellip119889 119903

(If2)

Here (119955) is the operator corresponding to the electron number density Since

electrons are point particles and the position operator for the ith electron is 119955

this operator is defined by

(119955) = sum120575( minus 119955)

(If3)

We used the definition of a 120575-function according to which

int120575(119955 minus 119955)119891(119955 )119889 119903 = 119891(119955)

(If4)

The function 120575( minus 119955) is the density of electron 119894 at 119955

Exercise lrmI-3

Calculate the expectation value of the electron number density and show that

119899(119955) = 119873 int120595(119955 119955 hellip 119955 )120595(119955 119955 hellip 119955

)119889 119903 hellip119889 119903

(If5)

Looking at Eq lrm(If5) we see that 119899(119955) involves integrating out a huge amount

of wave-function details It is as if only the data concerning the density distri-

bution of a single electron remains This is multiplied by 119873 in Eq lrm(If5) so

119899(119955) accounts for the combined density of all electrons Indeed integrating

over the entire space one obtains from lrm(If5)

int119899(119955)119889 119903 = 119873 (If6)

Expressing the fact that the total number of electrons is 119873

Exercise lrmI-4

Calculate and plot the 1D electron density of the triplet ground state from Ex-

ercise lrmI-2 as a function of the interaction parameter

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 3: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 3

copy Roi Baer

We will augment slightly the function 120595 by some 120575120595 and show that this

leads to reduction in energy Note that the energy change in an arbitrary per-

turbation is

120575119864[|120595|] = 120575119879[|120595|] + 2int |120595(119909)|(119881(119909) minus 119864[|120595|])|120575120595(119909)|119889119909

Exercise Verify this and show we explicitly assumed that the norm of 120595 is 1

Our augmentation smoothes the node as shown in Figure lrmI-1 determining a

large enough parameter 119886 such that the parabola

119901(119909) =1

2119886(119909 minus 119909

) + 119888

has the properties that it is tangent at some 119909 and 119909 to |120595(119909)| (where

119909 lt 0 lt 119909 ) The new wave function is nodeless

120594(119909) = |120595(119909)| 119909 lt 119909 119900119903 119909 gt 119909

119901(119909) 119909 lt 119909 lt 119909

Note that by decreasing 119886 the parabola becomes narrower and the following

limits are obtained

minus119909 rarr 119909 rarr |120595 (0)|119886 rarr 0

119888 rarr1

2 |120595 (0)| 119886 rarr 0

Note that because the function has a node at 119909 = 0 the potential energy

changes only to second order with 119886 119881[120594] minus 119881[120595] prop 119886 The kinetic energy on

the other hand has a linear dependence 119879[120594] minus 119879[120595] asymp minusℏ

|120595 (0)| 119886 But

note that it is negative Thus as 119886 rarr 0 the total change in energy is dominated

by the change in the kinetic energy and is negative This shows that 119864[120594] lt

119864[|120595|] = 119864[120595] Hence the ground state cannot have nodes otherwise one can

remove them lowering the energy further

Electron Density Functional Theory Page 4

copy Roi Baer

This proof can be extended to any number of dimensions It can be used to

prove that the ground state of a many-boson wave function has no nodes

Exercise Prove the corollary the ground-state of boson systems is non-

degenerate

ii Ground-state wave function of Fermions

The ground-state wave function of 119873 gt 1 Fermions is a much more compli-

cated object than that of 119873 gt 1 Bosons This is because of the Pauli principle

which states 120595(119909 119909 ) = minus120595(119909 119909 ) ie 120595 is bound to have nodes in general

Thus the Fermionic ground-state is actually a highly excited eigenstate of the

Hamiltonian It can still be described by a minimum principle namely the

normalized many-body antisymmetric wave-function which minimizes

119864[120595] = ⟨120595||120595⟩ This formulation of the ground state problem is very conven-

ient We do not have to look for many eigenstates of we need only search for

minimum Finding the minimal energy and the expectation values of various

physically important operators is the main challenge of electronic structure

theory Doing the same for low-lying excited states is the grand challenge of

this field

c) Why electronic structure is an important dif-

ficult problem

In this course we will study methods to treat electronic structure This prob-

lem is often considerably more complicated than nuclear dynamics of mole-

cules The most obvious reason is that usually there are many more electrons

than nuclei in a molecule However there are more reasons electrons in mol-

ecules have extreme quantum mechanical character while nuclei are more

classical This last statement requires probably some explanation We will

Electron Density Functional Theory Page 5

copy Roi Baer

discuss in this course ways of mapping the electrons in a molecule onto a sys-

tem of non-interacting particles In such a picture each electron has its own

orbital or wave-function Generally the orbitals of different electrons in mole-

cules strongly overlap Furthermore most of the valence orbitals are widely

spread over space encapsulating many molecular nuclei Nuclei on the other

hand are massive so their wave functions are strongly localized and hardly

overlap once again they behave much more like classical particles

Describing electronic structure is therefore a quantum mechanical problem

Now the electrons interact via the attractive Coulomb force not only with

the stationary nuclei but also with each other each electron repels each other

electron via the repulsive Coulomb force This makes the electronic structure

problem a many-body problem

Quantum many-body problems are very difficult to track down much more

difficult than classical mechanical ones The reason is that for 119873 electrons the

many-body wave function is then a function of 3119873 variables 120595(119955 hellip 119955 )

where ir (i=1hellip eN ) is 3 dimensional the position vector Highly accurate nu-

merical representation of such functions is next to impossible for 119873 gt 1

Thus electronic structure methods invariably include approximate methods

Typically the wave function encapsulates much more information than we

care to know about For example int|120595(119955 hellip 119955 )| 119889 119903 sdotsdotsdot 119889 119903 gives the proba-

bility for finding an electron at 119955 and an electron at 119955 and an electron at 119955

etc Indeed the total probability is 1

int|120595(119955 hellip 119955 )|

119889 119903 hellip119889 119903

= 1 (Ic1)

Electron Density Functional Theory Page 6

copy Roi Baer

d) The Born-Oppenheimer theory

iii The adiabatic theorem

Suppose a Hamiltonian is dependent parametrically on 119877 = (119877 119877 hellip ) We

write this as [119877] Denote the eigenstates of this Hamiltonian by

[119877]120595 [119877] = 119864 [119877]120595 [119877]

Now suppose we change 119877 with time so that we have a trajectory 119877(119905) The

Hamiltonian will become time dependent (119905) = [119877(119905)] Suppose the sys-

tem is placed in its ground state at time = 0 120601(119905 = 0) = 120595 [119877(0)] It will

evolve according to the TDSE

119894ℏ(119905) = (119905)120601(119905)

Now the adiabatic theorem says that if 119877(119905) changes very slowly then

120601(119905) = 119890 ( )120595 [119877(119905)]

Namely except for a purely TD phase the evolving state stays the instanta-

neous ground state of the Hamiltonian (119905)

To show this we use the instantaneous eigenstates of the Hamiltonian (119905) to

expand 120601(119905)

120601(119905) = sum119886 (119905)119890

ℏint ( ) 120595 [119877(119905)]

We want to plug this expansion into the SE So let us calculate

119894ℏ(119905) = sum[119894ℏ + 119864 119886 ]119890

ℏint ( ) 120595 [119877(119905)]

+ sum119886 119890

ℏint ( ) 119894ℏ [119877(119905)]

And

Electron Density Functional Theory Page 7

copy Roi Baer

(119905)120601 = sum119886 119890

ℏint ( ) 119864 120595 [119877(119905)]

Equating the two expressions gives

0 = sum 119894ℏ 119890

ℏint ( ) 120595 [119877(119905)]

+ sum119886 119890

ℏint ( ) 119894ℏ [119877(119905)]

Now let us multiply by 120595 and integrate (remember that ⟨120595 |120595 ⟩ = 120575 )

0 = 119894ℏ 119890

ℏint ( ) + sum119886 119890

ℏint ( ) 119894ℏ⟨120595 | ⟩

Now we have

0 =119889

119889119905⟨120595 |120595 ⟩ = ⟨120595 | ⟩ + ⟨ |120595 ⟩

And since ⟨ |120595 ⟩ = ⟨120595 | ⟩ we find

120591 = ⟨120595 | ⟩ = minus⟨120595 | ⟩ = minus120591

The matrix 120591 = ⟨120595 | ⟩ is the matrix of ldquotime-dependent non-adiabatic

couplings (TDNACs)rdquo and it is an anti-Hermitean matrix Furthermore for

119899 = 119898 we see

119877119890⟨120595 | ⟩ = 0

Thus

= minus120591 119886 minus sum 119890 int ( ) 120591 119886

(Id1)

We see that the TDNACS 120591 create the non-adiabatic transitions ie the

transitions out of the ground state

Let us play a bit with 120591 Take the time derivative of the TISE

Electron Density Functional Theory Page 8

copy Roi Baer

(119905)120595 (119905) + (119905) (119905) = (119905)120595 (119905) + 119864 (119905) (119905)

Multiply by ⟨120595 | and get

⟨120595 || 120595 ⟩ + ⟨120595 || ⟩ = (119905)⟨120595 |120595 ⟩ + 119864 ⟨120595 | ⟩

Since ⟨120595 || ⟩ = 119864 ⟨120595 | ⟩ we have

⟨120595 || 120595 ⟩ = (119905)⟨120595 |120595 ⟩ + (119864 minus 119864 )⟨120595 | ⟩

If 119899 = 119898 then

(119905) = ⟨120595 || 120595 ⟩

This is called the Hellmann-Feynman theorem showing that the power is the

expectation value of the rate of change of the Hamiltonian If 119899 ne 119898

120591 = ⟨120595 | ⟩ =⟨120595 || 120595 ⟩

119864 minus 119864 (Id2)

This is called Epsteinrsquos theorem giving an expression for the TDNACs Re-

member that [119877(119905)] depends on 119905 through the positions of the nuclei Thus

[119877(119905)] =119889

119889119905[119877(119905)] =

part

partR [119877(119905)]

Thus

120591 = sum⟨120595 |

[119877(119905)]| 120595 ⟩

119864 [119877] minus 119864 [119877] (119905)

= sum120591 (119905)

Where the time-independent NACS are defined as

120591 [119877] = ⟨120595 |

120597

120597119877 120595 ⟩ =

⟨120595 [119877] |

[119877]| 120595 [119877]⟩

119864 [119877] minus 119864 [119877] (Id3)

Electron Density Functional Theory Page 9

copy Roi Baer

We see that the TDNACs depend on the velocity of the nuclei When nuclei

are small the couplings out of the ground state are small The 120591 are called

the ldquonon-adiabatic couplings (NACs)rdquo We find that as long as 119864 minus 119864 are not

zero on the trajectory one can always find small enough that makes the

TDNACs as small as we wish All we need for states 119899 and 119898 to stay decou-

pled is for the following conditions to be met

|120591 | = | (119905)120591 (119877 (119905))| ≪ 1

Thus we can make the NACs as small as we wish by taking the rate (119905) to

be small enough So all that is left in Eq lrm(Id1) is = minus120591 119886 Define the

non-dynamical phase as 120579 (119905) = 119894 int 120591 (119905 ) 119889119905

(note 120579 is real) and then

119886 (119905) = 119890 ( ) (119904119897119900119908 119901119903119900119888119890119904119904119890119904 = 119886119889119894119886119887119886119905119894119888 119901119903119900119888119890119904119904)

The total state is then

120601(119905) = 119886 (119905)119890

ℏint ( ) 120595 [119877(119905)] = 119890 ( ( ) ( ))120595 [119877(119905)]

where 120579 (119905) = minus

ℏint 119864 (120591)119889120591

is called the dynamical phase It is easy to prove

that if 119877(119905) traverses a closed loop the non-dynamical phase depends only on

that loop and not on the way it is traversed For example if we traverse the

same loop using different velocities the dynamic phase may change but the

geometric phase will not The closed loop geometric angle is called the Berry

phase The independence of path is a result of the fact that the non-dynamical

phase is a line integral and can be made with no reference to time

Electron Density Functional Theory Page 10

copy Roi Baer

120579 (119905) = 119894 int ⟨120595 (119877[120591])| (119877[120591])⟩119889120591

= 119894 int sum⟨120595 [119877]|120597

120597119877 120595 [119877]⟩

119889120591

= 119894 int sum120591 (119877)

119889119877

( )

( )

The dynamical phase for example is not a line integral and its value depends

not only on the path itself but also on the velocity taken along the way This

observation makes the non-dynamical phase a special quantity Berry has

shown that this quantity can give us information on the way the Hamiltonian

is dependent on its parameters For a real Hamiltonian for example 119890

around a closed path always equals either 1 or -1 If there is an even number

of degeneracies enclosed by the path it is 1 and if an odd number ndash it is -1

iv Motivation for the Born-Oppenheimer approximation classical nuclei

In a molecule we can think of the nuclei (having coordinates 119877) as heavy and

therefore classical particles which are slowly moving The electrons are light

and therefore quantum They have coordinates 119903 (119903 includes spin) and they

119877

119877

A path in parameter

ldquospacerdquo 119877 minus 119877

Electron Density Functional Theory Page 11

copy Roi Baer

feel the effect of slowly moving nuclei The electron dynamics is controlled by

the Hamiltonian

[119877] = + 119881(119903 119877)

The potential 119881(119903 119877) describes the Coulombic interactions between electrons

and nuclei (in atomic units)

119881(119903 119877) = minussum119885

| minus 119903 |

+1

2sum

1

119903

+1

2sum

119885 119885

119877

In general one wants to assume that the total energy of the molecule in this

classical approximation is

119864 = 119879 + ⟨120595 [119877]| [119877(119905)]|120595 [119877]⟩

Where 120595 [119877] is the ground state of the electrons at nuclear configuration 119877

The adiabatic theorem states that this is reasonable as long as nuclei move

slowly Thus the adiabatic theorem allows us to write

119864 = 119879 + 119881 (119877)

where 119881 (119877) is the ground state energy of the electrons at nuclear configura-

tion 119877 This energy is a usual classical mechanical energy and the Newton

equations of motion apply

119872 = 119865 = minus120597

120597119877 119881 (119877)

We see that the adiabatic theorem allows us to consider the nuclei as moving

in a potential well which is essentially the eigenvalues of the electrons This in

essence is the BO approximation when the nuclei are classical

Electron Density Functional Theory Page 12

copy Roi Baer

v The Born-Oppenheimer approximation in quantum nuclear case

The classical approach motivates a quantum treatment We are expecting that

nuclei will not excite electrons very efficiently That is the motivation for the

BO approximation

The Born and Oppenheimer development is similar to that of the adiabatic

theorem although there are no ldquoexternalrdquo fields Suppose we have a system of

fast particles with coordinates 119903 = (119903 119903 hellip ) and slow particles with coordi-

nates 119877 = (119877 119877 hellip ) The Hamiltonian can be written as

= + + 119881(119903 119877)

The Schroumldinger equation is

120595 (119903 119877) = 119864 120595 (119903 119877) (Id4)

Note that we can assume these wave functions are orthogonal

⟨120595 |120595 ⟩ = intint120595 (119903 119877)120595 (119903 119877)119889119903 119889119877 = 120575

Now to proceed let us first consider the fast r-Hamiltonian

[119877] = + 119881(119903 119877) (Id5)

In this ldquofast Hamiltonianrdquo [119877] the slow variables are simply parameters (it

contains no derivatives with respect to 119877) Thus it depends on 119877 parametrical-

ly The fast eigenvalue problem is

[119877]120601 (119903 119877) = 119882 (119877)120601 (119903 119877)

The eigenvalues are functions of the parameters 119877 They are called the adia-

batic (or Born-Oppenheimer) surfaces The notation with the semicolon be-

tween 119903 and 119877 is designed to emphasize that 120601 are wave functions in 119903 but

they depend only parametrically on 119877 This means for example that the over-

lap of the fast eigenstates (called adiabatic states) involves integration only

Electron Density Functional Theory Page 13

copy Roi Baer

over 119903 the dynamical coordinates while 119877 is held fixed since it is only a pa-

rameter

⟨120601 (119877)|120601 (119877)⟩ = int120601 (119903 119877)120601 (119903 119877)119889119903 = 120575

We cannot in fact say anything really meaningful about ⟨120601 (119877)|120601 (119877 )⟩ when

119877 ne 119877 (except when 119877 minus 119877 is infinitesimal but we will not go into this issue

here) Now we can expand the ldquorealrdquo wave-function as a linear combination

of the adiabatic functions

120595 (119903 119877) = sum120601 (119903 119877)Φ (119877)

(Id6)

We can do this because for any give 119877 120601 (119903 119877) span the space of wave func-

tions dependent on 119903 In fact the expansion coefficients Φ (119877) are given by

Φ (119877) = int120601 (119903 119877)120595 (119903 119877)119889119903

Now let us plug Eq lrm(Id6) into the SE lrm(Id4)

119864 120595 (119903 119877) = ( + )120595 (119903 119877)

= sum [120601 (119903 119877)Φ (119877)]

+ sum119882 (119877)120601 (119903 119877)Φ (119877)

(Id7)

Note that since = sum

where = minus119894ℏ

We have

[120601 (119903 119877)Φ (119877)]

= Φ (119877) [120601 (119903 119877)]

+ sum1

119872 ( 120601 (119903 119877)) ( Φ (119877))

+ 120601 (119903 119877) Φ (119877)

Multiplying Eq lrm(Id7) by 120601 (119903 119877) and integrating over 119903 gives

Electron Density Functional Theory Page 14

copy Roi Baer

sum(A + )Φ (119877)

+ ( + 119882 (119877))Φ (119877) = 119864 Φ (119877) (Id8)

Where

A = int120601 (119903 119877) [ 120601 (119903 119877)]119889119903 = ⟨120601 | 120601 ⟩

= sum

int120601 (119903 119877) [ 120601 (119903 119877)]119889119903

119872

= sum⟨120601 | 120601 ⟩

119872

= minussumℏ

119872 ⟨120601 |

120597

120597119877 120601 ⟩

120597

120597119877

= minussumℏ

119872 120591

120597

120597119877

The matrices 120591 = ⟨120601 |

120601 ⟩ are the non-adiabatic couplings in the ldquofastrdquo

system These are exactly the non-adiabatic coefficients in the adiabatic theory

(Eq lrm(Id3))

It is possible to show that

sum(A + )Φ (119877)

+ Φ (119877)

= sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

Thus Eq lrm(Id8) becomes

sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

This is a Schroumldinger ndashlike equation which determines the coefficients Φ (119877)

These are called the ldquoslow eigenfunctionsrdquo Once they are computed one has

an exact solution to the Schroumldinger equation However we do not really

want to solve this infinite set of coupled differential equations Thus we as-

sume that the quantities 120591 for 119895 ne 119896 can be neglected Note that here there is

no which can be taken as small as needed to make the effect of 120591 as mi-

Electron Density Functional Theory Page 15

copy Roi Baer

nute as we need Still we can hope that the fact that we chose 119877 to be slow de-

grees of freedom allow us to make just this approximation This was the idea

of Born and Oppenheimer who neglected 120591

summinusℏ

2119872

120597

120597119877 Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

The resulting equation is a Schroumldinger equation for the slow degrees of free-

dom which move under the force of electrons derived from a potential 119882 (119877)

When applied to molecules the slow degrees of freedom are usually the nuclei

and the fast - the electrons There is a problem with neglecting 120591 because of

their non-dynamical effect Taking them into account results in treating them

as a magnetic field

sumℏ

2119872 (119894

120597

120597119877 minus 120591 )

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877) (Id9)

The BO approximation breaks the molecular SE into two consecutive tasks

First the electronic SE Eq lrm(Id5) must be solved for any relevant clamped po-

sition of nuclei 119877 Then the nuclear equation lrm(Id9)

Further reading on this subject M Baer Beyond Born-Oppenheimer elec-

tronic non-adiabatic coupling terms and conical intersections (Wiley Hobo-

ken NJ 2006)

e) Electron correlation

vi The electronic wave function of two non-interacting electrons

We have stressed that the electronic structure problem is difficult because of

the complexity of the many-body wave function when electrons interact In

order to appreciate this complexity of the electronic wave function let us first

study a simple system of two non-interacting electrons in a 1D atomic well

Electron Density Functional Theory Page 16

copy Roi Baer

We consider an atomic well given by the potential 119907 (119909) and we place in it

an electron The Hamiltonian is

ℎ = minusℏ

2119898 nabla + 119907 (119909)

(Ie1)

Now consider a 2-electron problem Assume we have two electrons Fermi-

ons which are non-interacting in the well The Hamiltonian is

= ℎ(1) + ℎ(2) (Ie2)

The notation ℎ(119894) means the Hamiltonian of Eq lrm(Ie1) with the coordinate of

the i-th electron

What are the eigenstates in this case First since each electron can have a

spin we must decide on the spin of the state For now let us assume the state

is spin-polarized ie that the total spine is 1 both electrons are in spin-up or-

bitals We try the following form as a wave function

Ψ(119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]120572(1)120572(2) (Ie3)

Notice that the spatial part is anti-symmetric while the spin part is symmetric

This renders the entire wave-function anti-symmetric in accordance with the

Pauli principle The notation 120572(1)120572(2) means both electrons have spin projec-

tion up (120572) We do not yet know if and under what conditions this wave func-

tion can actually describe eigenstates of the two electrons the

an lrm(Ie2) We assume that 120595 (119909) and 120595 (119909) are orthonormal This makes the

state normalized since

Electron Density Functional Theory Page 17

copy Roi Baer

intΨ(119909 119909 ) 119889119909 119889119909 =

1

2int(120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 ))

119889119909 119889119909

=1

2int [(120595 (119909 )120595 (119909 ))

+ (120595 (119909 )120595 (119909 ))

minus 2120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 )] 119889119909 119889119909

(Ie4)

The first and second terms are both equal to

int(120595 (119909 )120595 (119909 )) 119889119909 119889119909 =int120595 (119909 )

119889119909 int120595 (119909 ) 119889119909 =1 times 1

= 1 (Ie5)

The third term in lrm(Ie4) is

int120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 ) 119889119909 119889119909

=int120595 (119909 )120595 (119909 ) 119889119909 int120595 (119909 )120595 (119909 ) 119889119909 = 0 times 0 = 0 (Ie6)

Indeed the entire wave function is orthonormal (thanks to the factor 1radic2

in lrm(Ie3))

Now let us see under what condition the wave function in Eq (Ie3) is an ei-

genstate of the Hamiltonian in lrm(Ie2)

Ψ(119909 119909 ) = (ℎ(1) + ℎ(2))Ψ(119909 119909 )

=1

radic2120595 (119909 )ℎ(1)120595 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

minus1

radic2120595 (119909 )ℎ(1)120601 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

(Ie7)

This should be equated to

119864Ψ(119909 119909 ) =1

radic2119864120595 (119909 )120595 (119909 ) minus 119864120595 (119909 )120595 (119909 ) (Ie8)

If we choose the orbitals 120595 (119909) and 120595 (119909) to be eigenstates of h (which are

orthogonal so that is compatible with our previous assumption)

Electron Density Functional Theory Page 18

copy Roi Baer

ℎ120595 (119909) = 120598 120595 (119909) 119894 = 01 hellip (Ie9)

Thus

Ψ(119909 119909 ) =1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

minus1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

= (120598 + 120598 )Ψ(119909 119909 )

(Ie10)

And we see that indeed Ψ(x x ) is an eigenstate with energy E = ϵ + ϵ

vii Correlation in action a wave function of 2 interacting particles in an

Harmonic trap

We now build a simple electronic structure model that will allow us to study

in some detail the most basic concepts For this we suppose that the electrons

are in a harmonic atom that is the potential well

119907 (119909) =1

2120583120596 119909 (Ie11)

The two lowest eigenstates of a Harmonic oscillator are

120595 (119909) = 119873 119890

120595 (119909) = 119873 radic120583120596

ℏ119909119890

(Ie12)

The normalization constants are

119873 = (120583120596

120587ℏ)

119873 = (4120583120596

120587ℏ)

(Ie13)

And the eigenvalues are

120598 = (119899 +1

2)ℏ120596 119899 = 01 hellip (Ie14)

The groundstate energy of the 2-electron system in the triplet state describes

one electron in 120595 (119909) and another in 120595 (119909)

Electron Density Functional Theory Page 19

copy Roi Baer

119864 = 120598 + 120598 = 2ℏ120596 (Ie15)

As discussed above singlet and triplet two-electron ground state wave func-

tions composed of two orbitals must be space-symmetric or antisymmetric

respectively We consider below 3 wave functions The first Ψ is the ground

state singlet where both electrons are in 120595 The second and third are a singlet

and a triplet made from one electron in 120595 and the other in 120595

Ψ (119909 119909 ) = 120595 (119909 )120595 (119909 ) = 119873 119890

ℏ(

)

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) + 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 + 119909 )

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 minus 119909 )

(Ie16)

The normalization factors are

119873 = 119873 = radic

120583120596

120587ℏ 119873 =

1

radic2radic120583120596

ℏ119873 119873 =

120583120596

120587ℏ

(Ie17)

Eq lrm(Ie16) describes the distribution of positions of both electrons in their

corresponding states We now want to ask how much are the electrons in this

state aware of each other Do they correlate their motion in some way One

way to measure correlation is to consider the quantity lang119909 119909 rang minus lang119909 ranglang119909 rang If

electrons are completely unaware of each other this quantity called the posi-

tion-autocorrelation function is zero because then the average of the product

of their position must decompose to the product of the average Any deviance

from zero indicates some degree of correlation For the triplet wave function

in Eq lrm(Ie16) we find

Electron Density Functional Theory Page 20

copy Roi Baer

lang119909 rang = ⟨Ψ |119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩)

=1

2(⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩) = 0

(Ie18)

Of-course the same result would be obtained if we calculate ⟨119909 ⟩ because the

electrons are equivalent Furthermore

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩)

= minus|⟨120595 |119909|120595 ⟩| = minus

1

2

120583120596

(Ie19)

We thus find that

lang119909 119909 rang minus lang119909 ranglang119909 rang = minus1

2

120583120596

(Ie20)

The position autocorrelation functional is non-zero even though the electrons

are non-interacting The correlation is due to the Pauli principle It is negative

because the Pauli principle pushes the electrons apart and since most of the

time they both want to be near the origin this gives preference to them being

placed on opposite sides (when one electron has positive x coordinate the

other has negative and vice versa) Letrsquos see what happens in the singlet wave

function Ψ Here too lang119909 rang = lang119909 rang = 0 Then

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ = ⟨120595 120595 |119909 119909 |120595 120595 ⟩

= ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ = 0 (Ie21)

Electron Density Functional Theory Page 21

copy Roi Baer

The singlet ground state shows no correlation because the Pauli principle is

applied here to the spins not the spatial part of the wave function Note how-

ever this does not mean that all singlet wave functions of 2 electrons have no

correlation Indeed this is the situation in Ψ where the development is

very similar to Eq lrm(Ie19) except that the minus sign is now a plus sign so

lang119909 119909 rang minus lang119909 ranglang119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ minus 0

=1

2⟨120595 120595 + 120595 120595 |119909 119909 |120595 120595 + 120595 120595 ⟩ = |⟨120595 |119909|120595 ⟩|

=1

2

120583120596

(Ie22)

Here we find positive autocorrelation indicative of the fact that spin-opposite

non-interacting pairs of electrons want to stick together (when one has posi-

tive x the other wants to have as well and when one has negative the other

one wants negative as well) like non-interacting bosons

Since there is no interaction between the electrons the correlation in these

wave functions arises only from the Pauli principle ie because we impose

the fact that electrons are Fermions This is called Fermi correlation Our les-

son is this

1) Wavefunctions that are mere products of singe-particle orbitals have no correlation 2) If the products are symmetrized like in the case of the excited singlet the correlation

lang119909 119909 rang is positive indicating that the particles ldquolike to be togetherrdquo ie both on the right or both on the left of the origin

3) If the products are anti-symmetrized like in the case of the triplet the correlation lang119909 119909 rang is negative indicating that the particles ldquolike to be separatedrdquo

Up to now we assumed no e-e interaction So now letrsquos include it and add to

the Hamiltonian an interaction term

= ℎ(1) + ℎ(2) + 119881(119909 119909 ) (Ie23)

Electron Density Functional Theory Page 22

copy Roi Baer

Let us take one case of coupling which is simple enough to yield to analytical

analysis

119881(119909 119909 ) = 120583120574 119909 119909 (Ie24)

With 120574 lt 120596 In this case the Hamiltonian is

=119901

2120583+

119901

2120583+

1

2120583120596 119909

+1

2120583120596 119909

+ 120583120574 119909 119909 (Ie25)

The interaction seems strange at first sight because it does not seem to depend

on the distance between the particles as we are used to from electrostatics

yet it does describe a repulsion We can see this by writing the Hamiltonian

as

=119901

2120583+

119901

2120583+

1

2120583(120596 + 120574 )119909

+1

2120583(120596 + 120574 )119909

minus1

2120583120574 (119909 minus 119909 )

(Ie26)

We now see that the Hamiltonian describes two particles in a Harmonic po-

tential 119881 (119909) =

120583(120596 + 120574 )119909 interacting by a Harmonic potential

minus

120583120574 (119909 minus 119909 )

The potential drops as |119909 minus 119909 | grows ie it is repulsive

Exercise lrmI-1

Find the eigenvalues and eigenfunctions of this Hamiltonian (You can define

new coordinates 119883 and 119909 by

radic = 119909 and

radic = 119909 )

Exercise lrmI-2

Write the ground-state energy and wave function for the triplet state of the

system in the previous exercise Determine the effect of interaction on the en-

ergy by calculating 119903 =

and on the correlation function 119888 = lang119909 119909 rang minus

lang119909 ranglang119909 rang

Solution The result is shown in Figure lrmI-2 (left panel)

Electron Density Functional Theory Page 23

copy Roi Baer

Figure lrmI-2 (Left panel) The ratio = (Right panel) the correlation ratio ( ) (

)

vs interaction strengths where = (

)

f) The electron density is a much simpler object

than the wave function

The complexity of the electronic wave function is overwhelming It includes

all the information we can have on the electrons in a molecule in a certain

state However all these intricacies and details are often uninteresting for us

in many cases we simply have no direct use for them Take for instance the

electronic energy - our primary target in the Born-Oppenheimer picture It

depends only on the relative distance of pairs of particles This is because the

electron-electron Coulomb repulsion is a pair-wise interaction

One interesting quantity besides energy that can be extracted from the elec-

tronic wave function is the electron density 119899(119955) This 3D function tells us the

expectation value of the density of electrons That is 119899(119955)119889 119903 is the expectation

value of the number of electrons at a small volume 119889 119903 around point r Thus

we can write

119899(119955) = ⟨120595|(119955)|120595⟩ (If1)

We use the notation

00 01 02 03 04 05

04

05

06

07

08

09

10

r

00 01 02 03 04 050

5

10

15

cc

2

Electron Density Functional Theory Page 24

copy Roi Baer

⟨120595|120601⟩ equiv int120595 (119955 119955 hellip 119955 )120601(119955 119955 hellip 119955

)119889 119903 119889 119903 hellip119889 119903

(If2)

Here (119955) is the operator corresponding to the electron number density Since

electrons are point particles and the position operator for the ith electron is 119955

this operator is defined by

(119955) = sum120575( minus 119955)

(If3)

We used the definition of a 120575-function according to which

int120575(119955 minus 119955)119891(119955 )119889 119903 = 119891(119955)

(If4)

The function 120575( minus 119955) is the density of electron 119894 at 119955

Exercise lrmI-3

Calculate the expectation value of the electron number density and show that

119899(119955) = 119873 int120595(119955 119955 hellip 119955 )120595(119955 119955 hellip 119955

)119889 119903 hellip119889 119903

(If5)

Looking at Eq lrm(If5) we see that 119899(119955) involves integrating out a huge amount

of wave-function details It is as if only the data concerning the density distri-

bution of a single electron remains This is multiplied by 119873 in Eq lrm(If5) so

119899(119955) accounts for the combined density of all electrons Indeed integrating

over the entire space one obtains from lrm(If5)

int119899(119955)119889 119903 = 119873 (If6)

Expressing the fact that the total number of electrons is 119873

Exercise lrmI-4

Calculate and plot the 1D electron density of the triplet ground state from Ex-

ercise lrmI-2 as a function of the interaction parameter

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 4: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 4

copy Roi Baer

This proof can be extended to any number of dimensions It can be used to

prove that the ground state of a many-boson wave function has no nodes

Exercise Prove the corollary the ground-state of boson systems is non-

degenerate

ii Ground-state wave function of Fermions

The ground-state wave function of 119873 gt 1 Fermions is a much more compli-

cated object than that of 119873 gt 1 Bosons This is because of the Pauli principle

which states 120595(119909 119909 ) = minus120595(119909 119909 ) ie 120595 is bound to have nodes in general

Thus the Fermionic ground-state is actually a highly excited eigenstate of the

Hamiltonian It can still be described by a minimum principle namely the

normalized many-body antisymmetric wave-function which minimizes

119864[120595] = ⟨120595||120595⟩ This formulation of the ground state problem is very conven-

ient We do not have to look for many eigenstates of we need only search for

minimum Finding the minimal energy and the expectation values of various

physically important operators is the main challenge of electronic structure

theory Doing the same for low-lying excited states is the grand challenge of

this field

c) Why electronic structure is an important dif-

ficult problem

In this course we will study methods to treat electronic structure This prob-

lem is often considerably more complicated than nuclear dynamics of mole-

cules The most obvious reason is that usually there are many more electrons

than nuclei in a molecule However there are more reasons electrons in mol-

ecules have extreme quantum mechanical character while nuclei are more

classical This last statement requires probably some explanation We will

Electron Density Functional Theory Page 5

copy Roi Baer

discuss in this course ways of mapping the electrons in a molecule onto a sys-

tem of non-interacting particles In such a picture each electron has its own

orbital or wave-function Generally the orbitals of different electrons in mole-

cules strongly overlap Furthermore most of the valence orbitals are widely

spread over space encapsulating many molecular nuclei Nuclei on the other

hand are massive so their wave functions are strongly localized and hardly

overlap once again they behave much more like classical particles

Describing electronic structure is therefore a quantum mechanical problem

Now the electrons interact via the attractive Coulomb force not only with

the stationary nuclei but also with each other each electron repels each other

electron via the repulsive Coulomb force This makes the electronic structure

problem a many-body problem

Quantum many-body problems are very difficult to track down much more

difficult than classical mechanical ones The reason is that for 119873 electrons the

many-body wave function is then a function of 3119873 variables 120595(119955 hellip 119955 )

where ir (i=1hellip eN ) is 3 dimensional the position vector Highly accurate nu-

merical representation of such functions is next to impossible for 119873 gt 1

Thus electronic structure methods invariably include approximate methods

Typically the wave function encapsulates much more information than we

care to know about For example int|120595(119955 hellip 119955 )| 119889 119903 sdotsdotsdot 119889 119903 gives the proba-

bility for finding an electron at 119955 and an electron at 119955 and an electron at 119955

etc Indeed the total probability is 1

int|120595(119955 hellip 119955 )|

119889 119903 hellip119889 119903

= 1 (Ic1)

Electron Density Functional Theory Page 6

copy Roi Baer

d) The Born-Oppenheimer theory

iii The adiabatic theorem

Suppose a Hamiltonian is dependent parametrically on 119877 = (119877 119877 hellip ) We

write this as [119877] Denote the eigenstates of this Hamiltonian by

[119877]120595 [119877] = 119864 [119877]120595 [119877]

Now suppose we change 119877 with time so that we have a trajectory 119877(119905) The

Hamiltonian will become time dependent (119905) = [119877(119905)] Suppose the sys-

tem is placed in its ground state at time = 0 120601(119905 = 0) = 120595 [119877(0)] It will

evolve according to the TDSE

119894ℏ(119905) = (119905)120601(119905)

Now the adiabatic theorem says that if 119877(119905) changes very slowly then

120601(119905) = 119890 ( )120595 [119877(119905)]

Namely except for a purely TD phase the evolving state stays the instanta-

neous ground state of the Hamiltonian (119905)

To show this we use the instantaneous eigenstates of the Hamiltonian (119905) to

expand 120601(119905)

120601(119905) = sum119886 (119905)119890

ℏint ( ) 120595 [119877(119905)]

We want to plug this expansion into the SE So let us calculate

119894ℏ(119905) = sum[119894ℏ + 119864 119886 ]119890

ℏint ( ) 120595 [119877(119905)]

+ sum119886 119890

ℏint ( ) 119894ℏ [119877(119905)]

And

Electron Density Functional Theory Page 7

copy Roi Baer

(119905)120601 = sum119886 119890

ℏint ( ) 119864 120595 [119877(119905)]

Equating the two expressions gives

0 = sum 119894ℏ 119890

ℏint ( ) 120595 [119877(119905)]

+ sum119886 119890

ℏint ( ) 119894ℏ [119877(119905)]

Now let us multiply by 120595 and integrate (remember that ⟨120595 |120595 ⟩ = 120575 )

0 = 119894ℏ 119890

ℏint ( ) + sum119886 119890

ℏint ( ) 119894ℏ⟨120595 | ⟩

Now we have

0 =119889

119889119905⟨120595 |120595 ⟩ = ⟨120595 | ⟩ + ⟨ |120595 ⟩

And since ⟨ |120595 ⟩ = ⟨120595 | ⟩ we find

120591 = ⟨120595 | ⟩ = minus⟨120595 | ⟩ = minus120591

The matrix 120591 = ⟨120595 | ⟩ is the matrix of ldquotime-dependent non-adiabatic

couplings (TDNACs)rdquo and it is an anti-Hermitean matrix Furthermore for

119899 = 119898 we see

119877119890⟨120595 | ⟩ = 0

Thus

= minus120591 119886 minus sum 119890 int ( ) 120591 119886

(Id1)

We see that the TDNACS 120591 create the non-adiabatic transitions ie the

transitions out of the ground state

Let us play a bit with 120591 Take the time derivative of the TISE

Electron Density Functional Theory Page 8

copy Roi Baer

(119905)120595 (119905) + (119905) (119905) = (119905)120595 (119905) + 119864 (119905) (119905)

Multiply by ⟨120595 | and get

⟨120595 || 120595 ⟩ + ⟨120595 || ⟩ = (119905)⟨120595 |120595 ⟩ + 119864 ⟨120595 | ⟩

Since ⟨120595 || ⟩ = 119864 ⟨120595 | ⟩ we have

⟨120595 || 120595 ⟩ = (119905)⟨120595 |120595 ⟩ + (119864 minus 119864 )⟨120595 | ⟩

If 119899 = 119898 then

(119905) = ⟨120595 || 120595 ⟩

This is called the Hellmann-Feynman theorem showing that the power is the

expectation value of the rate of change of the Hamiltonian If 119899 ne 119898

120591 = ⟨120595 | ⟩ =⟨120595 || 120595 ⟩

119864 minus 119864 (Id2)

This is called Epsteinrsquos theorem giving an expression for the TDNACs Re-

member that [119877(119905)] depends on 119905 through the positions of the nuclei Thus

[119877(119905)] =119889

119889119905[119877(119905)] =

part

partR [119877(119905)]

Thus

120591 = sum⟨120595 |

[119877(119905)]| 120595 ⟩

119864 [119877] minus 119864 [119877] (119905)

= sum120591 (119905)

Where the time-independent NACS are defined as

120591 [119877] = ⟨120595 |

120597

120597119877 120595 ⟩ =

⟨120595 [119877] |

[119877]| 120595 [119877]⟩

119864 [119877] minus 119864 [119877] (Id3)

Electron Density Functional Theory Page 9

copy Roi Baer

We see that the TDNACs depend on the velocity of the nuclei When nuclei

are small the couplings out of the ground state are small The 120591 are called

the ldquonon-adiabatic couplings (NACs)rdquo We find that as long as 119864 minus 119864 are not

zero on the trajectory one can always find small enough that makes the

TDNACs as small as we wish All we need for states 119899 and 119898 to stay decou-

pled is for the following conditions to be met

|120591 | = | (119905)120591 (119877 (119905))| ≪ 1

Thus we can make the NACs as small as we wish by taking the rate (119905) to

be small enough So all that is left in Eq lrm(Id1) is = minus120591 119886 Define the

non-dynamical phase as 120579 (119905) = 119894 int 120591 (119905 ) 119889119905

(note 120579 is real) and then

119886 (119905) = 119890 ( ) (119904119897119900119908 119901119903119900119888119890119904119904119890119904 = 119886119889119894119886119887119886119905119894119888 119901119903119900119888119890119904119904)

The total state is then

120601(119905) = 119886 (119905)119890

ℏint ( ) 120595 [119877(119905)] = 119890 ( ( ) ( ))120595 [119877(119905)]

where 120579 (119905) = minus

ℏint 119864 (120591)119889120591

is called the dynamical phase It is easy to prove

that if 119877(119905) traverses a closed loop the non-dynamical phase depends only on

that loop and not on the way it is traversed For example if we traverse the

same loop using different velocities the dynamic phase may change but the

geometric phase will not The closed loop geometric angle is called the Berry

phase The independence of path is a result of the fact that the non-dynamical

phase is a line integral and can be made with no reference to time

Electron Density Functional Theory Page 10

copy Roi Baer

120579 (119905) = 119894 int ⟨120595 (119877[120591])| (119877[120591])⟩119889120591

= 119894 int sum⟨120595 [119877]|120597

120597119877 120595 [119877]⟩

119889120591

= 119894 int sum120591 (119877)

119889119877

( )

( )

The dynamical phase for example is not a line integral and its value depends

not only on the path itself but also on the velocity taken along the way This

observation makes the non-dynamical phase a special quantity Berry has

shown that this quantity can give us information on the way the Hamiltonian

is dependent on its parameters For a real Hamiltonian for example 119890

around a closed path always equals either 1 or -1 If there is an even number

of degeneracies enclosed by the path it is 1 and if an odd number ndash it is -1

iv Motivation for the Born-Oppenheimer approximation classical nuclei

In a molecule we can think of the nuclei (having coordinates 119877) as heavy and

therefore classical particles which are slowly moving The electrons are light

and therefore quantum They have coordinates 119903 (119903 includes spin) and they

119877

119877

A path in parameter

ldquospacerdquo 119877 minus 119877

Electron Density Functional Theory Page 11

copy Roi Baer

feel the effect of slowly moving nuclei The electron dynamics is controlled by

the Hamiltonian

[119877] = + 119881(119903 119877)

The potential 119881(119903 119877) describes the Coulombic interactions between electrons

and nuclei (in atomic units)

119881(119903 119877) = minussum119885

| minus 119903 |

+1

2sum

1

119903

+1

2sum

119885 119885

119877

In general one wants to assume that the total energy of the molecule in this

classical approximation is

119864 = 119879 + ⟨120595 [119877]| [119877(119905)]|120595 [119877]⟩

Where 120595 [119877] is the ground state of the electrons at nuclear configuration 119877

The adiabatic theorem states that this is reasonable as long as nuclei move

slowly Thus the adiabatic theorem allows us to write

119864 = 119879 + 119881 (119877)

where 119881 (119877) is the ground state energy of the electrons at nuclear configura-

tion 119877 This energy is a usual classical mechanical energy and the Newton

equations of motion apply

119872 = 119865 = minus120597

120597119877 119881 (119877)

We see that the adiabatic theorem allows us to consider the nuclei as moving

in a potential well which is essentially the eigenvalues of the electrons This in

essence is the BO approximation when the nuclei are classical

Electron Density Functional Theory Page 12

copy Roi Baer

v The Born-Oppenheimer approximation in quantum nuclear case

The classical approach motivates a quantum treatment We are expecting that

nuclei will not excite electrons very efficiently That is the motivation for the

BO approximation

The Born and Oppenheimer development is similar to that of the adiabatic

theorem although there are no ldquoexternalrdquo fields Suppose we have a system of

fast particles with coordinates 119903 = (119903 119903 hellip ) and slow particles with coordi-

nates 119877 = (119877 119877 hellip ) The Hamiltonian can be written as

= + + 119881(119903 119877)

The Schroumldinger equation is

120595 (119903 119877) = 119864 120595 (119903 119877) (Id4)

Note that we can assume these wave functions are orthogonal

⟨120595 |120595 ⟩ = intint120595 (119903 119877)120595 (119903 119877)119889119903 119889119877 = 120575

Now to proceed let us first consider the fast r-Hamiltonian

[119877] = + 119881(119903 119877) (Id5)

In this ldquofast Hamiltonianrdquo [119877] the slow variables are simply parameters (it

contains no derivatives with respect to 119877) Thus it depends on 119877 parametrical-

ly The fast eigenvalue problem is

[119877]120601 (119903 119877) = 119882 (119877)120601 (119903 119877)

The eigenvalues are functions of the parameters 119877 They are called the adia-

batic (or Born-Oppenheimer) surfaces The notation with the semicolon be-

tween 119903 and 119877 is designed to emphasize that 120601 are wave functions in 119903 but

they depend only parametrically on 119877 This means for example that the over-

lap of the fast eigenstates (called adiabatic states) involves integration only

Electron Density Functional Theory Page 13

copy Roi Baer

over 119903 the dynamical coordinates while 119877 is held fixed since it is only a pa-

rameter

⟨120601 (119877)|120601 (119877)⟩ = int120601 (119903 119877)120601 (119903 119877)119889119903 = 120575

We cannot in fact say anything really meaningful about ⟨120601 (119877)|120601 (119877 )⟩ when

119877 ne 119877 (except when 119877 minus 119877 is infinitesimal but we will not go into this issue

here) Now we can expand the ldquorealrdquo wave-function as a linear combination

of the adiabatic functions

120595 (119903 119877) = sum120601 (119903 119877)Φ (119877)

(Id6)

We can do this because for any give 119877 120601 (119903 119877) span the space of wave func-

tions dependent on 119903 In fact the expansion coefficients Φ (119877) are given by

Φ (119877) = int120601 (119903 119877)120595 (119903 119877)119889119903

Now let us plug Eq lrm(Id6) into the SE lrm(Id4)

119864 120595 (119903 119877) = ( + )120595 (119903 119877)

= sum [120601 (119903 119877)Φ (119877)]

+ sum119882 (119877)120601 (119903 119877)Φ (119877)

(Id7)

Note that since = sum

where = minus119894ℏ

We have

[120601 (119903 119877)Φ (119877)]

= Φ (119877) [120601 (119903 119877)]

+ sum1

119872 ( 120601 (119903 119877)) ( Φ (119877))

+ 120601 (119903 119877) Φ (119877)

Multiplying Eq lrm(Id7) by 120601 (119903 119877) and integrating over 119903 gives

Electron Density Functional Theory Page 14

copy Roi Baer

sum(A + )Φ (119877)

+ ( + 119882 (119877))Φ (119877) = 119864 Φ (119877) (Id8)

Where

A = int120601 (119903 119877) [ 120601 (119903 119877)]119889119903 = ⟨120601 | 120601 ⟩

= sum

int120601 (119903 119877) [ 120601 (119903 119877)]119889119903

119872

= sum⟨120601 | 120601 ⟩

119872

= minussumℏ

119872 ⟨120601 |

120597

120597119877 120601 ⟩

120597

120597119877

= minussumℏ

119872 120591

120597

120597119877

The matrices 120591 = ⟨120601 |

120601 ⟩ are the non-adiabatic couplings in the ldquofastrdquo

system These are exactly the non-adiabatic coefficients in the adiabatic theory

(Eq lrm(Id3))

It is possible to show that

sum(A + )Φ (119877)

+ Φ (119877)

= sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

Thus Eq lrm(Id8) becomes

sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

This is a Schroumldinger ndashlike equation which determines the coefficients Φ (119877)

These are called the ldquoslow eigenfunctionsrdquo Once they are computed one has

an exact solution to the Schroumldinger equation However we do not really

want to solve this infinite set of coupled differential equations Thus we as-

sume that the quantities 120591 for 119895 ne 119896 can be neglected Note that here there is

no which can be taken as small as needed to make the effect of 120591 as mi-

Electron Density Functional Theory Page 15

copy Roi Baer

nute as we need Still we can hope that the fact that we chose 119877 to be slow de-

grees of freedom allow us to make just this approximation This was the idea

of Born and Oppenheimer who neglected 120591

summinusℏ

2119872

120597

120597119877 Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

The resulting equation is a Schroumldinger equation for the slow degrees of free-

dom which move under the force of electrons derived from a potential 119882 (119877)

When applied to molecules the slow degrees of freedom are usually the nuclei

and the fast - the electrons There is a problem with neglecting 120591 because of

their non-dynamical effect Taking them into account results in treating them

as a magnetic field

sumℏ

2119872 (119894

120597

120597119877 minus 120591 )

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877) (Id9)

The BO approximation breaks the molecular SE into two consecutive tasks

First the electronic SE Eq lrm(Id5) must be solved for any relevant clamped po-

sition of nuclei 119877 Then the nuclear equation lrm(Id9)

Further reading on this subject M Baer Beyond Born-Oppenheimer elec-

tronic non-adiabatic coupling terms and conical intersections (Wiley Hobo-

ken NJ 2006)

e) Electron correlation

vi The electronic wave function of two non-interacting electrons

We have stressed that the electronic structure problem is difficult because of

the complexity of the many-body wave function when electrons interact In

order to appreciate this complexity of the electronic wave function let us first

study a simple system of two non-interacting electrons in a 1D atomic well

Electron Density Functional Theory Page 16

copy Roi Baer

We consider an atomic well given by the potential 119907 (119909) and we place in it

an electron The Hamiltonian is

ℎ = minusℏ

2119898 nabla + 119907 (119909)

(Ie1)

Now consider a 2-electron problem Assume we have two electrons Fermi-

ons which are non-interacting in the well The Hamiltonian is

= ℎ(1) + ℎ(2) (Ie2)

The notation ℎ(119894) means the Hamiltonian of Eq lrm(Ie1) with the coordinate of

the i-th electron

What are the eigenstates in this case First since each electron can have a

spin we must decide on the spin of the state For now let us assume the state

is spin-polarized ie that the total spine is 1 both electrons are in spin-up or-

bitals We try the following form as a wave function

Ψ(119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]120572(1)120572(2) (Ie3)

Notice that the spatial part is anti-symmetric while the spin part is symmetric

This renders the entire wave-function anti-symmetric in accordance with the

Pauli principle The notation 120572(1)120572(2) means both electrons have spin projec-

tion up (120572) We do not yet know if and under what conditions this wave func-

tion can actually describe eigenstates of the two electrons the

an lrm(Ie2) We assume that 120595 (119909) and 120595 (119909) are orthonormal This makes the

state normalized since

Electron Density Functional Theory Page 17

copy Roi Baer

intΨ(119909 119909 ) 119889119909 119889119909 =

1

2int(120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 ))

119889119909 119889119909

=1

2int [(120595 (119909 )120595 (119909 ))

+ (120595 (119909 )120595 (119909 ))

minus 2120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 )] 119889119909 119889119909

(Ie4)

The first and second terms are both equal to

int(120595 (119909 )120595 (119909 )) 119889119909 119889119909 =int120595 (119909 )

119889119909 int120595 (119909 ) 119889119909 =1 times 1

= 1 (Ie5)

The third term in lrm(Ie4) is

int120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 ) 119889119909 119889119909

=int120595 (119909 )120595 (119909 ) 119889119909 int120595 (119909 )120595 (119909 ) 119889119909 = 0 times 0 = 0 (Ie6)

Indeed the entire wave function is orthonormal (thanks to the factor 1radic2

in lrm(Ie3))

Now let us see under what condition the wave function in Eq (Ie3) is an ei-

genstate of the Hamiltonian in lrm(Ie2)

Ψ(119909 119909 ) = (ℎ(1) + ℎ(2))Ψ(119909 119909 )

=1

radic2120595 (119909 )ℎ(1)120595 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

minus1

radic2120595 (119909 )ℎ(1)120601 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

(Ie7)

This should be equated to

119864Ψ(119909 119909 ) =1

radic2119864120595 (119909 )120595 (119909 ) minus 119864120595 (119909 )120595 (119909 ) (Ie8)

If we choose the orbitals 120595 (119909) and 120595 (119909) to be eigenstates of h (which are

orthogonal so that is compatible with our previous assumption)

Electron Density Functional Theory Page 18

copy Roi Baer

ℎ120595 (119909) = 120598 120595 (119909) 119894 = 01 hellip (Ie9)

Thus

Ψ(119909 119909 ) =1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

minus1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

= (120598 + 120598 )Ψ(119909 119909 )

(Ie10)

And we see that indeed Ψ(x x ) is an eigenstate with energy E = ϵ + ϵ

vii Correlation in action a wave function of 2 interacting particles in an

Harmonic trap

We now build a simple electronic structure model that will allow us to study

in some detail the most basic concepts For this we suppose that the electrons

are in a harmonic atom that is the potential well

119907 (119909) =1

2120583120596 119909 (Ie11)

The two lowest eigenstates of a Harmonic oscillator are

120595 (119909) = 119873 119890

120595 (119909) = 119873 radic120583120596

ℏ119909119890

(Ie12)

The normalization constants are

119873 = (120583120596

120587ℏ)

119873 = (4120583120596

120587ℏ)

(Ie13)

And the eigenvalues are

120598 = (119899 +1

2)ℏ120596 119899 = 01 hellip (Ie14)

The groundstate energy of the 2-electron system in the triplet state describes

one electron in 120595 (119909) and another in 120595 (119909)

Electron Density Functional Theory Page 19

copy Roi Baer

119864 = 120598 + 120598 = 2ℏ120596 (Ie15)

As discussed above singlet and triplet two-electron ground state wave func-

tions composed of two orbitals must be space-symmetric or antisymmetric

respectively We consider below 3 wave functions The first Ψ is the ground

state singlet where both electrons are in 120595 The second and third are a singlet

and a triplet made from one electron in 120595 and the other in 120595

Ψ (119909 119909 ) = 120595 (119909 )120595 (119909 ) = 119873 119890

ℏ(

)

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) + 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 + 119909 )

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 minus 119909 )

(Ie16)

The normalization factors are

119873 = 119873 = radic

120583120596

120587ℏ 119873 =

1

radic2radic120583120596

ℏ119873 119873 =

120583120596

120587ℏ

(Ie17)

Eq lrm(Ie16) describes the distribution of positions of both electrons in their

corresponding states We now want to ask how much are the electrons in this

state aware of each other Do they correlate their motion in some way One

way to measure correlation is to consider the quantity lang119909 119909 rang minus lang119909 ranglang119909 rang If

electrons are completely unaware of each other this quantity called the posi-

tion-autocorrelation function is zero because then the average of the product

of their position must decompose to the product of the average Any deviance

from zero indicates some degree of correlation For the triplet wave function

in Eq lrm(Ie16) we find

Electron Density Functional Theory Page 20

copy Roi Baer

lang119909 rang = ⟨Ψ |119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩)

=1

2(⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩) = 0

(Ie18)

Of-course the same result would be obtained if we calculate ⟨119909 ⟩ because the

electrons are equivalent Furthermore

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩)

= minus|⟨120595 |119909|120595 ⟩| = minus

1

2

120583120596

(Ie19)

We thus find that

lang119909 119909 rang minus lang119909 ranglang119909 rang = minus1

2

120583120596

(Ie20)

The position autocorrelation functional is non-zero even though the electrons

are non-interacting The correlation is due to the Pauli principle It is negative

because the Pauli principle pushes the electrons apart and since most of the

time they both want to be near the origin this gives preference to them being

placed on opposite sides (when one electron has positive x coordinate the

other has negative and vice versa) Letrsquos see what happens in the singlet wave

function Ψ Here too lang119909 rang = lang119909 rang = 0 Then

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ = ⟨120595 120595 |119909 119909 |120595 120595 ⟩

= ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ = 0 (Ie21)

Electron Density Functional Theory Page 21

copy Roi Baer

The singlet ground state shows no correlation because the Pauli principle is

applied here to the spins not the spatial part of the wave function Note how-

ever this does not mean that all singlet wave functions of 2 electrons have no

correlation Indeed this is the situation in Ψ where the development is

very similar to Eq lrm(Ie19) except that the minus sign is now a plus sign so

lang119909 119909 rang minus lang119909 ranglang119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ minus 0

=1

2⟨120595 120595 + 120595 120595 |119909 119909 |120595 120595 + 120595 120595 ⟩ = |⟨120595 |119909|120595 ⟩|

=1

2

120583120596

(Ie22)

Here we find positive autocorrelation indicative of the fact that spin-opposite

non-interacting pairs of electrons want to stick together (when one has posi-

tive x the other wants to have as well and when one has negative the other

one wants negative as well) like non-interacting bosons

Since there is no interaction between the electrons the correlation in these

wave functions arises only from the Pauli principle ie because we impose

the fact that electrons are Fermions This is called Fermi correlation Our les-

son is this

1) Wavefunctions that are mere products of singe-particle orbitals have no correlation 2) If the products are symmetrized like in the case of the excited singlet the correlation

lang119909 119909 rang is positive indicating that the particles ldquolike to be togetherrdquo ie both on the right or both on the left of the origin

3) If the products are anti-symmetrized like in the case of the triplet the correlation lang119909 119909 rang is negative indicating that the particles ldquolike to be separatedrdquo

Up to now we assumed no e-e interaction So now letrsquos include it and add to

the Hamiltonian an interaction term

= ℎ(1) + ℎ(2) + 119881(119909 119909 ) (Ie23)

Electron Density Functional Theory Page 22

copy Roi Baer

Let us take one case of coupling which is simple enough to yield to analytical

analysis

119881(119909 119909 ) = 120583120574 119909 119909 (Ie24)

With 120574 lt 120596 In this case the Hamiltonian is

=119901

2120583+

119901

2120583+

1

2120583120596 119909

+1

2120583120596 119909

+ 120583120574 119909 119909 (Ie25)

The interaction seems strange at first sight because it does not seem to depend

on the distance between the particles as we are used to from electrostatics

yet it does describe a repulsion We can see this by writing the Hamiltonian

as

=119901

2120583+

119901

2120583+

1

2120583(120596 + 120574 )119909

+1

2120583(120596 + 120574 )119909

minus1

2120583120574 (119909 minus 119909 )

(Ie26)

We now see that the Hamiltonian describes two particles in a Harmonic po-

tential 119881 (119909) =

120583(120596 + 120574 )119909 interacting by a Harmonic potential

minus

120583120574 (119909 minus 119909 )

The potential drops as |119909 minus 119909 | grows ie it is repulsive

Exercise lrmI-1

Find the eigenvalues and eigenfunctions of this Hamiltonian (You can define

new coordinates 119883 and 119909 by

radic = 119909 and

radic = 119909 )

Exercise lrmI-2

Write the ground-state energy and wave function for the triplet state of the

system in the previous exercise Determine the effect of interaction on the en-

ergy by calculating 119903 =

and on the correlation function 119888 = lang119909 119909 rang minus

lang119909 ranglang119909 rang

Solution The result is shown in Figure lrmI-2 (left panel)

Electron Density Functional Theory Page 23

copy Roi Baer

Figure lrmI-2 (Left panel) The ratio = (Right panel) the correlation ratio ( ) (

)

vs interaction strengths where = (

)

f) The electron density is a much simpler object

than the wave function

The complexity of the electronic wave function is overwhelming It includes

all the information we can have on the electrons in a molecule in a certain

state However all these intricacies and details are often uninteresting for us

in many cases we simply have no direct use for them Take for instance the

electronic energy - our primary target in the Born-Oppenheimer picture It

depends only on the relative distance of pairs of particles This is because the

electron-electron Coulomb repulsion is a pair-wise interaction

One interesting quantity besides energy that can be extracted from the elec-

tronic wave function is the electron density 119899(119955) This 3D function tells us the

expectation value of the density of electrons That is 119899(119955)119889 119903 is the expectation

value of the number of electrons at a small volume 119889 119903 around point r Thus

we can write

119899(119955) = ⟨120595|(119955)|120595⟩ (If1)

We use the notation

00 01 02 03 04 05

04

05

06

07

08

09

10

r

00 01 02 03 04 050

5

10

15

cc

2

Electron Density Functional Theory Page 24

copy Roi Baer

⟨120595|120601⟩ equiv int120595 (119955 119955 hellip 119955 )120601(119955 119955 hellip 119955

)119889 119903 119889 119903 hellip119889 119903

(If2)

Here (119955) is the operator corresponding to the electron number density Since

electrons are point particles and the position operator for the ith electron is 119955

this operator is defined by

(119955) = sum120575( minus 119955)

(If3)

We used the definition of a 120575-function according to which

int120575(119955 minus 119955)119891(119955 )119889 119903 = 119891(119955)

(If4)

The function 120575( minus 119955) is the density of electron 119894 at 119955

Exercise lrmI-3

Calculate the expectation value of the electron number density and show that

119899(119955) = 119873 int120595(119955 119955 hellip 119955 )120595(119955 119955 hellip 119955

)119889 119903 hellip119889 119903

(If5)

Looking at Eq lrm(If5) we see that 119899(119955) involves integrating out a huge amount

of wave-function details It is as if only the data concerning the density distri-

bution of a single electron remains This is multiplied by 119873 in Eq lrm(If5) so

119899(119955) accounts for the combined density of all electrons Indeed integrating

over the entire space one obtains from lrm(If5)

int119899(119955)119889 119903 = 119873 (If6)

Expressing the fact that the total number of electrons is 119873

Exercise lrmI-4

Calculate and plot the 1D electron density of the triplet ground state from Ex-

ercise lrmI-2 as a function of the interaction parameter

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 5: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 5

copy Roi Baer

discuss in this course ways of mapping the electrons in a molecule onto a sys-

tem of non-interacting particles In such a picture each electron has its own

orbital or wave-function Generally the orbitals of different electrons in mole-

cules strongly overlap Furthermore most of the valence orbitals are widely

spread over space encapsulating many molecular nuclei Nuclei on the other

hand are massive so their wave functions are strongly localized and hardly

overlap once again they behave much more like classical particles

Describing electronic structure is therefore a quantum mechanical problem

Now the electrons interact via the attractive Coulomb force not only with

the stationary nuclei but also with each other each electron repels each other

electron via the repulsive Coulomb force This makes the electronic structure

problem a many-body problem

Quantum many-body problems are very difficult to track down much more

difficult than classical mechanical ones The reason is that for 119873 electrons the

many-body wave function is then a function of 3119873 variables 120595(119955 hellip 119955 )

where ir (i=1hellip eN ) is 3 dimensional the position vector Highly accurate nu-

merical representation of such functions is next to impossible for 119873 gt 1

Thus electronic structure methods invariably include approximate methods

Typically the wave function encapsulates much more information than we

care to know about For example int|120595(119955 hellip 119955 )| 119889 119903 sdotsdotsdot 119889 119903 gives the proba-

bility for finding an electron at 119955 and an electron at 119955 and an electron at 119955

etc Indeed the total probability is 1

int|120595(119955 hellip 119955 )|

119889 119903 hellip119889 119903

= 1 (Ic1)

Electron Density Functional Theory Page 6

copy Roi Baer

d) The Born-Oppenheimer theory

iii The adiabatic theorem

Suppose a Hamiltonian is dependent parametrically on 119877 = (119877 119877 hellip ) We

write this as [119877] Denote the eigenstates of this Hamiltonian by

[119877]120595 [119877] = 119864 [119877]120595 [119877]

Now suppose we change 119877 with time so that we have a trajectory 119877(119905) The

Hamiltonian will become time dependent (119905) = [119877(119905)] Suppose the sys-

tem is placed in its ground state at time = 0 120601(119905 = 0) = 120595 [119877(0)] It will

evolve according to the TDSE

119894ℏ(119905) = (119905)120601(119905)

Now the adiabatic theorem says that if 119877(119905) changes very slowly then

120601(119905) = 119890 ( )120595 [119877(119905)]

Namely except for a purely TD phase the evolving state stays the instanta-

neous ground state of the Hamiltonian (119905)

To show this we use the instantaneous eigenstates of the Hamiltonian (119905) to

expand 120601(119905)

120601(119905) = sum119886 (119905)119890

ℏint ( ) 120595 [119877(119905)]

We want to plug this expansion into the SE So let us calculate

119894ℏ(119905) = sum[119894ℏ + 119864 119886 ]119890

ℏint ( ) 120595 [119877(119905)]

+ sum119886 119890

ℏint ( ) 119894ℏ [119877(119905)]

And

Electron Density Functional Theory Page 7

copy Roi Baer

(119905)120601 = sum119886 119890

ℏint ( ) 119864 120595 [119877(119905)]

Equating the two expressions gives

0 = sum 119894ℏ 119890

ℏint ( ) 120595 [119877(119905)]

+ sum119886 119890

ℏint ( ) 119894ℏ [119877(119905)]

Now let us multiply by 120595 and integrate (remember that ⟨120595 |120595 ⟩ = 120575 )

0 = 119894ℏ 119890

ℏint ( ) + sum119886 119890

ℏint ( ) 119894ℏ⟨120595 | ⟩

Now we have

0 =119889

119889119905⟨120595 |120595 ⟩ = ⟨120595 | ⟩ + ⟨ |120595 ⟩

And since ⟨ |120595 ⟩ = ⟨120595 | ⟩ we find

120591 = ⟨120595 | ⟩ = minus⟨120595 | ⟩ = minus120591

The matrix 120591 = ⟨120595 | ⟩ is the matrix of ldquotime-dependent non-adiabatic

couplings (TDNACs)rdquo and it is an anti-Hermitean matrix Furthermore for

119899 = 119898 we see

119877119890⟨120595 | ⟩ = 0

Thus

= minus120591 119886 minus sum 119890 int ( ) 120591 119886

(Id1)

We see that the TDNACS 120591 create the non-adiabatic transitions ie the

transitions out of the ground state

Let us play a bit with 120591 Take the time derivative of the TISE

Electron Density Functional Theory Page 8

copy Roi Baer

(119905)120595 (119905) + (119905) (119905) = (119905)120595 (119905) + 119864 (119905) (119905)

Multiply by ⟨120595 | and get

⟨120595 || 120595 ⟩ + ⟨120595 || ⟩ = (119905)⟨120595 |120595 ⟩ + 119864 ⟨120595 | ⟩

Since ⟨120595 || ⟩ = 119864 ⟨120595 | ⟩ we have

⟨120595 || 120595 ⟩ = (119905)⟨120595 |120595 ⟩ + (119864 minus 119864 )⟨120595 | ⟩

If 119899 = 119898 then

(119905) = ⟨120595 || 120595 ⟩

This is called the Hellmann-Feynman theorem showing that the power is the

expectation value of the rate of change of the Hamiltonian If 119899 ne 119898

120591 = ⟨120595 | ⟩ =⟨120595 || 120595 ⟩

119864 minus 119864 (Id2)

This is called Epsteinrsquos theorem giving an expression for the TDNACs Re-

member that [119877(119905)] depends on 119905 through the positions of the nuclei Thus

[119877(119905)] =119889

119889119905[119877(119905)] =

part

partR [119877(119905)]

Thus

120591 = sum⟨120595 |

[119877(119905)]| 120595 ⟩

119864 [119877] minus 119864 [119877] (119905)

= sum120591 (119905)

Where the time-independent NACS are defined as

120591 [119877] = ⟨120595 |

120597

120597119877 120595 ⟩ =

⟨120595 [119877] |

[119877]| 120595 [119877]⟩

119864 [119877] minus 119864 [119877] (Id3)

Electron Density Functional Theory Page 9

copy Roi Baer

We see that the TDNACs depend on the velocity of the nuclei When nuclei

are small the couplings out of the ground state are small The 120591 are called

the ldquonon-adiabatic couplings (NACs)rdquo We find that as long as 119864 minus 119864 are not

zero on the trajectory one can always find small enough that makes the

TDNACs as small as we wish All we need for states 119899 and 119898 to stay decou-

pled is for the following conditions to be met

|120591 | = | (119905)120591 (119877 (119905))| ≪ 1

Thus we can make the NACs as small as we wish by taking the rate (119905) to

be small enough So all that is left in Eq lrm(Id1) is = minus120591 119886 Define the

non-dynamical phase as 120579 (119905) = 119894 int 120591 (119905 ) 119889119905

(note 120579 is real) and then

119886 (119905) = 119890 ( ) (119904119897119900119908 119901119903119900119888119890119904119904119890119904 = 119886119889119894119886119887119886119905119894119888 119901119903119900119888119890119904119904)

The total state is then

120601(119905) = 119886 (119905)119890

ℏint ( ) 120595 [119877(119905)] = 119890 ( ( ) ( ))120595 [119877(119905)]

where 120579 (119905) = minus

ℏint 119864 (120591)119889120591

is called the dynamical phase It is easy to prove

that if 119877(119905) traverses a closed loop the non-dynamical phase depends only on

that loop and not on the way it is traversed For example if we traverse the

same loop using different velocities the dynamic phase may change but the

geometric phase will not The closed loop geometric angle is called the Berry

phase The independence of path is a result of the fact that the non-dynamical

phase is a line integral and can be made with no reference to time

Electron Density Functional Theory Page 10

copy Roi Baer

120579 (119905) = 119894 int ⟨120595 (119877[120591])| (119877[120591])⟩119889120591

= 119894 int sum⟨120595 [119877]|120597

120597119877 120595 [119877]⟩

119889120591

= 119894 int sum120591 (119877)

119889119877

( )

( )

The dynamical phase for example is not a line integral and its value depends

not only on the path itself but also on the velocity taken along the way This

observation makes the non-dynamical phase a special quantity Berry has

shown that this quantity can give us information on the way the Hamiltonian

is dependent on its parameters For a real Hamiltonian for example 119890

around a closed path always equals either 1 or -1 If there is an even number

of degeneracies enclosed by the path it is 1 and if an odd number ndash it is -1

iv Motivation for the Born-Oppenheimer approximation classical nuclei

In a molecule we can think of the nuclei (having coordinates 119877) as heavy and

therefore classical particles which are slowly moving The electrons are light

and therefore quantum They have coordinates 119903 (119903 includes spin) and they

119877

119877

A path in parameter

ldquospacerdquo 119877 minus 119877

Electron Density Functional Theory Page 11

copy Roi Baer

feel the effect of slowly moving nuclei The electron dynamics is controlled by

the Hamiltonian

[119877] = + 119881(119903 119877)

The potential 119881(119903 119877) describes the Coulombic interactions between electrons

and nuclei (in atomic units)

119881(119903 119877) = minussum119885

| minus 119903 |

+1

2sum

1

119903

+1

2sum

119885 119885

119877

In general one wants to assume that the total energy of the molecule in this

classical approximation is

119864 = 119879 + ⟨120595 [119877]| [119877(119905)]|120595 [119877]⟩

Where 120595 [119877] is the ground state of the electrons at nuclear configuration 119877

The adiabatic theorem states that this is reasonable as long as nuclei move

slowly Thus the adiabatic theorem allows us to write

119864 = 119879 + 119881 (119877)

where 119881 (119877) is the ground state energy of the electrons at nuclear configura-

tion 119877 This energy is a usual classical mechanical energy and the Newton

equations of motion apply

119872 = 119865 = minus120597

120597119877 119881 (119877)

We see that the adiabatic theorem allows us to consider the nuclei as moving

in a potential well which is essentially the eigenvalues of the electrons This in

essence is the BO approximation when the nuclei are classical

Electron Density Functional Theory Page 12

copy Roi Baer

v The Born-Oppenheimer approximation in quantum nuclear case

The classical approach motivates a quantum treatment We are expecting that

nuclei will not excite electrons very efficiently That is the motivation for the

BO approximation

The Born and Oppenheimer development is similar to that of the adiabatic

theorem although there are no ldquoexternalrdquo fields Suppose we have a system of

fast particles with coordinates 119903 = (119903 119903 hellip ) and slow particles with coordi-

nates 119877 = (119877 119877 hellip ) The Hamiltonian can be written as

= + + 119881(119903 119877)

The Schroumldinger equation is

120595 (119903 119877) = 119864 120595 (119903 119877) (Id4)

Note that we can assume these wave functions are orthogonal

⟨120595 |120595 ⟩ = intint120595 (119903 119877)120595 (119903 119877)119889119903 119889119877 = 120575

Now to proceed let us first consider the fast r-Hamiltonian

[119877] = + 119881(119903 119877) (Id5)

In this ldquofast Hamiltonianrdquo [119877] the slow variables are simply parameters (it

contains no derivatives with respect to 119877) Thus it depends on 119877 parametrical-

ly The fast eigenvalue problem is

[119877]120601 (119903 119877) = 119882 (119877)120601 (119903 119877)

The eigenvalues are functions of the parameters 119877 They are called the adia-

batic (or Born-Oppenheimer) surfaces The notation with the semicolon be-

tween 119903 and 119877 is designed to emphasize that 120601 are wave functions in 119903 but

they depend only parametrically on 119877 This means for example that the over-

lap of the fast eigenstates (called adiabatic states) involves integration only

Electron Density Functional Theory Page 13

copy Roi Baer

over 119903 the dynamical coordinates while 119877 is held fixed since it is only a pa-

rameter

⟨120601 (119877)|120601 (119877)⟩ = int120601 (119903 119877)120601 (119903 119877)119889119903 = 120575

We cannot in fact say anything really meaningful about ⟨120601 (119877)|120601 (119877 )⟩ when

119877 ne 119877 (except when 119877 minus 119877 is infinitesimal but we will not go into this issue

here) Now we can expand the ldquorealrdquo wave-function as a linear combination

of the adiabatic functions

120595 (119903 119877) = sum120601 (119903 119877)Φ (119877)

(Id6)

We can do this because for any give 119877 120601 (119903 119877) span the space of wave func-

tions dependent on 119903 In fact the expansion coefficients Φ (119877) are given by

Φ (119877) = int120601 (119903 119877)120595 (119903 119877)119889119903

Now let us plug Eq lrm(Id6) into the SE lrm(Id4)

119864 120595 (119903 119877) = ( + )120595 (119903 119877)

= sum [120601 (119903 119877)Φ (119877)]

+ sum119882 (119877)120601 (119903 119877)Φ (119877)

(Id7)

Note that since = sum

where = minus119894ℏ

We have

[120601 (119903 119877)Φ (119877)]

= Φ (119877) [120601 (119903 119877)]

+ sum1

119872 ( 120601 (119903 119877)) ( Φ (119877))

+ 120601 (119903 119877) Φ (119877)

Multiplying Eq lrm(Id7) by 120601 (119903 119877) and integrating over 119903 gives

Electron Density Functional Theory Page 14

copy Roi Baer

sum(A + )Φ (119877)

+ ( + 119882 (119877))Φ (119877) = 119864 Φ (119877) (Id8)

Where

A = int120601 (119903 119877) [ 120601 (119903 119877)]119889119903 = ⟨120601 | 120601 ⟩

= sum

int120601 (119903 119877) [ 120601 (119903 119877)]119889119903

119872

= sum⟨120601 | 120601 ⟩

119872

= minussumℏ

119872 ⟨120601 |

120597

120597119877 120601 ⟩

120597

120597119877

= minussumℏ

119872 120591

120597

120597119877

The matrices 120591 = ⟨120601 |

120601 ⟩ are the non-adiabatic couplings in the ldquofastrdquo

system These are exactly the non-adiabatic coefficients in the adiabatic theory

(Eq lrm(Id3))

It is possible to show that

sum(A + )Φ (119877)

+ Φ (119877)

= sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

Thus Eq lrm(Id8) becomes

sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

This is a Schroumldinger ndashlike equation which determines the coefficients Φ (119877)

These are called the ldquoslow eigenfunctionsrdquo Once they are computed one has

an exact solution to the Schroumldinger equation However we do not really

want to solve this infinite set of coupled differential equations Thus we as-

sume that the quantities 120591 for 119895 ne 119896 can be neglected Note that here there is

no which can be taken as small as needed to make the effect of 120591 as mi-

Electron Density Functional Theory Page 15

copy Roi Baer

nute as we need Still we can hope that the fact that we chose 119877 to be slow de-

grees of freedom allow us to make just this approximation This was the idea

of Born and Oppenheimer who neglected 120591

summinusℏ

2119872

120597

120597119877 Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

The resulting equation is a Schroumldinger equation for the slow degrees of free-

dom which move under the force of electrons derived from a potential 119882 (119877)

When applied to molecules the slow degrees of freedom are usually the nuclei

and the fast - the electrons There is a problem with neglecting 120591 because of

their non-dynamical effect Taking them into account results in treating them

as a magnetic field

sumℏ

2119872 (119894

120597

120597119877 minus 120591 )

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877) (Id9)

The BO approximation breaks the molecular SE into two consecutive tasks

First the electronic SE Eq lrm(Id5) must be solved for any relevant clamped po-

sition of nuclei 119877 Then the nuclear equation lrm(Id9)

Further reading on this subject M Baer Beyond Born-Oppenheimer elec-

tronic non-adiabatic coupling terms and conical intersections (Wiley Hobo-

ken NJ 2006)

e) Electron correlation

vi The electronic wave function of two non-interacting electrons

We have stressed that the electronic structure problem is difficult because of

the complexity of the many-body wave function when electrons interact In

order to appreciate this complexity of the electronic wave function let us first

study a simple system of two non-interacting electrons in a 1D atomic well

Electron Density Functional Theory Page 16

copy Roi Baer

We consider an atomic well given by the potential 119907 (119909) and we place in it

an electron The Hamiltonian is

ℎ = minusℏ

2119898 nabla + 119907 (119909)

(Ie1)

Now consider a 2-electron problem Assume we have two electrons Fermi-

ons which are non-interacting in the well The Hamiltonian is

= ℎ(1) + ℎ(2) (Ie2)

The notation ℎ(119894) means the Hamiltonian of Eq lrm(Ie1) with the coordinate of

the i-th electron

What are the eigenstates in this case First since each electron can have a

spin we must decide on the spin of the state For now let us assume the state

is spin-polarized ie that the total spine is 1 both electrons are in spin-up or-

bitals We try the following form as a wave function

Ψ(119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]120572(1)120572(2) (Ie3)

Notice that the spatial part is anti-symmetric while the spin part is symmetric

This renders the entire wave-function anti-symmetric in accordance with the

Pauli principle The notation 120572(1)120572(2) means both electrons have spin projec-

tion up (120572) We do not yet know if and under what conditions this wave func-

tion can actually describe eigenstates of the two electrons the

an lrm(Ie2) We assume that 120595 (119909) and 120595 (119909) are orthonormal This makes the

state normalized since

Electron Density Functional Theory Page 17

copy Roi Baer

intΨ(119909 119909 ) 119889119909 119889119909 =

1

2int(120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 ))

119889119909 119889119909

=1

2int [(120595 (119909 )120595 (119909 ))

+ (120595 (119909 )120595 (119909 ))

minus 2120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 )] 119889119909 119889119909

(Ie4)

The first and second terms are both equal to

int(120595 (119909 )120595 (119909 )) 119889119909 119889119909 =int120595 (119909 )

119889119909 int120595 (119909 ) 119889119909 =1 times 1

= 1 (Ie5)

The third term in lrm(Ie4) is

int120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 ) 119889119909 119889119909

=int120595 (119909 )120595 (119909 ) 119889119909 int120595 (119909 )120595 (119909 ) 119889119909 = 0 times 0 = 0 (Ie6)

Indeed the entire wave function is orthonormal (thanks to the factor 1radic2

in lrm(Ie3))

Now let us see under what condition the wave function in Eq (Ie3) is an ei-

genstate of the Hamiltonian in lrm(Ie2)

Ψ(119909 119909 ) = (ℎ(1) + ℎ(2))Ψ(119909 119909 )

=1

radic2120595 (119909 )ℎ(1)120595 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

minus1

radic2120595 (119909 )ℎ(1)120601 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

(Ie7)

This should be equated to

119864Ψ(119909 119909 ) =1

radic2119864120595 (119909 )120595 (119909 ) minus 119864120595 (119909 )120595 (119909 ) (Ie8)

If we choose the orbitals 120595 (119909) and 120595 (119909) to be eigenstates of h (which are

orthogonal so that is compatible with our previous assumption)

Electron Density Functional Theory Page 18

copy Roi Baer

ℎ120595 (119909) = 120598 120595 (119909) 119894 = 01 hellip (Ie9)

Thus

Ψ(119909 119909 ) =1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

minus1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

= (120598 + 120598 )Ψ(119909 119909 )

(Ie10)

And we see that indeed Ψ(x x ) is an eigenstate with energy E = ϵ + ϵ

vii Correlation in action a wave function of 2 interacting particles in an

Harmonic trap

We now build a simple electronic structure model that will allow us to study

in some detail the most basic concepts For this we suppose that the electrons

are in a harmonic atom that is the potential well

119907 (119909) =1

2120583120596 119909 (Ie11)

The two lowest eigenstates of a Harmonic oscillator are

120595 (119909) = 119873 119890

120595 (119909) = 119873 radic120583120596

ℏ119909119890

(Ie12)

The normalization constants are

119873 = (120583120596

120587ℏ)

119873 = (4120583120596

120587ℏ)

(Ie13)

And the eigenvalues are

120598 = (119899 +1

2)ℏ120596 119899 = 01 hellip (Ie14)

The groundstate energy of the 2-electron system in the triplet state describes

one electron in 120595 (119909) and another in 120595 (119909)

Electron Density Functional Theory Page 19

copy Roi Baer

119864 = 120598 + 120598 = 2ℏ120596 (Ie15)

As discussed above singlet and triplet two-electron ground state wave func-

tions composed of two orbitals must be space-symmetric or antisymmetric

respectively We consider below 3 wave functions The first Ψ is the ground

state singlet where both electrons are in 120595 The second and third are a singlet

and a triplet made from one electron in 120595 and the other in 120595

Ψ (119909 119909 ) = 120595 (119909 )120595 (119909 ) = 119873 119890

ℏ(

)

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) + 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 + 119909 )

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 minus 119909 )

(Ie16)

The normalization factors are

119873 = 119873 = radic

120583120596

120587ℏ 119873 =

1

radic2radic120583120596

ℏ119873 119873 =

120583120596

120587ℏ

(Ie17)

Eq lrm(Ie16) describes the distribution of positions of both electrons in their

corresponding states We now want to ask how much are the electrons in this

state aware of each other Do they correlate their motion in some way One

way to measure correlation is to consider the quantity lang119909 119909 rang minus lang119909 ranglang119909 rang If

electrons are completely unaware of each other this quantity called the posi-

tion-autocorrelation function is zero because then the average of the product

of their position must decompose to the product of the average Any deviance

from zero indicates some degree of correlation For the triplet wave function

in Eq lrm(Ie16) we find

Electron Density Functional Theory Page 20

copy Roi Baer

lang119909 rang = ⟨Ψ |119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩)

=1

2(⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩) = 0

(Ie18)

Of-course the same result would be obtained if we calculate ⟨119909 ⟩ because the

electrons are equivalent Furthermore

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩)

= minus|⟨120595 |119909|120595 ⟩| = minus

1

2

120583120596

(Ie19)

We thus find that

lang119909 119909 rang minus lang119909 ranglang119909 rang = minus1

2

120583120596

(Ie20)

The position autocorrelation functional is non-zero even though the electrons

are non-interacting The correlation is due to the Pauli principle It is negative

because the Pauli principle pushes the electrons apart and since most of the

time they both want to be near the origin this gives preference to them being

placed on opposite sides (when one electron has positive x coordinate the

other has negative and vice versa) Letrsquos see what happens in the singlet wave

function Ψ Here too lang119909 rang = lang119909 rang = 0 Then

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ = ⟨120595 120595 |119909 119909 |120595 120595 ⟩

= ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ = 0 (Ie21)

Electron Density Functional Theory Page 21

copy Roi Baer

The singlet ground state shows no correlation because the Pauli principle is

applied here to the spins not the spatial part of the wave function Note how-

ever this does not mean that all singlet wave functions of 2 electrons have no

correlation Indeed this is the situation in Ψ where the development is

very similar to Eq lrm(Ie19) except that the minus sign is now a plus sign so

lang119909 119909 rang minus lang119909 ranglang119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ minus 0

=1

2⟨120595 120595 + 120595 120595 |119909 119909 |120595 120595 + 120595 120595 ⟩ = |⟨120595 |119909|120595 ⟩|

=1

2

120583120596

(Ie22)

Here we find positive autocorrelation indicative of the fact that spin-opposite

non-interacting pairs of electrons want to stick together (when one has posi-

tive x the other wants to have as well and when one has negative the other

one wants negative as well) like non-interacting bosons

Since there is no interaction between the electrons the correlation in these

wave functions arises only from the Pauli principle ie because we impose

the fact that electrons are Fermions This is called Fermi correlation Our les-

son is this

1) Wavefunctions that are mere products of singe-particle orbitals have no correlation 2) If the products are symmetrized like in the case of the excited singlet the correlation

lang119909 119909 rang is positive indicating that the particles ldquolike to be togetherrdquo ie both on the right or both on the left of the origin

3) If the products are anti-symmetrized like in the case of the triplet the correlation lang119909 119909 rang is negative indicating that the particles ldquolike to be separatedrdquo

Up to now we assumed no e-e interaction So now letrsquos include it and add to

the Hamiltonian an interaction term

= ℎ(1) + ℎ(2) + 119881(119909 119909 ) (Ie23)

Electron Density Functional Theory Page 22

copy Roi Baer

Let us take one case of coupling which is simple enough to yield to analytical

analysis

119881(119909 119909 ) = 120583120574 119909 119909 (Ie24)

With 120574 lt 120596 In this case the Hamiltonian is

=119901

2120583+

119901

2120583+

1

2120583120596 119909

+1

2120583120596 119909

+ 120583120574 119909 119909 (Ie25)

The interaction seems strange at first sight because it does not seem to depend

on the distance between the particles as we are used to from electrostatics

yet it does describe a repulsion We can see this by writing the Hamiltonian

as

=119901

2120583+

119901

2120583+

1

2120583(120596 + 120574 )119909

+1

2120583(120596 + 120574 )119909

minus1

2120583120574 (119909 minus 119909 )

(Ie26)

We now see that the Hamiltonian describes two particles in a Harmonic po-

tential 119881 (119909) =

120583(120596 + 120574 )119909 interacting by a Harmonic potential

minus

120583120574 (119909 minus 119909 )

The potential drops as |119909 minus 119909 | grows ie it is repulsive

Exercise lrmI-1

Find the eigenvalues and eigenfunctions of this Hamiltonian (You can define

new coordinates 119883 and 119909 by

radic = 119909 and

radic = 119909 )

Exercise lrmI-2

Write the ground-state energy and wave function for the triplet state of the

system in the previous exercise Determine the effect of interaction on the en-

ergy by calculating 119903 =

and on the correlation function 119888 = lang119909 119909 rang minus

lang119909 ranglang119909 rang

Solution The result is shown in Figure lrmI-2 (left panel)

Electron Density Functional Theory Page 23

copy Roi Baer

Figure lrmI-2 (Left panel) The ratio = (Right panel) the correlation ratio ( ) (

)

vs interaction strengths where = (

)

f) The electron density is a much simpler object

than the wave function

The complexity of the electronic wave function is overwhelming It includes

all the information we can have on the electrons in a molecule in a certain

state However all these intricacies and details are often uninteresting for us

in many cases we simply have no direct use for them Take for instance the

electronic energy - our primary target in the Born-Oppenheimer picture It

depends only on the relative distance of pairs of particles This is because the

electron-electron Coulomb repulsion is a pair-wise interaction

One interesting quantity besides energy that can be extracted from the elec-

tronic wave function is the electron density 119899(119955) This 3D function tells us the

expectation value of the density of electrons That is 119899(119955)119889 119903 is the expectation

value of the number of electrons at a small volume 119889 119903 around point r Thus

we can write

119899(119955) = ⟨120595|(119955)|120595⟩ (If1)

We use the notation

00 01 02 03 04 05

04

05

06

07

08

09

10

r

00 01 02 03 04 050

5

10

15

cc

2

Electron Density Functional Theory Page 24

copy Roi Baer

⟨120595|120601⟩ equiv int120595 (119955 119955 hellip 119955 )120601(119955 119955 hellip 119955

)119889 119903 119889 119903 hellip119889 119903

(If2)

Here (119955) is the operator corresponding to the electron number density Since

electrons are point particles and the position operator for the ith electron is 119955

this operator is defined by

(119955) = sum120575( minus 119955)

(If3)

We used the definition of a 120575-function according to which

int120575(119955 minus 119955)119891(119955 )119889 119903 = 119891(119955)

(If4)

The function 120575( minus 119955) is the density of electron 119894 at 119955

Exercise lrmI-3

Calculate the expectation value of the electron number density and show that

119899(119955) = 119873 int120595(119955 119955 hellip 119955 )120595(119955 119955 hellip 119955

)119889 119903 hellip119889 119903

(If5)

Looking at Eq lrm(If5) we see that 119899(119955) involves integrating out a huge amount

of wave-function details It is as if only the data concerning the density distri-

bution of a single electron remains This is multiplied by 119873 in Eq lrm(If5) so

119899(119955) accounts for the combined density of all electrons Indeed integrating

over the entire space one obtains from lrm(If5)

int119899(119955)119889 119903 = 119873 (If6)

Expressing the fact that the total number of electrons is 119873

Exercise lrmI-4

Calculate and plot the 1D electron density of the triplet ground state from Ex-

ercise lrmI-2 as a function of the interaction parameter

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 6: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 6

copy Roi Baer

d) The Born-Oppenheimer theory

iii The adiabatic theorem

Suppose a Hamiltonian is dependent parametrically on 119877 = (119877 119877 hellip ) We

write this as [119877] Denote the eigenstates of this Hamiltonian by

[119877]120595 [119877] = 119864 [119877]120595 [119877]

Now suppose we change 119877 with time so that we have a trajectory 119877(119905) The

Hamiltonian will become time dependent (119905) = [119877(119905)] Suppose the sys-

tem is placed in its ground state at time = 0 120601(119905 = 0) = 120595 [119877(0)] It will

evolve according to the TDSE

119894ℏ(119905) = (119905)120601(119905)

Now the adiabatic theorem says that if 119877(119905) changes very slowly then

120601(119905) = 119890 ( )120595 [119877(119905)]

Namely except for a purely TD phase the evolving state stays the instanta-

neous ground state of the Hamiltonian (119905)

To show this we use the instantaneous eigenstates of the Hamiltonian (119905) to

expand 120601(119905)

120601(119905) = sum119886 (119905)119890

ℏint ( ) 120595 [119877(119905)]

We want to plug this expansion into the SE So let us calculate

119894ℏ(119905) = sum[119894ℏ + 119864 119886 ]119890

ℏint ( ) 120595 [119877(119905)]

+ sum119886 119890

ℏint ( ) 119894ℏ [119877(119905)]

And

Electron Density Functional Theory Page 7

copy Roi Baer

(119905)120601 = sum119886 119890

ℏint ( ) 119864 120595 [119877(119905)]

Equating the two expressions gives

0 = sum 119894ℏ 119890

ℏint ( ) 120595 [119877(119905)]

+ sum119886 119890

ℏint ( ) 119894ℏ [119877(119905)]

Now let us multiply by 120595 and integrate (remember that ⟨120595 |120595 ⟩ = 120575 )

0 = 119894ℏ 119890

ℏint ( ) + sum119886 119890

ℏint ( ) 119894ℏ⟨120595 | ⟩

Now we have

0 =119889

119889119905⟨120595 |120595 ⟩ = ⟨120595 | ⟩ + ⟨ |120595 ⟩

And since ⟨ |120595 ⟩ = ⟨120595 | ⟩ we find

120591 = ⟨120595 | ⟩ = minus⟨120595 | ⟩ = minus120591

The matrix 120591 = ⟨120595 | ⟩ is the matrix of ldquotime-dependent non-adiabatic

couplings (TDNACs)rdquo and it is an anti-Hermitean matrix Furthermore for

119899 = 119898 we see

119877119890⟨120595 | ⟩ = 0

Thus

= minus120591 119886 minus sum 119890 int ( ) 120591 119886

(Id1)

We see that the TDNACS 120591 create the non-adiabatic transitions ie the

transitions out of the ground state

Let us play a bit with 120591 Take the time derivative of the TISE

Electron Density Functional Theory Page 8

copy Roi Baer

(119905)120595 (119905) + (119905) (119905) = (119905)120595 (119905) + 119864 (119905) (119905)

Multiply by ⟨120595 | and get

⟨120595 || 120595 ⟩ + ⟨120595 || ⟩ = (119905)⟨120595 |120595 ⟩ + 119864 ⟨120595 | ⟩

Since ⟨120595 || ⟩ = 119864 ⟨120595 | ⟩ we have

⟨120595 || 120595 ⟩ = (119905)⟨120595 |120595 ⟩ + (119864 minus 119864 )⟨120595 | ⟩

If 119899 = 119898 then

(119905) = ⟨120595 || 120595 ⟩

This is called the Hellmann-Feynman theorem showing that the power is the

expectation value of the rate of change of the Hamiltonian If 119899 ne 119898

120591 = ⟨120595 | ⟩ =⟨120595 || 120595 ⟩

119864 minus 119864 (Id2)

This is called Epsteinrsquos theorem giving an expression for the TDNACs Re-

member that [119877(119905)] depends on 119905 through the positions of the nuclei Thus

[119877(119905)] =119889

119889119905[119877(119905)] =

part

partR [119877(119905)]

Thus

120591 = sum⟨120595 |

[119877(119905)]| 120595 ⟩

119864 [119877] minus 119864 [119877] (119905)

= sum120591 (119905)

Where the time-independent NACS are defined as

120591 [119877] = ⟨120595 |

120597

120597119877 120595 ⟩ =

⟨120595 [119877] |

[119877]| 120595 [119877]⟩

119864 [119877] minus 119864 [119877] (Id3)

Electron Density Functional Theory Page 9

copy Roi Baer

We see that the TDNACs depend on the velocity of the nuclei When nuclei

are small the couplings out of the ground state are small The 120591 are called

the ldquonon-adiabatic couplings (NACs)rdquo We find that as long as 119864 minus 119864 are not

zero on the trajectory one can always find small enough that makes the

TDNACs as small as we wish All we need for states 119899 and 119898 to stay decou-

pled is for the following conditions to be met

|120591 | = | (119905)120591 (119877 (119905))| ≪ 1

Thus we can make the NACs as small as we wish by taking the rate (119905) to

be small enough So all that is left in Eq lrm(Id1) is = minus120591 119886 Define the

non-dynamical phase as 120579 (119905) = 119894 int 120591 (119905 ) 119889119905

(note 120579 is real) and then

119886 (119905) = 119890 ( ) (119904119897119900119908 119901119903119900119888119890119904119904119890119904 = 119886119889119894119886119887119886119905119894119888 119901119903119900119888119890119904119904)

The total state is then

120601(119905) = 119886 (119905)119890

ℏint ( ) 120595 [119877(119905)] = 119890 ( ( ) ( ))120595 [119877(119905)]

where 120579 (119905) = minus

ℏint 119864 (120591)119889120591

is called the dynamical phase It is easy to prove

that if 119877(119905) traverses a closed loop the non-dynamical phase depends only on

that loop and not on the way it is traversed For example if we traverse the

same loop using different velocities the dynamic phase may change but the

geometric phase will not The closed loop geometric angle is called the Berry

phase The independence of path is a result of the fact that the non-dynamical

phase is a line integral and can be made with no reference to time

Electron Density Functional Theory Page 10

copy Roi Baer

120579 (119905) = 119894 int ⟨120595 (119877[120591])| (119877[120591])⟩119889120591

= 119894 int sum⟨120595 [119877]|120597

120597119877 120595 [119877]⟩

119889120591

= 119894 int sum120591 (119877)

119889119877

( )

( )

The dynamical phase for example is not a line integral and its value depends

not only on the path itself but also on the velocity taken along the way This

observation makes the non-dynamical phase a special quantity Berry has

shown that this quantity can give us information on the way the Hamiltonian

is dependent on its parameters For a real Hamiltonian for example 119890

around a closed path always equals either 1 or -1 If there is an even number

of degeneracies enclosed by the path it is 1 and if an odd number ndash it is -1

iv Motivation for the Born-Oppenheimer approximation classical nuclei

In a molecule we can think of the nuclei (having coordinates 119877) as heavy and

therefore classical particles which are slowly moving The electrons are light

and therefore quantum They have coordinates 119903 (119903 includes spin) and they

119877

119877

A path in parameter

ldquospacerdquo 119877 minus 119877

Electron Density Functional Theory Page 11

copy Roi Baer

feel the effect of slowly moving nuclei The electron dynamics is controlled by

the Hamiltonian

[119877] = + 119881(119903 119877)

The potential 119881(119903 119877) describes the Coulombic interactions between electrons

and nuclei (in atomic units)

119881(119903 119877) = minussum119885

| minus 119903 |

+1

2sum

1

119903

+1

2sum

119885 119885

119877

In general one wants to assume that the total energy of the molecule in this

classical approximation is

119864 = 119879 + ⟨120595 [119877]| [119877(119905)]|120595 [119877]⟩

Where 120595 [119877] is the ground state of the electrons at nuclear configuration 119877

The adiabatic theorem states that this is reasonable as long as nuclei move

slowly Thus the adiabatic theorem allows us to write

119864 = 119879 + 119881 (119877)

where 119881 (119877) is the ground state energy of the electrons at nuclear configura-

tion 119877 This energy is a usual classical mechanical energy and the Newton

equations of motion apply

119872 = 119865 = minus120597

120597119877 119881 (119877)

We see that the adiabatic theorem allows us to consider the nuclei as moving

in a potential well which is essentially the eigenvalues of the electrons This in

essence is the BO approximation when the nuclei are classical

Electron Density Functional Theory Page 12

copy Roi Baer

v The Born-Oppenheimer approximation in quantum nuclear case

The classical approach motivates a quantum treatment We are expecting that

nuclei will not excite electrons very efficiently That is the motivation for the

BO approximation

The Born and Oppenheimer development is similar to that of the adiabatic

theorem although there are no ldquoexternalrdquo fields Suppose we have a system of

fast particles with coordinates 119903 = (119903 119903 hellip ) and slow particles with coordi-

nates 119877 = (119877 119877 hellip ) The Hamiltonian can be written as

= + + 119881(119903 119877)

The Schroumldinger equation is

120595 (119903 119877) = 119864 120595 (119903 119877) (Id4)

Note that we can assume these wave functions are orthogonal

⟨120595 |120595 ⟩ = intint120595 (119903 119877)120595 (119903 119877)119889119903 119889119877 = 120575

Now to proceed let us first consider the fast r-Hamiltonian

[119877] = + 119881(119903 119877) (Id5)

In this ldquofast Hamiltonianrdquo [119877] the slow variables are simply parameters (it

contains no derivatives with respect to 119877) Thus it depends on 119877 parametrical-

ly The fast eigenvalue problem is

[119877]120601 (119903 119877) = 119882 (119877)120601 (119903 119877)

The eigenvalues are functions of the parameters 119877 They are called the adia-

batic (or Born-Oppenheimer) surfaces The notation with the semicolon be-

tween 119903 and 119877 is designed to emphasize that 120601 are wave functions in 119903 but

they depend only parametrically on 119877 This means for example that the over-

lap of the fast eigenstates (called adiabatic states) involves integration only

Electron Density Functional Theory Page 13

copy Roi Baer

over 119903 the dynamical coordinates while 119877 is held fixed since it is only a pa-

rameter

⟨120601 (119877)|120601 (119877)⟩ = int120601 (119903 119877)120601 (119903 119877)119889119903 = 120575

We cannot in fact say anything really meaningful about ⟨120601 (119877)|120601 (119877 )⟩ when

119877 ne 119877 (except when 119877 minus 119877 is infinitesimal but we will not go into this issue

here) Now we can expand the ldquorealrdquo wave-function as a linear combination

of the adiabatic functions

120595 (119903 119877) = sum120601 (119903 119877)Φ (119877)

(Id6)

We can do this because for any give 119877 120601 (119903 119877) span the space of wave func-

tions dependent on 119903 In fact the expansion coefficients Φ (119877) are given by

Φ (119877) = int120601 (119903 119877)120595 (119903 119877)119889119903

Now let us plug Eq lrm(Id6) into the SE lrm(Id4)

119864 120595 (119903 119877) = ( + )120595 (119903 119877)

= sum [120601 (119903 119877)Φ (119877)]

+ sum119882 (119877)120601 (119903 119877)Φ (119877)

(Id7)

Note that since = sum

where = minus119894ℏ

We have

[120601 (119903 119877)Φ (119877)]

= Φ (119877) [120601 (119903 119877)]

+ sum1

119872 ( 120601 (119903 119877)) ( Φ (119877))

+ 120601 (119903 119877) Φ (119877)

Multiplying Eq lrm(Id7) by 120601 (119903 119877) and integrating over 119903 gives

Electron Density Functional Theory Page 14

copy Roi Baer

sum(A + )Φ (119877)

+ ( + 119882 (119877))Φ (119877) = 119864 Φ (119877) (Id8)

Where

A = int120601 (119903 119877) [ 120601 (119903 119877)]119889119903 = ⟨120601 | 120601 ⟩

= sum

int120601 (119903 119877) [ 120601 (119903 119877)]119889119903

119872

= sum⟨120601 | 120601 ⟩

119872

= minussumℏ

119872 ⟨120601 |

120597

120597119877 120601 ⟩

120597

120597119877

= minussumℏ

119872 120591

120597

120597119877

The matrices 120591 = ⟨120601 |

120601 ⟩ are the non-adiabatic couplings in the ldquofastrdquo

system These are exactly the non-adiabatic coefficients in the adiabatic theory

(Eq lrm(Id3))

It is possible to show that

sum(A + )Φ (119877)

+ Φ (119877)

= sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

Thus Eq lrm(Id8) becomes

sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

This is a Schroumldinger ndashlike equation which determines the coefficients Φ (119877)

These are called the ldquoslow eigenfunctionsrdquo Once they are computed one has

an exact solution to the Schroumldinger equation However we do not really

want to solve this infinite set of coupled differential equations Thus we as-

sume that the quantities 120591 for 119895 ne 119896 can be neglected Note that here there is

no which can be taken as small as needed to make the effect of 120591 as mi-

Electron Density Functional Theory Page 15

copy Roi Baer

nute as we need Still we can hope that the fact that we chose 119877 to be slow de-

grees of freedom allow us to make just this approximation This was the idea

of Born and Oppenheimer who neglected 120591

summinusℏ

2119872

120597

120597119877 Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

The resulting equation is a Schroumldinger equation for the slow degrees of free-

dom which move under the force of electrons derived from a potential 119882 (119877)

When applied to molecules the slow degrees of freedom are usually the nuclei

and the fast - the electrons There is a problem with neglecting 120591 because of

their non-dynamical effect Taking them into account results in treating them

as a magnetic field

sumℏ

2119872 (119894

120597

120597119877 minus 120591 )

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877) (Id9)

The BO approximation breaks the molecular SE into two consecutive tasks

First the electronic SE Eq lrm(Id5) must be solved for any relevant clamped po-

sition of nuclei 119877 Then the nuclear equation lrm(Id9)

Further reading on this subject M Baer Beyond Born-Oppenheimer elec-

tronic non-adiabatic coupling terms and conical intersections (Wiley Hobo-

ken NJ 2006)

e) Electron correlation

vi The electronic wave function of two non-interacting electrons

We have stressed that the electronic structure problem is difficult because of

the complexity of the many-body wave function when electrons interact In

order to appreciate this complexity of the electronic wave function let us first

study a simple system of two non-interacting electrons in a 1D atomic well

Electron Density Functional Theory Page 16

copy Roi Baer

We consider an atomic well given by the potential 119907 (119909) and we place in it

an electron The Hamiltonian is

ℎ = minusℏ

2119898 nabla + 119907 (119909)

(Ie1)

Now consider a 2-electron problem Assume we have two electrons Fermi-

ons which are non-interacting in the well The Hamiltonian is

= ℎ(1) + ℎ(2) (Ie2)

The notation ℎ(119894) means the Hamiltonian of Eq lrm(Ie1) with the coordinate of

the i-th electron

What are the eigenstates in this case First since each electron can have a

spin we must decide on the spin of the state For now let us assume the state

is spin-polarized ie that the total spine is 1 both electrons are in spin-up or-

bitals We try the following form as a wave function

Ψ(119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]120572(1)120572(2) (Ie3)

Notice that the spatial part is anti-symmetric while the spin part is symmetric

This renders the entire wave-function anti-symmetric in accordance with the

Pauli principle The notation 120572(1)120572(2) means both electrons have spin projec-

tion up (120572) We do not yet know if and under what conditions this wave func-

tion can actually describe eigenstates of the two electrons the

an lrm(Ie2) We assume that 120595 (119909) and 120595 (119909) are orthonormal This makes the

state normalized since

Electron Density Functional Theory Page 17

copy Roi Baer

intΨ(119909 119909 ) 119889119909 119889119909 =

1

2int(120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 ))

119889119909 119889119909

=1

2int [(120595 (119909 )120595 (119909 ))

+ (120595 (119909 )120595 (119909 ))

minus 2120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 )] 119889119909 119889119909

(Ie4)

The first and second terms are both equal to

int(120595 (119909 )120595 (119909 )) 119889119909 119889119909 =int120595 (119909 )

119889119909 int120595 (119909 ) 119889119909 =1 times 1

= 1 (Ie5)

The third term in lrm(Ie4) is

int120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 ) 119889119909 119889119909

=int120595 (119909 )120595 (119909 ) 119889119909 int120595 (119909 )120595 (119909 ) 119889119909 = 0 times 0 = 0 (Ie6)

Indeed the entire wave function is orthonormal (thanks to the factor 1radic2

in lrm(Ie3))

Now let us see under what condition the wave function in Eq (Ie3) is an ei-

genstate of the Hamiltonian in lrm(Ie2)

Ψ(119909 119909 ) = (ℎ(1) + ℎ(2))Ψ(119909 119909 )

=1

radic2120595 (119909 )ℎ(1)120595 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

minus1

radic2120595 (119909 )ℎ(1)120601 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

(Ie7)

This should be equated to

119864Ψ(119909 119909 ) =1

radic2119864120595 (119909 )120595 (119909 ) minus 119864120595 (119909 )120595 (119909 ) (Ie8)

If we choose the orbitals 120595 (119909) and 120595 (119909) to be eigenstates of h (which are

orthogonal so that is compatible with our previous assumption)

Electron Density Functional Theory Page 18

copy Roi Baer

ℎ120595 (119909) = 120598 120595 (119909) 119894 = 01 hellip (Ie9)

Thus

Ψ(119909 119909 ) =1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

minus1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

= (120598 + 120598 )Ψ(119909 119909 )

(Ie10)

And we see that indeed Ψ(x x ) is an eigenstate with energy E = ϵ + ϵ

vii Correlation in action a wave function of 2 interacting particles in an

Harmonic trap

We now build a simple electronic structure model that will allow us to study

in some detail the most basic concepts For this we suppose that the electrons

are in a harmonic atom that is the potential well

119907 (119909) =1

2120583120596 119909 (Ie11)

The two lowest eigenstates of a Harmonic oscillator are

120595 (119909) = 119873 119890

120595 (119909) = 119873 radic120583120596

ℏ119909119890

(Ie12)

The normalization constants are

119873 = (120583120596

120587ℏ)

119873 = (4120583120596

120587ℏ)

(Ie13)

And the eigenvalues are

120598 = (119899 +1

2)ℏ120596 119899 = 01 hellip (Ie14)

The groundstate energy of the 2-electron system in the triplet state describes

one electron in 120595 (119909) and another in 120595 (119909)

Electron Density Functional Theory Page 19

copy Roi Baer

119864 = 120598 + 120598 = 2ℏ120596 (Ie15)

As discussed above singlet and triplet two-electron ground state wave func-

tions composed of two orbitals must be space-symmetric or antisymmetric

respectively We consider below 3 wave functions The first Ψ is the ground

state singlet where both electrons are in 120595 The second and third are a singlet

and a triplet made from one electron in 120595 and the other in 120595

Ψ (119909 119909 ) = 120595 (119909 )120595 (119909 ) = 119873 119890

ℏ(

)

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) + 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 + 119909 )

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 minus 119909 )

(Ie16)

The normalization factors are

119873 = 119873 = radic

120583120596

120587ℏ 119873 =

1

radic2radic120583120596

ℏ119873 119873 =

120583120596

120587ℏ

(Ie17)

Eq lrm(Ie16) describes the distribution of positions of both electrons in their

corresponding states We now want to ask how much are the electrons in this

state aware of each other Do they correlate their motion in some way One

way to measure correlation is to consider the quantity lang119909 119909 rang minus lang119909 ranglang119909 rang If

electrons are completely unaware of each other this quantity called the posi-

tion-autocorrelation function is zero because then the average of the product

of their position must decompose to the product of the average Any deviance

from zero indicates some degree of correlation For the triplet wave function

in Eq lrm(Ie16) we find

Electron Density Functional Theory Page 20

copy Roi Baer

lang119909 rang = ⟨Ψ |119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩)

=1

2(⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩) = 0

(Ie18)

Of-course the same result would be obtained if we calculate ⟨119909 ⟩ because the

electrons are equivalent Furthermore

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩)

= minus|⟨120595 |119909|120595 ⟩| = minus

1

2

120583120596

(Ie19)

We thus find that

lang119909 119909 rang minus lang119909 ranglang119909 rang = minus1

2

120583120596

(Ie20)

The position autocorrelation functional is non-zero even though the electrons

are non-interacting The correlation is due to the Pauli principle It is negative

because the Pauli principle pushes the electrons apart and since most of the

time they both want to be near the origin this gives preference to them being

placed on opposite sides (when one electron has positive x coordinate the

other has negative and vice versa) Letrsquos see what happens in the singlet wave

function Ψ Here too lang119909 rang = lang119909 rang = 0 Then

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ = ⟨120595 120595 |119909 119909 |120595 120595 ⟩

= ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ = 0 (Ie21)

Electron Density Functional Theory Page 21

copy Roi Baer

The singlet ground state shows no correlation because the Pauli principle is

applied here to the spins not the spatial part of the wave function Note how-

ever this does not mean that all singlet wave functions of 2 electrons have no

correlation Indeed this is the situation in Ψ where the development is

very similar to Eq lrm(Ie19) except that the minus sign is now a plus sign so

lang119909 119909 rang minus lang119909 ranglang119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ minus 0

=1

2⟨120595 120595 + 120595 120595 |119909 119909 |120595 120595 + 120595 120595 ⟩ = |⟨120595 |119909|120595 ⟩|

=1

2

120583120596

(Ie22)

Here we find positive autocorrelation indicative of the fact that spin-opposite

non-interacting pairs of electrons want to stick together (when one has posi-

tive x the other wants to have as well and when one has negative the other

one wants negative as well) like non-interacting bosons

Since there is no interaction between the electrons the correlation in these

wave functions arises only from the Pauli principle ie because we impose

the fact that electrons are Fermions This is called Fermi correlation Our les-

son is this

1) Wavefunctions that are mere products of singe-particle orbitals have no correlation 2) If the products are symmetrized like in the case of the excited singlet the correlation

lang119909 119909 rang is positive indicating that the particles ldquolike to be togetherrdquo ie both on the right or both on the left of the origin

3) If the products are anti-symmetrized like in the case of the triplet the correlation lang119909 119909 rang is negative indicating that the particles ldquolike to be separatedrdquo

Up to now we assumed no e-e interaction So now letrsquos include it and add to

the Hamiltonian an interaction term

= ℎ(1) + ℎ(2) + 119881(119909 119909 ) (Ie23)

Electron Density Functional Theory Page 22

copy Roi Baer

Let us take one case of coupling which is simple enough to yield to analytical

analysis

119881(119909 119909 ) = 120583120574 119909 119909 (Ie24)

With 120574 lt 120596 In this case the Hamiltonian is

=119901

2120583+

119901

2120583+

1

2120583120596 119909

+1

2120583120596 119909

+ 120583120574 119909 119909 (Ie25)

The interaction seems strange at first sight because it does not seem to depend

on the distance between the particles as we are used to from electrostatics

yet it does describe a repulsion We can see this by writing the Hamiltonian

as

=119901

2120583+

119901

2120583+

1

2120583(120596 + 120574 )119909

+1

2120583(120596 + 120574 )119909

minus1

2120583120574 (119909 minus 119909 )

(Ie26)

We now see that the Hamiltonian describes two particles in a Harmonic po-

tential 119881 (119909) =

120583(120596 + 120574 )119909 interacting by a Harmonic potential

minus

120583120574 (119909 minus 119909 )

The potential drops as |119909 minus 119909 | grows ie it is repulsive

Exercise lrmI-1

Find the eigenvalues and eigenfunctions of this Hamiltonian (You can define

new coordinates 119883 and 119909 by

radic = 119909 and

radic = 119909 )

Exercise lrmI-2

Write the ground-state energy and wave function for the triplet state of the

system in the previous exercise Determine the effect of interaction on the en-

ergy by calculating 119903 =

and on the correlation function 119888 = lang119909 119909 rang minus

lang119909 ranglang119909 rang

Solution The result is shown in Figure lrmI-2 (left panel)

Electron Density Functional Theory Page 23

copy Roi Baer

Figure lrmI-2 (Left panel) The ratio = (Right panel) the correlation ratio ( ) (

)

vs interaction strengths where = (

)

f) The electron density is a much simpler object

than the wave function

The complexity of the electronic wave function is overwhelming It includes

all the information we can have on the electrons in a molecule in a certain

state However all these intricacies and details are often uninteresting for us

in many cases we simply have no direct use for them Take for instance the

electronic energy - our primary target in the Born-Oppenheimer picture It

depends only on the relative distance of pairs of particles This is because the

electron-electron Coulomb repulsion is a pair-wise interaction

One interesting quantity besides energy that can be extracted from the elec-

tronic wave function is the electron density 119899(119955) This 3D function tells us the

expectation value of the density of electrons That is 119899(119955)119889 119903 is the expectation

value of the number of electrons at a small volume 119889 119903 around point r Thus

we can write

119899(119955) = ⟨120595|(119955)|120595⟩ (If1)

We use the notation

00 01 02 03 04 05

04

05

06

07

08

09

10

r

00 01 02 03 04 050

5

10

15

cc

2

Electron Density Functional Theory Page 24

copy Roi Baer

⟨120595|120601⟩ equiv int120595 (119955 119955 hellip 119955 )120601(119955 119955 hellip 119955

)119889 119903 119889 119903 hellip119889 119903

(If2)

Here (119955) is the operator corresponding to the electron number density Since

electrons are point particles and the position operator for the ith electron is 119955

this operator is defined by

(119955) = sum120575( minus 119955)

(If3)

We used the definition of a 120575-function according to which

int120575(119955 minus 119955)119891(119955 )119889 119903 = 119891(119955)

(If4)

The function 120575( minus 119955) is the density of electron 119894 at 119955

Exercise lrmI-3

Calculate the expectation value of the electron number density and show that

119899(119955) = 119873 int120595(119955 119955 hellip 119955 )120595(119955 119955 hellip 119955

)119889 119903 hellip119889 119903

(If5)

Looking at Eq lrm(If5) we see that 119899(119955) involves integrating out a huge amount

of wave-function details It is as if only the data concerning the density distri-

bution of a single electron remains This is multiplied by 119873 in Eq lrm(If5) so

119899(119955) accounts for the combined density of all electrons Indeed integrating

over the entire space one obtains from lrm(If5)

int119899(119955)119889 119903 = 119873 (If6)

Expressing the fact that the total number of electrons is 119873

Exercise lrmI-4

Calculate and plot the 1D electron density of the triplet ground state from Ex-

ercise lrmI-2 as a function of the interaction parameter

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 7: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 7

copy Roi Baer

(119905)120601 = sum119886 119890

ℏint ( ) 119864 120595 [119877(119905)]

Equating the two expressions gives

0 = sum 119894ℏ 119890

ℏint ( ) 120595 [119877(119905)]

+ sum119886 119890

ℏint ( ) 119894ℏ [119877(119905)]

Now let us multiply by 120595 and integrate (remember that ⟨120595 |120595 ⟩ = 120575 )

0 = 119894ℏ 119890

ℏint ( ) + sum119886 119890

ℏint ( ) 119894ℏ⟨120595 | ⟩

Now we have

0 =119889

119889119905⟨120595 |120595 ⟩ = ⟨120595 | ⟩ + ⟨ |120595 ⟩

And since ⟨ |120595 ⟩ = ⟨120595 | ⟩ we find

120591 = ⟨120595 | ⟩ = minus⟨120595 | ⟩ = minus120591

The matrix 120591 = ⟨120595 | ⟩ is the matrix of ldquotime-dependent non-adiabatic

couplings (TDNACs)rdquo and it is an anti-Hermitean matrix Furthermore for

119899 = 119898 we see

119877119890⟨120595 | ⟩ = 0

Thus

= minus120591 119886 minus sum 119890 int ( ) 120591 119886

(Id1)

We see that the TDNACS 120591 create the non-adiabatic transitions ie the

transitions out of the ground state

Let us play a bit with 120591 Take the time derivative of the TISE

Electron Density Functional Theory Page 8

copy Roi Baer

(119905)120595 (119905) + (119905) (119905) = (119905)120595 (119905) + 119864 (119905) (119905)

Multiply by ⟨120595 | and get

⟨120595 || 120595 ⟩ + ⟨120595 || ⟩ = (119905)⟨120595 |120595 ⟩ + 119864 ⟨120595 | ⟩

Since ⟨120595 || ⟩ = 119864 ⟨120595 | ⟩ we have

⟨120595 || 120595 ⟩ = (119905)⟨120595 |120595 ⟩ + (119864 minus 119864 )⟨120595 | ⟩

If 119899 = 119898 then

(119905) = ⟨120595 || 120595 ⟩

This is called the Hellmann-Feynman theorem showing that the power is the

expectation value of the rate of change of the Hamiltonian If 119899 ne 119898

120591 = ⟨120595 | ⟩ =⟨120595 || 120595 ⟩

119864 minus 119864 (Id2)

This is called Epsteinrsquos theorem giving an expression for the TDNACs Re-

member that [119877(119905)] depends on 119905 through the positions of the nuclei Thus

[119877(119905)] =119889

119889119905[119877(119905)] =

part

partR [119877(119905)]

Thus

120591 = sum⟨120595 |

[119877(119905)]| 120595 ⟩

119864 [119877] minus 119864 [119877] (119905)

= sum120591 (119905)

Where the time-independent NACS are defined as

120591 [119877] = ⟨120595 |

120597

120597119877 120595 ⟩ =

⟨120595 [119877] |

[119877]| 120595 [119877]⟩

119864 [119877] minus 119864 [119877] (Id3)

Electron Density Functional Theory Page 9

copy Roi Baer

We see that the TDNACs depend on the velocity of the nuclei When nuclei

are small the couplings out of the ground state are small The 120591 are called

the ldquonon-adiabatic couplings (NACs)rdquo We find that as long as 119864 minus 119864 are not

zero on the trajectory one can always find small enough that makes the

TDNACs as small as we wish All we need for states 119899 and 119898 to stay decou-

pled is for the following conditions to be met

|120591 | = | (119905)120591 (119877 (119905))| ≪ 1

Thus we can make the NACs as small as we wish by taking the rate (119905) to

be small enough So all that is left in Eq lrm(Id1) is = minus120591 119886 Define the

non-dynamical phase as 120579 (119905) = 119894 int 120591 (119905 ) 119889119905

(note 120579 is real) and then

119886 (119905) = 119890 ( ) (119904119897119900119908 119901119903119900119888119890119904119904119890119904 = 119886119889119894119886119887119886119905119894119888 119901119903119900119888119890119904119904)

The total state is then

120601(119905) = 119886 (119905)119890

ℏint ( ) 120595 [119877(119905)] = 119890 ( ( ) ( ))120595 [119877(119905)]

where 120579 (119905) = minus

ℏint 119864 (120591)119889120591

is called the dynamical phase It is easy to prove

that if 119877(119905) traverses a closed loop the non-dynamical phase depends only on

that loop and not on the way it is traversed For example if we traverse the

same loop using different velocities the dynamic phase may change but the

geometric phase will not The closed loop geometric angle is called the Berry

phase The independence of path is a result of the fact that the non-dynamical

phase is a line integral and can be made with no reference to time

Electron Density Functional Theory Page 10

copy Roi Baer

120579 (119905) = 119894 int ⟨120595 (119877[120591])| (119877[120591])⟩119889120591

= 119894 int sum⟨120595 [119877]|120597

120597119877 120595 [119877]⟩

119889120591

= 119894 int sum120591 (119877)

119889119877

( )

( )

The dynamical phase for example is not a line integral and its value depends

not only on the path itself but also on the velocity taken along the way This

observation makes the non-dynamical phase a special quantity Berry has

shown that this quantity can give us information on the way the Hamiltonian

is dependent on its parameters For a real Hamiltonian for example 119890

around a closed path always equals either 1 or -1 If there is an even number

of degeneracies enclosed by the path it is 1 and if an odd number ndash it is -1

iv Motivation for the Born-Oppenheimer approximation classical nuclei

In a molecule we can think of the nuclei (having coordinates 119877) as heavy and

therefore classical particles which are slowly moving The electrons are light

and therefore quantum They have coordinates 119903 (119903 includes spin) and they

119877

119877

A path in parameter

ldquospacerdquo 119877 minus 119877

Electron Density Functional Theory Page 11

copy Roi Baer

feel the effect of slowly moving nuclei The electron dynamics is controlled by

the Hamiltonian

[119877] = + 119881(119903 119877)

The potential 119881(119903 119877) describes the Coulombic interactions between electrons

and nuclei (in atomic units)

119881(119903 119877) = minussum119885

| minus 119903 |

+1

2sum

1

119903

+1

2sum

119885 119885

119877

In general one wants to assume that the total energy of the molecule in this

classical approximation is

119864 = 119879 + ⟨120595 [119877]| [119877(119905)]|120595 [119877]⟩

Where 120595 [119877] is the ground state of the electrons at nuclear configuration 119877

The adiabatic theorem states that this is reasonable as long as nuclei move

slowly Thus the adiabatic theorem allows us to write

119864 = 119879 + 119881 (119877)

where 119881 (119877) is the ground state energy of the electrons at nuclear configura-

tion 119877 This energy is a usual classical mechanical energy and the Newton

equations of motion apply

119872 = 119865 = minus120597

120597119877 119881 (119877)

We see that the adiabatic theorem allows us to consider the nuclei as moving

in a potential well which is essentially the eigenvalues of the electrons This in

essence is the BO approximation when the nuclei are classical

Electron Density Functional Theory Page 12

copy Roi Baer

v The Born-Oppenheimer approximation in quantum nuclear case

The classical approach motivates a quantum treatment We are expecting that

nuclei will not excite electrons very efficiently That is the motivation for the

BO approximation

The Born and Oppenheimer development is similar to that of the adiabatic

theorem although there are no ldquoexternalrdquo fields Suppose we have a system of

fast particles with coordinates 119903 = (119903 119903 hellip ) and slow particles with coordi-

nates 119877 = (119877 119877 hellip ) The Hamiltonian can be written as

= + + 119881(119903 119877)

The Schroumldinger equation is

120595 (119903 119877) = 119864 120595 (119903 119877) (Id4)

Note that we can assume these wave functions are orthogonal

⟨120595 |120595 ⟩ = intint120595 (119903 119877)120595 (119903 119877)119889119903 119889119877 = 120575

Now to proceed let us first consider the fast r-Hamiltonian

[119877] = + 119881(119903 119877) (Id5)

In this ldquofast Hamiltonianrdquo [119877] the slow variables are simply parameters (it

contains no derivatives with respect to 119877) Thus it depends on 119877 parametrical-

ly The fast eigenvalue problem is

[119877]120601 (119903 119877) = 119882 (119877)120601 (119903 119877)

The eigenvalues are functions of the parameters 119877 They are called the adia-

batic (or Born-Oppenheimer) surfaces The notation with the semicolon be-

tween 119903 and 119877 is designed to emphasize that 120601 are wave functions in 119903 but

they depend only parametrically on 119877 This means for example that the over-

lap of the fast eigenstates (called adiabatic states) involves integration only

Electron Density Functional Theory Page 13

copy Roi Baer

over 119903 the dynamical coordinates while 119877 is held fixed since it is only a pa-

rameter

⟨120601 (119877)|120601 (119877)⟩ = int120601 (119903 119877)120601 (119903 119877)119889119903 = 120575

We cannot in fact say anything really meaningful about ⟨120601 (119877)|120601 (119877 )⟩ when

119877 ne 119877 (except when 119877 minus 119877 is infinitesimal but we will not go into this issue

here) Now we can expand the ldquorealrdquo wave-function as a linear combination

of the adiabatic functions

120595 (119903 119877) = sum120601 (119903 119877)Φ (119877)

(Id6)

We can do this because for any give 119877 120601 (119903 119877) span the space of wave func-

tions dependent on 119903 In fact the expansion coefficients Φ (119877) are given by

Φ (119877) = int120601 (119903 119877)120595 (119903 119877)119889119903

Now let us plug Eq lrm(Id6) into the SE lrm(Id4)

119864 120595 (119903 119877) = ( + )120595 (119903 119877)

= sum [120601 (119903 119877)Φ (119877)]

+ sum119882 (119877)120601 (119903 119877)Φ (119877)

(Id7)

Note that since = sum

where = minus119894ℏ

We have

[120601 (119903 119877)Φ (119877)]

= Φ (119877) [120601 (119903 119877)]

+ sum1

119872 ( 120601 (119903 119877)) ( Φ (119877))

+ 120601 (119903 119877) Φ (119877)

Multiplying Eq lrm(Id7) by 120601 (119903 119877) and integrating over 119903 gives

Electron Density Functional Theory Page 14

copy Roi Baer

sum(A + )Φ (119877)

+ ( + 119882 (119877))Φ (119877) = 119864 Φ (119877) (Id8)

Where

A = int120601 (119903 119877) [ 120601 (119903 119877)]119889119903 = ⟨120601 | 120601 ⟩

= sum

int120601 (119903 119877) [ 120601 (119903 119877)]119889119903

119872

= sum⟨120601 | 120601 ⟩

119872

= minussumℏ

119872 ⟨120601 |

120597

120597119877 120601 ⟩

120597

120597119877

= minussumℏ

119872 120591

120597

120597119877

The matrices 120591 = ⟨120601 |

120601 ⟩ are the non-adiabatic couplings in the ldquofastrdquo

system These are exactly the non-adiabatic coefficients in the adiabatic theory

(Eq lrm(Id3))

It is possible to show that

sum(A + )Φ (119877)

+ Φ (119877)

= sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

Thus Eq lrm(Id8) becomes

sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

This is a Schroumldinger ndashlike equation which determines the coefficients Φ (119877)

These are called the ldquoslow eigenfunctionsrdquo Once they are computed one has

an exact solution to the Schroumldinger equation However we do not really

want to solve this infinite set of coupled differential equations Thus we as-

sume that the quantities 120591 for 119895 ne 119896 can be neglected Note that here there is

no which can be taken as small as needed to make the effect of 120591 as mi-

Electron Density Functional Theory Page 15

copy Roi Baer

nute as we need Still we can hope that the fact that we chose 119877 to be slow de-

grees of freedom allow us to make just this approximation This was the idea

of Born and Oppenheimer who neglected 120591

summinusℏ

2119872

120597

120597119877 Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

The resulting equation is a Schroumldinger equation for the slow degrees of free-

dom which move under the force of electrons derived from a potential 119882 (119877)

When applied to molecules the slow degrees of freedom are usually the nuclei

and the fast - the electrons There is a problem with neglecting 120591 because of

their non-dynamical effect Taking them into account results in treating them

as a magnetic field

sumℏ

2119872 (119894

120597

120597119877 minus 120591 )

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877) (Id9)

The BO approximation breaks the molecular SE into two consecutive tasks

First the electronic SE Eq lrm(Id5) must be solved for any relevant clamped po-

sition of nuclei 119877 Then the nuclear equation lrm(Id9)

Further reading on this subject M Baer Beyond Born-Oppenheimer elec-

tronic non-adiabatic coupling terms and conical intersections (Wiley Hobo-

ken NJ 2006)

e) Electron correlation

vi The electronic wave function of two non-interacting electrons

We have stressed that the electronic structure problem is difficult because of

the complexity of the many-body wave function when electrons interact In

order to appreciate this complexity of the electronic wave function let us first

study a simple system of two non-interacting electrons in a 1D atomic well

Electron Density Functional Theory Page 16

copy Roi Baer

We consider an atomic well given by the potential 119907 (119909) and we place in it

an electron The Hamiltonian is

ℎ = minusℏ

2119898 nabla + 119907 (119909)

(Ie1)

Now consider a 2-electron problem Assume we have two electrons Fermi-

ons which are non-interacting in the well The Hamiltonian is

= ℎ(1) + ℎ(2) (Ie2)

The notation ℎ(119894) means the Hamiltonian of Eq lrm(Ie1) with the coordinate of

the i-th electron

What are the eigenstates in this case First since each electron can have a

spin we must decide on the spin of the state For now let us assume the state

is spin-polarized ie that the total spine is 1 both electrons are in spin-up or-

bitals We try the following form as a wave function

Ψ(119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]120572(1)120572(2) (Ie3)

Notice that the spatial part is anti-symmetric while the spin part is symmetric

This renders the entire wave-function anti-symmetric in accordance with the

Pauli principle The notation 120572(1)120572(2) means both electrons have spin projec-

tion up (120572) We do not yet know if and under what conditions this wave func-

tion can actually describe eigenstates of the two electrons the

an lrm(Ie2) We assume that 120595 (119909) and 120595 (119909) are orthonormal This makes the

state normalized since

Electron Density Functional Theory Page 17

copy Roi Baer

intΨ(119909 119909 ) 119889119909 119889119909 =

1

2int(120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 ))

119889119909 119889119909

=1

2int [(120595 (119909 )120595 (119909 ))

+ (120595 (119909 )120595 (119909 ))

minus 2120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 )] 119889119909 119889119909

(Ie4)

The first and second terms are both equal to

int(120595 (119909 )120595 (119909 )) 119889119909 119889119909 =int120595 (119909 )

119889119909 int120595 (119909 ) 119889119909 =1 times 1

= 1 (Ie5)

The third term in lrm(Ie4) is

int120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 ) 119889119909 119889119909

=int120595 (119909 )120595 (119909 ) 119889119909 int120595 (119909 )120595 (119909 ) 119889119909 = 0 times 0 = 0 (Ie6)

Indeed the entire wave function is orthonormal (thanks to the factor 1radic2

in lrm(Ie3))

Now let us see under what condition the wave function in Eq (Ie3) is an ei-

genstate of the Hamiltonian in lrm(Ie2)

Ψ(119909 119909 ) = (ℎ(1) + ℎ(2))Ψ(119909 119909 )

=1

radic2120595 (119909 )ℎ(1)120595 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

minus1

radic2120595 (119909 )ℎ(1)120601 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

(Ie7)

This should be equated to

119864Ψ(119909 119909 ) =1

radic2119864120595 (119909 )120595 (119909 ) minus 119864120595 (119909 )120595 (119909 ) (Ie8)

If we choose the orbitals 120595 (119909) and 120595 (119909) to be eigenstates of h (which are

orthogonal so that is compatible with our previous assumption)

Electron Density Functional Theory Page 18

copy Roi Baer

ℎ120595 (119909) = 120598 120595 (119909) 119894 = 01 hellip (Ie9)

Thus

Ψ(119909 119909 ) =1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

minus1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

= (120598 + 120598 )Ψ(119909 119909 )

(Ie10)

And we see that indeed Ψ(x x ) is an eigenstate with energy E = ϵ + ϵ

vii Correlation in action a wave function of 2 interacting particles in an

Harmonic trap

We now build a simple electronic structure model that will allow us to study

in some detail the most basic concepts For this we suppose that the electrons

are in a harmonic atom that is the potential well

119907 (119909) =1

2120583120596 119909 (Ie11)

The two lowest eigenstates of a Harmonic oscillator are

120595 (119909) = 119873 119890

120595 (119909) = 119873 radic120583120596

ℏ119909119890

(Ie12)

The normalization constants are

119873 = (120583120596

120587ℏ)

119873 = (4120583120596

120587ℏ)

(Ie13)

And the eigenvalues are

120598 = (119899 +1

2)ℏ120596 119899 = 01 hellip (Ie14)

The groundstate energy of the 2-electron system in the triplet state describes

one electron in 120595 (119909) and another in 120595 (119909)

Electron Density Functional Theory Page 19

copy Roi Baer

119864 = 120598 + 120598 = 2ℏ120596 (Ie15)

As discussed above singlet and triplet two-electron ground state wave func-

tions composed of two orbitals must be space-symmetric or antisymmetric

respectively We consider below 3 wave functions The first Ψ is the ground

state singlet where both electrons are in 120595 The second and third are a singlet

and a triplet made from one electron in 120595 and the other in 120595

Ψ (119909 119909 ) = 120595 (119909 )120595 (119909 ) = 119873 119890

ℏ(

)

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) + 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 + 119909 )

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 minus 119909 )

(Ie16)

The normalization factors are

119873 = 119873 = radic

120583120596

120587ℏ 119873 =

1

radic2radic120583120596

ℏ119873 119873 =

120583120596

120587ℏ

(Ie17)

Eq lrm(Ie16) describes the distribution of positions of both electrons in their

corresponding states We now want to ask how much are the electrons in this

state aware of each other Do they correlate their motion in some way One

way to measure correlation is to consider the quantity lang119909 119909 rang minus lang119909 ranglang119909 rang If

electrons are completely unaware of each other this quantity called the posi-

tion-autocorrelation function is zero because then the average of the product

of their position must decompose to the product of the average Any deviance

from zero indicates some degree of correlation For the triplet wave function

in Eq lrm(Ie16) we find

Electron Density Functional Theory Page 20

copy Roi Baer

lang119909 rang = ⟨Ψ |119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩)

=1

2(⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩) = 0

(Ie18)

Of-course the same result would be obtained if we calculate ⟨119909 ⟩ because the

electrons are equivalent Furthermore

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩)

= minus|⟨120595 |119909|120595 ⟩| = minus

1

2

120583120596

(Ie19)

We thus find that

lang119909 119909 rang minus lang119909 ranglang119909 rang = minus1

2

120583120596

(Ie20)

The position autocorrelation functional is non-zero even though the electrons

are non-interacting The correlation is due to the Pauli principle It is negative

because the Pauli principle pushes the electrons apart and since most of the

time they both want to be near the origin this gives preference to them being

placed on opposite sides (when one electron has positive x coordinate the

other has negative and vice versa) Letrsquos see what happens in the singlet wave

function Ψ Here too lang119909 rang = lang119909 rang = 0 Then

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ = ⟨120595 120595 |119909 119909 |120595 120595 ⟩

= ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ = 0 (Ie21)

Electron Density Functional Theory Page 21

copy Roi Baer

The singlet ground state shows no correlation because the Pauli principle is

applied here to the spins not the spatial part of the wave function Note how-

ever this does not mean that all singlet wave functions of 2 electrons have no

correlation Indeed this is the situation in Ψ where the development is

very similar to Eq lrm(Ie19) except that the minus sign is now a plus sign so

lang119909 119909 rang minus lang119909 ranglang119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ minus 0

=1

2⟨120595 120595 + 120595 120595 |119909 119909 |120595 120595 + 120595 120595 ⟩ = |⟨120595 |119909|120595 ⟩|

=1

2

120583120596

(Ie22)

Here we find positive autocorrelation indicative of the fact that spin-opposite

non-interacting pairs of electrons want to stick together (when one has posi-

tive x the other wants to have as well and when one has negative the other

one wants negative as well) like non-interacting bosons

Since there is no interaction between the electrons the correlation in these

wave functions arises only from the Pauli principle ie because we impose

the fact that electrons are Fermions This is called Fermi correlation Our les-

son is this

1) Wavefunctions that are mere products of singe-particle orbitals have no correlation 2) If the products are symmetrized like in the case of the excited singlet the correlation

lang119909 119909 rang is positive indicating that the particles ldquolike to be togetherrdquo ie both on the right or both on the left of the origin

3) If the products are anti-symmetrized like in the case of the triplet the correlation lang119909 119909 rang is negative indicating that the particles ldquolike to be separatedrdquo

Up to now we assumed no e-e interaction So now letrsquos include it and add to

the Hamiltonian an interaction term

= ℎ(1) + ℎ(2) + 119881(119909 119909 ) (Ie23)

Electron Density Functional Theory Page 22

copy Roi Baer

Let us take one case of coupling which is simple enough to yield to analytical

analysis

119881(119909 119909 ) = 120583120574 119909 119909 (Ie24)

With 120574 lt 120596 In this case the Hamiltonian is

=119901

2120583+

119901

2120583+

1

2120583120596 119909

+1

2120583120596 119909

+ 120583120574 119909 119909 (Ie25)

The interaction seems strange at first sight because it does not seem to depend

on the distance between the particles as we are used to from electrostatics

yet it does describe a repulsion We can see this by writing the Hamiltonian

as

=119901

2120583+

119901

2120583+

1

2120583(120596 + 120574 )119909

+1

2120583(120596 + 120574 )119909

minus1

2120583120574 (119909 minus 119909 )

(Ie26)

We now see that the Hamiltonian describes two particles in a Harmonic po-

tential 119881 (119909) =

120583(120596 + 120574 )119909 interacting by a Harmonic potential

minus

120583120574 (119909 minus 119909 )

The potential drops as |119909 minus 119909 | grows ie it is repulsive

Exercise lrmI-1

Find the eigenvalues and eigenfunctions of this Hamiltonian (You can define

new coordinates 119883 and 119909 by

radic = 119909 and

radic = 119909 )

Exercise lrmI-2

Write the ground-state energy and wave function for the triplet state of the

system in the previous exercise Determine the effect of interaction on the en-

ergy by calculating 119903 =

and on the correlation function 119888 = lang119909 119909 rang minus

lang119909 ranglang119909 rang

Solution The result is shown in Figure lrmI-2 (left panel)

Electron Density Functional Theory Page 23

copy Roi Baer

Figure lrmI-2 (Left panel) The ratio = (Right panel) the correlation ratio ( ) (

)

vs interaction strengths where = (

)

f) The electron density is a much simpler object

than the wave function

The complexity of the electronic wave function is overwhelming It includes

all the information we can have on the electrons in a molecule in a certain

state However all these intricacies and details are often uninteresting for us

in many cases we simply have no direct use for them Take for instance the

electronic energy - our primary target in the Born-Oppenheimer picture It

depends only on the relative distance of pairs of particles This is because the

electron-electron Coulomb repulsion is a pair-wise interaction

One interesting quantity besides energy that can be extracted from the elec-

tronic wave function is the electron density 119899(119955) This 3D function tells us the

expectation value of the density of electrons That is 119899(119955)119889 119903 is the expectation

value of the number of electrons at a small volume 119889 119903 around point r Thus

we can write

119899(119955) = ⟨120595|(119955)|120595⟩ (If1)

We use the notation

00 01 02 03 04 05

04

05

06

07

08

09

10

r

00 01 02 03 04 050

5

10

15

cc

2

Electron Density Functional Theory Page 24

copy Roi Baer

⟨120595|120601⟩ equiv int120595 (119955 119955 hellip 119955 )120601(119955 119955 hellip 119955

)119889 119903 119889 119903 hellip119889 119903

(If2)

Here (119955) is the operator corresponding to the electron number density Since

electrons are point particles and the position operator for the ith electron is 119955

this operator is defined by

(119955) = sum120575( minus 119955)

(If3)

We used the definition of a 120575-function according to which

int120575(119955 minus 119955)119891(119955 )119889 119903 = 119891(119955)

(If4)

The function 120575( minus 119955) is the density of electron 119894 at 119955

Exercise lrmI-3

Calculate the expectation value of the electron number density and show that

119899(119955) = 119873 int120595(119955 119955 hellip 119955 )120595(119955 119955 hellip 119955

)119889 119903 hellip119889 119903

(If5)

Looking at Eq lrm(If5) we see that 119899(119955) involves integrating out a huge amount

of wave-function details It is as if only the data concerning the density distri-

bution of a single electron remains This is multiplied by 119873 in Eq lrm(If5) so

119899(119955) accounts for the combined density of all electrons Indeed integrating

over the entire space one obtains from lrm(If5)

int119899(119955)119889 119903 = 119873 (If6)

Expressing the fact that the total number of electrons is 119873

Exercise lrmI-4

Calculate and plot the 1D electron density of the triplet ground state from Ex-

ercise lrmI-2 as a function of the interaction parameter

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 8: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 8

copy Roi Baer

(119905)120595 (119905) + (119905) (119905) = (119905)120595 (119905) + 119864 (119905) (119905)

Multiply by ⟨120595 | and get

⟨120595 || 120595 ⟩ + ⟨120595 || ⟩ = (119905)⟨120595 |120595 ⟩ + 119864 ⟨120595 | ⟩

Since ⟨120595 || ⟩ = 119864 ⟨120595 | ⟩ we have

⟨120595 || 120595 ⟩ = (119905)⟨120595 |120595 ⟩ + (119864 minus 119864 )⟨120595 | ⟩

If 119899 = 119898 then

(119905) = ⟨120595 || 120595 ⟩

This is called the Hellmann-Feynman theorem showing that the power is the

expectation value of the rate of change of the Hamiltonian If 119899 ne 119898

120591 = ⟨120595 | ⟩ =⟨120595 || 120595 ⟩

119864 minus 119864 (Id2)

This is called Epsteinrsquos theorem giving an expression for the TDNACs Re-

member that [119877(119905)] depends on 119905 through the positions of the nuclei Thus

[119877(119905)] =119889

119889119905[119877(119905)] =

part

partR [119877(119905)]

Thus

120591 = sum⟨120595 |

[119877(119905)]| 120595 ⟩

119864 [119877] minus 119864 [119877] (119905)

= sum120591 (119905)

Where the time-independent NACS are defined as

120591 [119877] = ⟨120595 |

120597

120597119877 120595 ⟩ =

⟨120595 [119877] |

[119877]| 120595 [119877]⟩

119864 [119877] minus 119864 [119877] (Id3)

Electron Density Functional Theory Page 9

copy Roi Baer

We see that the TDNACs depend on the velocity of the nuclei When nuclei

are small the couplings out of the ground state are small The 120591 are called

the ldquonon-adiabatic couplings (NACs)rdquo We find that as long as 119864 minus 119864 are not

zero on the trajectory one can always find small enough that makes the

TDNACs as small as we wish All we need for states 119899 and 119898 to stay decou-

pled is for the following conditions to be met

|120591 | = | (119905)120591 (119877 (119905))| ≪ 1

Thus we can make the NACs as small as we wish by taking the rate (119905) to

be small enough So all that is left in Eq lrm(Id1) is = minus120591 119886 Define the

non-dynamical phase as 120579 (119905) = 119894 int 120591 (119905 ) 119889119905

(note 120579 is real) and then

119886 (119905) = 119890 ( ) (119904119897119900119908 119901119903119900119888119890119904119904119890119904 = 119886119889119894119886119887119886119905119894119888 119901119903119900119888119890119904119904)

The total state is then

120601(119905) = 119886 (119905)119890

ℏint ( ) 120595 [119877(119905)] = 119890 ( ( ) ( ))120595 [119877(119905)]

where 120579 (119905) = minus

ℏint 119864 (120591)119889120591

is called the dynamical phase It is easy to prove

that if 119877(119905) traverses a closed loop the non-dynamical phase depends only on

that loop and not on the way it is traversed For example if we traverse the

same loop using different velocities the dynamic phase may change but the

geometric phase will not The closed loop geometric angle is called the Berry

phase The independence of path is a result of the fact that the non-dynamical

phase is a line integral and can be made with no reference to time

Electron Density Functional Theory Page 10

copy Roi Baer

120579 (119905) = 119894 int ⟨120595 (119877[120591])| (119877[120591])⟩119889120591

= 119894 int sum⟨120595 [119877]|120597

120597119877 120595 [119877]⟩

119889120591

= 119894 int sum120591 (119877)

119889119877

( )

( )

The dynamical phase for example is not a line integral and its value depends

not only on the path itself but also on the velocity taken along the way This

observation makes the non-dynamical phase a special quantity Berry has

shown that this quantity can give us information on the way the Hamiltonian

is dependent on its parameters For a real Hamiltonian for example 119890

around a closed path always equals either 1 or -1 If there is an even number

of degeneracies enclosed by the path it is 1 and if an odd number ndash it is -1

iv Motivation for the Born-Oppenheimer approximation classical nuclei

In a molecule we can think of the nuclei (having coordinates 119877) as heavy and

therefore classical particles which are slowly moving The electrons are light

and therefore quantum They have coordinates 119903 (119903 includes spin) and they

119877

119877

A path in parameter

ldquospacerdquo 119877 minus 119877

Electron Density Functional Theory Page 11

copy Roi Baer

feel the effect of slowly moving nuclei The electron dynamics is controlled by

the Hamiltonian

[119877] = + 119881(119903 119877)

The potential 119881(119903 119877) describes the Coulombic interactions between electrons

and nuclei (in atomic units)

119881(119903 119877) = minussum119885

| minus 119903 |

+1

2sum

1

119903

+1

2sum

119885 119885

119877

In general one wants to assume that the total energy of the molecule in this

classical approximation is

119864 = 119879 + ⟨120595 [119877]| [119877(119905)]|120595 [119877]⟩

Where 120595 [119877] is the ground state of the electrons at nuclear configuration 119877

The adiabatic theorem states that this is reasonable as long as nuclei move

slowly Thus the adiabatic theorem allows us to write

119864 = 119879 + 119881 (119877)

where 119881 (119877) is the ground state energy of the electrons at nuclear configura-

tion 119877 This energy is a usual classical mechanical energy and the Newton

equations of motion apply

119872 = 119865 = minus120597

120597119877 119881 (119877)

We see that the adiabatic theorem allows us to consider the nuclei as moving

in a potential well which is essentially the eigenvalues of the electrons This in

essence is the BO approximation when the nuclei are classical

Electron Density Functional Theory Page 12

copy Roi Baer

v The Born-Oppenheimer approximation in quantum nuclear case

The classical approach motivates a quantum treatment We are expecting that

nuclei will not excite electrons very efficiently That is the motivation for the

BO approximation

The Born and Oppenheimer development is similar to that of the adiabatic

theorem although there are no ldquoexternalrdquo fields Suppose we have a system of

fast particles with coordinates 119903 = (119903 119903 hellip ) and slow particles with coordi-

nates 119877 = (119877 119877 hellip ) The Hamiltonian can be written as

= + + 119881(119903 119877)

The Schroumldinger equation is

120595 (119903 119877) = 119864 120595 (119903 119877) (Id4)

Note that we can assume these wave functions are orthogonal

⟨120595 |120595 ⟩ = intint120595 (119903 119877)120595 (119903 119877)119889119903 119889119877 = 120575

Now to proceed let us first consider the fast r-Hamiltonian

[119877] = + 119881(119903 119877) (Id5)

In this ldquofast Hamiltonianrdquo [119877] the slow variables are simply parameters (it

contains no derivatives with respect to 119877) Thus it depends on 119877 parametrical-

ly The fast eigenvalue problem is

[119877]120601 (119903 119877) = 119882 (119877)120601 (119903 119877)

The eigenvalues are functions of the parameters 119877 They are called the adia-

batic (or Born-Oppenheimer) surfaces The notation with the semicolon be-

tween 119903 and 119877 is designed to emphasize that 120601 are wave functions in 119903 but

they depend only parametrically on 119877 This means for example that the over-

lap of the fast eigenstates (called adiabatic states) involves integration only

Electron Density Functional Theory Page 13

copy Roi Baer

over 119903 the dynamical coordinates while 119877 is held fixed since it is only a pa-

rameter

⟨120601 (119877)|120601 (119877)⟩ = int120601 (119903 119877)120601 (119903 119877)119889119903 = 120575

We cannot in fact say anything really meaningful about ⟨120601 (119877)|120601 (119877 )⟩ when

119877 ne 119877 (except when 119877 minus 119877 is infinitesimal but we will not go into this issue

here) Now we can expand the ldquorealrdquo wave-function as a linear combination

of the adiabatic functions

120595 (119903 119877) = sum120601 (119903 119877)Φ (119877)

(Id6)

We can do this because for any give 119877 120601 (119903 119877) span the space of wave func-

tions dependent on 119903 In fact the expansion coefficients Φ (119877) are given by

Φ (119877) = int120601 (119903 119877)120595 (119903 119877)119889119903

Now let us plug Eq lrm(Id6) into the SE lrm(Id4)

119864 120595 (119903 119877) = ( + )120595 (119903 119877)

= sum [120601 (119903 119877)Φ (119877)]

+ sum119882 (119877)120601 (119903 119877)Φ (119877)

(Id7)

Note that since = sum

where = minus119894ℏ

We have

[120601 (119903 119877)Φ (119877)]

= Φ (119877) [120601 (119903 119877)]

+ sum1

119872 ( 120601 (119903 119877)) ( Φ (119877))

+ 120601 (119903 119877) Φ (119877)

Multiplying Eq lrm(Id7) by 120601 (119903 119877) and integrating over 119903 gives

Electron Density Functional Theory Page 14

copy Roi Baer

sum(A + )Φ (119877)

+ ( + 119882 (119877))Φ (119877) = 119864 Φ (119877) (Id8)

Where

A = int120601 (119903 119877) [ 120601 (119903 119877)]119889119903 = ⟨120601 | 120601 ⟩

= sum

int120601 (119903 119877) [ 120601 (119903 119877)]119889119903

119872

= sum⟨120601 | 120601 ⟩

119872

= minussumℏ

119872 ⟨120601 |

120597

120597119877 120601 ⟩

120597

120597119877

= minussumℏ

119872 120591

120597

120597119877

The matrices 120591 = ⟨120601 |

120601 ⟩ are the non-adiabatic couplings in the ldquofastrdquo

system These are exactly the non-adiabatic coefficients in the adiabatic theory

(Eq lrm(Id3))

It is possible to show that

sum(A + )Φ (119877)

+ Φ (119877)

= sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

Thus Eq lrm(Id8) becomes

sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

This is a Schroumldinger ndashlike equation which determines the coefficients Φ (119877)

These are called the ldquoslow eigenfunctionsrdquo Once they are computed one has

an exact solution to the Schroumldinger equation However we do not really

want to solve this infinite set of coupled differential equations Thus we as-

sume that the quantities 120591 for 119895 ne 119896 can be neglected Note that here there is

no which can be taken as small as needed to make the effect of 120591 as mi-

Electron Density Functional Theory Page 15

copy Roi Baer

nute as we need Still we can hope that the fact that we chose 119877 to be slow de-

grees of freedom allow us to make just this approximation This was the idea

of Born and Oppenheimer who neglected 120591

summinusℏ

2119872

120597

120597119877 Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

The resulting equation is a Schroumldinger equation for the slow degrees of free-

dom which move under the force of electrons derived from a potential 119882 (119877)

When applied to molecules the slow degrees of freedom are usually the nuclei

and the fast - the electrons There is a problem with neglecting 120591 because of

their non-dynamical effect Taking them into account results in treating them

as a magnetic field

sumℏ

2119872 (119894

120597

120597119877 minus 120591 )

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877) (Id9)

The BO approximation breaks the molecular SE into two consecutive tasks

First the electronic SE Eq lrm(Id5) must be solved for any relevant clamped po-

sition of nuclei 119877 Then the nuclear equation lrm(Id9)

Further reading on this subject M Baer Beyond Born-Oppenheimer elec-

tronic non-adiabatic coupling terms and conical intersections (Wiley Hobo-

ken NJ 2006)

e) Electron correlation

vi The electronic wave function of two non-interacting electrons

We have stressed that the electronic structure problem is difficult because of

the complexity of the many-body wave function when electrons interact In

order to appreciate this complexity of the electronic wave function let us first

study a simple system of two non-interacting electrons in a 1D atomic well

Electron Density Functional Theory Page 16

copy Roi Baer

We consider an atomic well given by the potential 119907 (119909) and we place in it

an electron The Hamiltonian is

ℎ = minusℏ

2119898 nabla + 119907 (119909)

(Ie1)

Now consider a 2-electron problem Assume we have two electrons Fermi-

ons which are non-interacting in the well The Hamiltonian is

= ℎ(1) + ℎ(2) (Ie2)

The notation ℎ(119894) means the Hamiltonian of Eq lrm(Ie1) with the coordinate of

the i-th electron

What are the eigenstates in this case First since each electron can have a

spin we must decide on the spin of the state For now let us assume the state

is spin-polarized ie that the total spine is 1 both electrons are in spin-up or-

bitals We try the following form as a wave function

Ψ(119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]120572(1)120572(2) (Ie3)

Notice that the spatial part is anti-symmetric while the spin part is symmetric

This renders the entire wave-function anti-symmetric in accordance with the

Pauli principle The notation 120572(1)120572(2) means both electrons have spin projec-

tion up (120572) We do not yet know if and under what conditions this wave func-

tion can actually describe eigenstates of the two electrons the

an lrm(Ie2) We assume that 120595 (119909) and 120595 (119909) are orthonormal This makes the

state normalized since

Electron Density Functional Theory Page 17

copy Roi Baer

intΨ(119909 119909 ) 119889119909 119889119909 =

1

2int(120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 ))

119889119909 119889119909

=1

2int [(120595 (119909 )120595 (119909 ))

+ (120595 (119909 )120595 (119909 ))

minus 2120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 )] 119889119909 119889119909

(Ie4)

The first and second terms are both equal to

int(120595 (119909 )120595 (119909 )) 119889119909 119889119909 =int120595 (119909 )

119889119909 int120595 (119909 ) 119889119909 =1 times 1

= 1 (Ie5)

The third term in lrm(Ie4) is

int120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 ) 119889119909 119889119909

=int120595 (119909 )120595 (119909 ) 119889119909 int120595 (119909 )120595 (119909 ) 119889119909 = 0 times 0 = 0 (Ie6)

Indeed the entire wave function is orthonormal (thanks to the factor 1radic2

in lrm(Ie3))

Now let us see under what condition the wave function in Eq (Ie3) is an ei-

genstate of the Hamiltonian in lrm(Ie2)

Ψ(119909 119909 ) = (ℎ(1) + ℎ(2))Ψ(119909 119909 )

=1

radic2120595 (119909 )ℎ(1)120595 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

minus1

radic2120595 (119909 )ℎ(1)120601 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

(Ie7)

This should be equated to

119864Ψ(119909 119909 ) =1

radic2119864120595 (119909 )120595 (119909 ) minus 119864120595 (119909 )120595 (119909 ) (Ie8)

If we choose the orbitals 120595 (119909) and 120595 (119909) to be eigenstates of h (which are

orthogonal so that is compatible with our previous assumption)

Electron Density Functional Theory Page 18

copy Roi Baer

ℎ120595 (119909) = 120598 120595 (119909) 119894 = 01 hellip (Ie9)

Thus

Ψ(119909 119909 ) =1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

minus1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

= (120598 + 120598 )Ψ(119909 119909 )

(Ie10)

And we see that indeed Ψ(x x ) is an eigenstate with energy E = ϵ + ϵ

vii Correlation in action a wave function of 2 interacting particles in an

Harmonic trap

We now build a simple electronic structure model that will allow us to study

in some detail the most basic concepts For this we suppose that the electrons

are in a harmonic atom that is the potential well

119907 (119909) =1

2120583120596 119909 (Ie11)

The two lowest eigenstates of a Harmonic oscillator are

120595 (119909) = 119873 119890

120595 (119909) = 119873 radic120583120596

ℏ119909119890

(Ie12)

The normalization constants are

119873 = (120583120596

120587ℏ)

119873 = (4120583120596

120587ℏ)

(Ie13)

And the eigenvalues are

120598 = (119899 +1

2)ℏ120596 119899 = 01 hellip (Ie14)

The groundstate energy of the 2-electron system in the triplet state describes

one electron in 120595 (119909) and another in 120595 (119909)

Electron Density Functional Theory Page 19

copy Roi Baer

119864 = 120598 + 120598 = 2ℏ120596 (Ie15)

As discussed above singlet and triplet two-electron ground state wave func-

tions composed of two orbitals must be space-symmetric or antisymmetric

respectively We consider below 3 wave functions The first Ψ is the ground

state singlet where both electrons are in 120595 The second and third are a singlet

and a triplet made from one electron in 120595 and the other in 120595

Ψ (119909 119909 ) = 120595 (119909 )120595 (119909 ) = 119873 119890

ℏ(

)

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) + 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 + 119909 )

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 minus 119909 )

(Ie16)

The normalization factors are

119873 = 119873 = radic

120583120596

120587ℏ 119873 =

1

radic2radic120583120596

ℏ119873 119873 =

120583120596

120587ℏ

(Ie17)

Eq lrm(Ie16) describes the distribution of positions of both electrons in their

corresponding states We now want to ask how much are the electrons in this

state aware of each other Do they correlate their motion in some way One

way to measure correlation is to consider the quantity lang119909 119909 rang minus lang119909 ranglang119909 rang If

electrons are completely unaware of each other this quantity called the posi-

tion-autocorrelation function is zero because then the average of the product

of their position must decompose to the product of the average Any deviance

from zero indicates some degree of correlation For the triplet wave function

in Eq lrm(Ie16) we find

Electron Density Functional Theory Page 20

copy Roi Baer

lang119909 rang = ⟨Ψ |119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩)

=1

2(⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩) = 0

(Ie18)

Of-course the same result would be obtained if we calculate ⟨119909 ⟩ because the

electrons are equivalent Furthermore

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩)

= minus|⟨120595 |119909|120595 ⟩| = minus

1

2

120583120596

(Ie19)

We thus find that

lang119909 119909 rang minus lang119909 ranglang119909 rang = minus1

2

120583120596

(Ie20)

The position autocorrelation functional is non-zero even though the electrons

are non-interacting The correlation is due to the Pauli principle It is negative

because the Pauli principle pushes the electrons apart and since most of the

time they both want to be near the origin this gives preference to them being

placed on opposite sides (when one electron has positive x coordinate the

other has negative and vice versa) Letrsquos see what happens in the singlet wave

function Ψ Here too lang119909 rang = lang119909 rang = 0 Then

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ = ⟨120595 120595 |119909 119909 |120595 120595 ⟩

= ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ = 0 (Ie21)

Electron Density Functional Theory Page 21

copy Roi Baer

The singlet ground state shows no correlation because the Pauli principle is

applied here to the spins not the spatial part of the wave function Note how-

ever this does not mean that all singlet wave functions of 2 electrons have no

correlation Indeed this is the situation in Ψ where the development is

very similar to Eq lrm(Ie19) except that the minus sign is now a plus sign so

lang119909 119909 rang minus lang119909 ranglang119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ minus 0

=1

2⟨120595 120595 + 120595 120595 |119909 119909 |120595 120595 + 120595 120595 ⟩ = |⟨120595 |119909|120595 ⟩|

=1

2

120583120596

(Ie22)

Here we find positive autocorrelation indicative of the fact that spin-opposite

non-interacting pairs of electrons want to stick together (when one has posi-

tive x the other wants to have as well and when one has negative the other

one wants negative as well) like non-interacting bosons

Since there is no interaction between the electrons the correlation in these

wave functions arises only from the Pauli principle ie because we impose

the fact that electrons are Fermions This is called Fermi correlation Our les-

son is this

1) Wavefunctions that are mere products of singe-particle orbitals have no correlation 2) If the products are symmetrized like in the case of the excited singlet the correlation

lang119909 119909 rang is positive indicating that the particles ldquolike to be togetherrdquo ie both on the right or both on the left of the origin

3) If the products are anti-symmetrized like in the case of the triplet the correlation lang119909 119909 rang is negative indicating that the particles ldquolike to be separatedrdquo

Up to now we assumed no e-e interaction So now letrsquos include it and add to

the Hamiltonian an interaction term

= ℎ(1) + ℎ(2) + 119881(119909 119909 ) (Ie23)

Electron Density Functional Theory Page 22

copy Roi Baer

Let us take one case of coupling which is simple enough to yield to analytical

analysis

119881(119909 119909 ) = 120583120574 119909 119909 (Ie24)

With 120574 lt 120596 In this case the Hamiltonian is

=119901

2120583+

119901

2120583+

1

2120583120596 119909

+1

2120583120596 119909

+ 120583120574 119909 119909 (Ie25)

The interaction seems strange at first sight because it does not seem to depend

on the distance between the particles as we are used to from electrostatics

yet it does describe a repulsion We can see this by writing the Hamiltonian

as

=119901

2120583+

119901

2120583+

1

2120583(120596 + 120574 )119909

+1

2120583(120596 + 120574 )119909

minus1

2120583120574 (119909 minus 119909 )

(Ie26)

We now see that the Hamiltonian describes two particles in a Harmonic po-

tential 119881 (119909) =

120583(120596 + 120574 )119909 interacting by a Harmonic potential

minus

120583120574 (119909 minus 119909 )

The potential drops as |119909 minus 119909 | grows ie it is repulsive

Exercise lrmI-1

Find the eigenvalues and eigenfunctions of this Hamiltonian (You can define

new coordinates 119883 and 119909 by

radic = 119909 and

radic = 119909 )

Exercise lrmI-2

Write the ground-state energy and wave function for the triplet state of the

system in the previous exercise Determine the effect of interaction on the en-

ergy by calculating 119903 =

and on the correlation function 119888 = lang119909 119909 rang minus

lang119909 ranglang119909 rang

Solution The result is shown in Figure lrmI-2 (left panel)

Electron Density Functional Theory Page 23

copy Roi Baer

Figure lrmI-2 (Left panel) The ratio = (Right panel) the correlation ratio ( ) (

)

vs interaction strengths where = (

)

f) The electron density is a much simpler object

than the wave function

The complexity of the electronic wave function is overwhelming It includes

all the information we can have on the electrons in a molecule in a certain

state However all these intricacies and details are often uninteresting for us

in many cases we simply have no direct use for them Take for instance the

electronic energy - our primary target in the Born-Oppenheimer picture It

depends only on the relative distance of pairs of particles This is because the

electron-electron Coulomb repulsion is a pair-wise interaction

One interesting quantity besides energy that can be extracted from the elec-

tronic wave function is the electron density 119899(119955) This 3D function tells us the

expectation value of the density of electrons That is 119899(119955)119889 119903 is the expectation

value of the number of electrons at a small volume 119889 119903 around point r Thus

we can write

119899(119955) = ⟨120595|(119955)|120595⟩ (If1)

We use the notation

00 01 02 03 04 05

04

05

06

07

08

09

10

r

00 01 02 03 04 050

5

10

15

cc

2

Electron Density Functional Theory Page 24

copy Roi Baer

⟨120595|120601⟩ equiv int120595 (119955 119955 hellip 119955 )120601(119955 119955 hellip 119955

)119889 119903 119889 119903 hellip119889 119903

(If2)

Here (119955) is the operator corresponding to the electron number density Since

electrons are point particles and the position operator for the ith electron is 119955

this operator is defined by

(119955) = sum120575( minus 119955)

(If3)

We used the definition of a 120575-function according to which

int120575(119955 minus 119955)119891(119955 )119889 119903 = 119891(119955)

(If4)

The function 120575( minus 119955) is the density of electron 119894 at 119955

Exercise lrmI-3

Calculate the expectation value of the electron number density and show that

119899(119955) = 119873 int120595(119955 119955 hellip 119955 )120595(119955 119955 hellip 119955

)119889 119903 hellip119889 119903

(If5)

Looking at Eq lrm(If5) we see that 119899(119955) involves integrating out a huge amount

of wave-function details It is as if only the data concerning the density distri-

bution of a single electron remains This is multiplied by 119873 in Eq lrm(If5) so

119899(119955) accounts for the combined density of all electrons Indeed integrating

over the entire space one obtains from lrm(If5)

int119899(119955)119889 119903 = 119873 (If6)

Expressing the fact that the total number of electrons is 119873

Exercise lrmI-4

Calculate and plot the 1D electron density of the triplet ground state from Ex-

ercise lrmI-2 as a function of the interaction parameter

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 9: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 9

copy Roi Baer

We see that the TDNACs depend on the velocity of the nuclei When nuclei

are small the couplings out of the ground state are small The 120591 are called

the ldquonon-adiabatic couplings (NACs)rdquo We find that as long as 119864 minus 119864 are not

zero on the trajectory one can always find small enough that makes the

TDNACs as small as we wish All we need for states 119899 and 119898 to stay decou-

pled is for the following conditions to be met

|120591 | = | (119905)120591 (119877 (119905))| ≪ 1

Thus we can make the NACs as small as we wish by taking the rate (119905) to

be small enough So all that is left in Eq lrm(Id1) is = minus120591 119886 Define the

non-dynamical phase as 120579 (119905) = 119894 int 120591 (119905 ) 119889119905

(note 120579 is real) and then

119886 (119905) = 119890 ( ) (119904119897119900119908 119901119903119900119888119890119904119904119890119904 = 119886119889119894119886119887119886119905119894119888 119901119903119900119888119890119904119904)

The total state is then

120601(119905) = 119886 (119905)119890

ℏint ( ) 120595 [119877(119905)] = 119890 ( ( ) ( ))120595 [119877(119905)]

where 120579 (119905) = minus

ℏint 119864 (120591)119889120591

is called the dynamical phase It is easy to prove

that if 119877(119905) traverses a closed loop the non-dynamical phase depends only on

that loop and not on the way it is traversed For example if we traverse the

same loop using different velocities the dynamic phase may change but the

geometric phase will not The closed loop geometric angle is called the Berry

phase The independence of path is a result of the fact that the non-dynamical

phase is a line integral and can be made with no reference to time

Electron Density Functional Theory Page 10

copy Roi Baer

120579 (119905) = 119894 int ⟨120595 (119877[120591])| (119877[120591])⟩119889120591

= 119894 int sum⟨120595 [119877]|120597

120597119877 120595 [119877]⟩

119889120591

= 119894 int sum120591 (119877)

119889119877

( )

( )

The dynamical phase for example is not a line integral and its value depends

not only on the path itself but also on the velocity taken along the way This

observation makes the non-dynamical phase a special quantity Berry has

shown that this quantity can give us information on the way the Hamiltonian

is dependent on its parameters For a real Hamiltonian for example 119890

around a closed path always equals either 1 or -1 If there is an even number

of degeneracies enclosed by the path it is 1 and if an odd number ndash it is -1

iv Motivation for the Born-Oppenheimer approximation classical nuclei

In a molecule we can think of the nuclei (having coordinates 119877) as heavy and

therefore classical particles which are slowly moving The electrons are light

and therefore quantum They have coordinates 119903 (119903 includes spin) and they

119877

119877

A path in parameter

ldquospacerdquo 119877 minus 119877

Electron Density Functional Theory Page 11

copy Roi Baer

feel the effect of slowly moving nuclei The electron dynamics is controlled by

the Hamiltonian

[119877] = + 119881(119903 119877)

The potential 119881(119903 119877) describes the Coulombic interactions between electrons

and nuclei (in atomic units)

119881(119903 119877) = minussum119885

| minus 119903 |

+1

2sum

1

119903

+1

2sum

119885 119885

119877

In general one wants to assume that the total energy of the molecule in this

classical approximation is

119864 = 119879 + ⟨120595 [119877]| [119877(119905)]|120595 [119877]⟩

Where 120595 [119877] is the ground state of the electrons at nuclear configuration 119877

The adiabatic theorem states that this is reasonable as long as nuclei move

slowly Thus the adiabatic theorem allows us to write

119864 = 119879 + 119881 (119877)

where 119881 (119877) is the ground state energy of the electrons at nuclear configura-

tion 119877 This energy is a usual classical mechanical energy and the Newton

equations of motion apply

119872 = 119865 = minus120597

120597119877 119881 (119877)

We see that the adiabatic theorem allows us to consider the nuclei as moving

in a potential well which is essentially the eigenvalues of the electrons This in

essence is the BO approximation when the nuclei are classical

Electron Density Functional Theory Page 12

copy Roi Baer

v The Born-Oppenheimer approximation in quantum nuclear case

The classical approach motivates a quantum treatment We are expecting that

nuclei will not excite electrons very efficiently That is the motivation for the

BO approximation

The Born and Oppenheimer development is similar to that of the adiabatic

theorem although there are no ldquoexternalrdquo fields Suppose we have a system of

fast particles with coordinates 119903 = (119903 119903 hellip ) and slow particles with coordi-

nates 119877 = (119877 119877 hellip ) The Hamiltonian can be written as

= + + 119881(119903 119877)

The Schroumldinger equation is

120595 (119903 119877) = 119864 120595 (119903 119877) (Id4)

Note that we can assume these wave functions are orthogonal

⟨120595 |120595 ⟩ = intint120595 (119903 119877)120595 (119903 119877)119889119903 119889119877 = 120575

Now to proceed let us first consider the fast r-Hamiltonian

[119877] = + 119881(119903 119877) (Id5)

In this ldquofast Hamiltonianrdquo [119877] the slow variables are simply parameters (it

contains no derivatives with respect to 119877) Thus it depends on 119877 parametrical-

ly The fast eigenvalue problem is

[119877]120601 (119903 119877) = 119882 (119877)120601 (119903 119877)

The eigenvalues are functions of the parameters 119877 They are called the adia-

batic (or Born-Oppenheimer) surfaces The notation with the semicolon be-

tween 119903 and 119877 is designed to emphasize that 120601 are wave functions in 119903 but

they depend only parametrically on 119877 This means for example that the over-

lap of the fast eigenstates (called adiabatic states) involves integration only

Electron Density Functional Theory Page 13

copy Roi Baer

over 119903 the dynamical coordinates while 119877 is held fixed since it is only a pa-

rameter

⟨120601 (119877)|120601 (119877)⟩ = int120601 (119903 119877)120601 (119903 119877)119889119903 = 120575

We cannot in fact say anything really meaningful about ⟨120601 (119877)|120601 (119877 )⟩ when

119877 ne 119877 (except when 119877 minus 119877 is infinitesimal but we will not go into this issue

here) Now we can expand the ldquorealrdquo wave-function as a linear combination

of the adiabatic functions

120595 (119903 119877) = sum120601 (119903 119877)Φ (119877)

(Id6)

We can do this because for any give 119877 120601 (119903 119877) span the space of wave func-

tions dependent on 119903 In fact the expansion coefficients Φ (119877) are given by

Φ (119877) = int120601 (119903 119877)120595 (119903 119877)119889119903

Now let us plug Eq lrm(Id6) into the SE lrm(Id4)

119864 120595 (119903 119877) = ( + )120595 (119903 119877)

= sum [120601 (119903 119877)Φ (119877)]

+ sum119882 (119877)120601 (119903 119877)Φ (119877)

(Id7)

Note that since = sum

where = minus119894ℏ

We have

[120601 (119903 119877)Φ (119877)]

= Φ (119877) [120601 (119903 119877)]

+ sum1

119872 ( 120601 (119903 119877)) ( Φ (119877))

+ 120601 (119903 119877) Φ (119877)

Multiplying Eq lrm(Id7) by 120601 (119903 119877) and integrating over 119903 gives

Electron Density Functional Theory Page 14

copy Roi Baer

sum(A + )Φ (119877)

+ ( + 119882 (119877))Φ (119877) = 119864 Φ (119877) (Id8)

Where

A = int120601 (119903 119877) [ 120601 (119903 119877)]119889119903 = ⟨120601 | 120601 ⟩

= sum

int120601 (119903 119877) [ 120601 (119903 119877)]119889119903

119872

= sum⟨120601 | 120601 ⟩

119872

= minussumℏ

119872 ⟨120601 |

120597

120597119877 120601 ⟩

120597

120597119877

= minussumℏ

119872 120591

120597

120597119877

The matrices 120591 = ⟨120601 |

120601 ⟩ are the non-adiabatic couplings in the ldquofastrdquo

system These are exactly the non-adiabatic coefficients in the adiabatic theory

(Eq lrm(Id3))

It is possible to show that

sum(A + )Φ (119877)

+ Φ (119877)

= sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

Thus Eq lrm(Id8) becomes

sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

This is a Schroumldinger ndashlike equation which determines the coefficients Φ (119877)

These are called the ldquoslow eigenfunctionsrdquo Once they are computed one has

an exact solution to the Schroumldinger equation However we do not really

want to solve this infinite set of coupled differential equations Thus we as-

sume that the quantities 120591 for 119895 ne 119896 can be neglected Note that here there is

no which can be taken as small as needed to make the effect of 120591 as mi-

Electron Density Functional Theory Page 15

copy Roi Baer

nute as we need Still we can hope that the fact that we chose 119877 to be slow de-

grees of freedom allow us to make just this approximation This was the idea

of Born and Oppenheimer who neglected 120591

summinusℏ

2119872

120597

120597119877 Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

The resulting equation is a Schroumldinger equation for the slow degrees of free-

dom which move under the force of electrons derived from a potential 119882 (119877)

When applied to molecules the slow degrees of freedom are usually the nuclei

and the fast - the electrons There is a problem with neglecting 120591 because of

their non-dynamical effect Taking them into account results in treating them

as a magnetic field

sumℏ

2119872 (119894

120597

120597119877 minus 120591 )

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877) (Id9)

The BO approximation breaks the molecular SE into two consecutive tasks

First the electronic SE Eq lrm(Id5) must be solved for any relevant clamped po-

sition of nuclei 119877 Then the nuclear equation lrm(Id9)

Further reading on this subject M Baer Beyond Born-Oppenheimer elec-

tronic non-adiabatic coupling terms and conical intersections (Wiley Hobo-

ken NJ 2006)

e) Electron correlation

vi The electronic wave function of two non-interacting electrons

We have stressed that the electronic structure problem is difficult because of

the complexity of the many-body wave function when electrons interact In

order to appreciate this complexity of the electronic wave function let us first

study a simple system of two non-interacting electrons in a 1D atomic well

Electron Density Functional Theory Page 16

copy Roi Baer

We consider an atomic well given by the potential 119907 (119909) and we place in it

an electron The Hamiltonian is

ℎ = minusℏ

2119898 nabla + 119907 (119909)

(Ie1)

Now consider a 2-electron problem Assume we have two electrons Fermi-

ons which are non-interacting in the well The Hamiltonian is

= ℎ(1) + ℎ(2) (Ie2)

The notation ℎ(119894) means the Hamiltonian of Eq lrm(Ie1) with the coordinate of

the i-th electron

What are the eigenstates in this case First since each electron can have a

spin we must decide on the spin of the state For now let us assume the state

is spin-polarized ie that the total spine is 1 both electrons are in spin-up or-

bitals We try the following form as a wave function

Ψ(119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]120572(1)120572(2) (Ie3)

Notice that the spatial part is anti-symmetric while the spin part is symmetric

This renders the entire wave-function anti-symmetric in accordance with the

Pauli principle The notation 120572(1)120572(2) means both electrons have spin projec-

tion up (120572) We do not yet know if and under what conditions this wave func-

tion can actually describe eigenstates of the two electrons the

an lrm(Ie2) We assume that 120595 (119909) and 120595 (119909) are orthonormal This makes the

state normalized since

Electron Density Functional Theory Page 17

copy Roi Baer

intΨ(119909 119909 ) 119889119909 119889119909 =

1

2int(120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 ))

119889119909 119889119909

=1

2int [(120595 (119909 )120595 (119909 ))

+ (120595 (119909 )120595 (119909 ))

minus 2120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 )] 119889119909 119889119909

(Ie4)

The first and second terms are both equal to

int(120595 (119909 )120595 (119909 )) 119889119909 119889119909 =int120595 (119909 )

119889119909 int120595 (119909 ) 119889119909 =1 times 1

= 1 (Ie5)

The third term in lrm(Ie4) is

int120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 ) 119889119909 119889119909

=int120595 (119909 )120595 (119909 ) 119889119909 int120595 (119909 )120595 (119909 ) 119889119909 = 0 times 0 = 0 (Ie6)

Indeed the entire wave function is orthonormal (thanks to the factor 1radic2

in lrm(Ie3))

Now let us see under what condition the wave function in Eq (Ie3) is an ei-

genstate of the Hamiltonian in lrm(Ie2)

Ψ(119909 119909 ) = (ℎ(1) + ℎ(2))Ψ(119909 119909 )

=1

radic2120595 (119909 )ℎ(1)120595 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

minus1

radic2120595 (119909 )ℎ(1)120601 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

(Ie7)

This should be equated to

119864Ψ(119909 119909 ) =1

radic2119864120595 (119909 )120595 (119909 ) minus 119864120595 (119909 )120595 (119909 ) (Ie8)

If we choose the orbitals 120595 (119909) and 120595 (119909) to be eigenstates of h (which are

orthogonal so that is compatible with our previous assumption)

Electron Density Functional Theory Page 18

copy Roi Baer

ℎ120595 (119909) = 120598 120595 (119909) 119894 = 01 hellip (Ie9)

Thus

Ψ(119909 119909 ) =1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

minus1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

= (120598 + 120598 )Ψ(119909 119909 )

(Ie10)

And we see that indeed Ψ(x x ) is an eigenstate with energy E = ϵ + ϵ

vii Correlation in action a wave function of 2 interacting particles in an

Harmonic trap

We now build a simple electronic structure model that will allow us to study

in some detail the most basic concepts For this we suppose that the electrons

are in a harmonic atom that is the potential well

119907 (119909) =1

2120583120596 119909 (Ie11)

The two lowest eigenstates of a Harmonic oscillator are

120595 (119909) = 119873 119890

120595 (119909) = 119873 radic120583120596

ℏ119909119890

(Ie12)

The normalization constants are

119873 = (120583120596

120587ℏ)

119873 = (4120583120596

120587ℏ)

(Ie13)

And the eigenvalues are

120598 = (119899 +1

2)ℏ120596 119899 = 01 hellip (Ie14)

The groundstate energy of the 2-electron system in the triplet state describes

one electron in 120595 (119909) and another in 120595 (119909)

Electron Density Functional Theory Page 19

copy Roi Baer

119864 = 120598 + 120598 = 2ℏ120596 (Ie15)

As discussed above singlet and triplet two-electron ground state wave func-

tions composed of two orbitals must be space-symmetric or antisymmetric

respectively We consider below 3 wave functions The first Ψ is the ground

state singlet where both electrons are in 120595 The second and third are a singlet

and a triplet made from one electron in 120595 and the other in 120595

Ψ (119909 119909 ) = 120595 (119909 )120595 (119909 ) = 119873 119890

ℏ(

)

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) + 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 + 119909 )

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 minus 119909 )

(Ie16)

The normalization factors are

119873 = 119873 = radic

120583120596

120587ℏ 119873 =

1

radic2radic120583120596

ℏ119873 119873 =

120583120596

120587ℏ

(Ie17)

Eq lrm(Ie16) describes the distribution of positions of both electrons in their

corresponding states We now want to ask how much are the electrons in this

state aware of each other Do they correlate their motion in some way One

way to measure correlation is to consider the quantity lang119909 119909 rang minus lang119909 ranglang119909 rang If

electrons are completely unaware of each other this quantity called the posi-

tion-autocorrelation function is zero because then the average of the product

of their position must decompose to the product of the average Any deviance

from zero indicates some degree of correlation For the triplet wave function

in Eq lrm(Ie16) we find

Electron Density Functional Theory Page 20

copy Roi Baer

lang119909 rang = ⟨Ψ |119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩)

=1

2(⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩) = 0

(Ie18)

Of-course the same result would be obtained if we calculate ⟨119909 ⟩ because the

electrons are equivalent Furthermore

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩)

= minus|⟨120595 |119909|120595 ⟩| = minus

1

2

120583120596

(Ie19)

We thus find that

lang119909 119909 rang minus lang119909 ranglang119909 rang = minus1

2

120583120596

(Ie20)

The position autocorrelation functional is non-zero even though the electrons

are non-interacting The correlation is due to the Pauli principle It is negative

because the Pauli principle pushes the electrons apart and since most of the

time they both want to be near the origin this gives preference to them being

placed on opposite sides (when one electron has positive x coordinate the

other has negative and vice versa) Letrsquos see what happens in the singlet wave

function Ψ Here too lang119909 rang = lang119909 rang = 0 Then

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ = ⟨120595 120595 |119909 119909 |120595 120595 ⟩

= ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ = 0 (Ie21)

Electron Density Functional Theory Page 21

copy Roi Baer

The singlet ground state shows no correlation because the Pauli principle is

applied here to the spins not the spatial part of the wave function Note how-

ever this does not mean that all singlet wave functions of 2 electrons have no

correlation Indeed this is the situation in Ψ where the development is

very similar to Eq lrm(Ie19) except that the minus sign is now a plus sign so

lang119909 119909 rang minus lang119909 ranglang119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ minus 0

=1

2⟨120595 120595 + 120595 120595 |119909 119909 |120595 120595 + 120595 120595 ⟩ = |⟨120595 |119909|120595 ⟩|

=1

2

120583120596

(Ie22)

Here we find positive autocorrelation indicative of the fact that spin-opposite

non-interacting pairs of electrons want to stick together (when one has posi-

tive x the other wants to have as well and when one has negative the other

one wants negative as well) like non-interacting bosons

Since there is no interaction between the electrons the correlation in these

wave functions arises only from the Pauli principle ie because we impose

the fact that electrons are Fermions This is called Fermi correlation Our les-

son is this

1) Wavefunctions that are mere products of singe-particle orbitals have no correlation 2) If the products are symmetrized like in the case of the excited singlet the correlation

lang119909 119909 rang is positive indicating that the particles ldquolike to be togetherrdquo ie both on the right or both on the left of the origin

3) If the products are anti-symmetrized like in the case of the triplet the correlation lang119909 119909 rang is negative indicating that the particles ldquolike to be separatedrdquo

Up to now we assumed no e-e interaction So now letrsquos include it and add to

the Hamiltonian an interaction term

= ℎ(1) + ℎ(2) + 119881(119909 119909 ) (Ie23)

Electron Density Functional Theory Page 22

copy Roi Baer

Let us take one case of coupling which is simple enough to yield to analytical

analysis

119881(119909 119909 ) = 120583120574 119909 119909 (Ie24)

With 120574 lt 120596 In this case the Hamiltonian is

=119901

2120583+

119901

2120583+

1

2120583120596 119909

+1

2120583120596 119909

+ 120583120574 119909 119909 (Ie25)

The interaction seems strange at first sight because it does not seem to depend

on the distance between the particles as we are used to from electrostatics

yet it does describe a repulsion We can see this by writing the Hamiltonian

as

=119901

2120583+

119901

2120583+

1

2120583(120596 + 120574 )119909

+1

2120583(120596 + 120574 )119909

minus1

2120583120574 (119909 minus 119909 )

(Ie26)

We now see that the Hamiltonian describes two particles in a Harmonic po-

tential 119881 (119909) =

120583(120596 + 120574 )119909 interacting by a Harmonic potential

minus

120583120574 (119909 minus 119909 )

The potential drops as |119909 minus 119909 | grows ie it is repulsive

Exercise lrmI-1

Find the eigenvalues and eigenfunctions of this Hamiltonian (You can define

new coordinates 119883 and 119909 by

radic = 119909 and

radic = 119909 )

Exercise lrmI-2

Write the ground-state energy and wave function for the triplet state of the

system in the previous exercise Determine the effect of interaction on the en-

ergy by calculating 119903 =

and on the correlation function 119888 = lang119909 119909 rang minus

lang119909 ranglang119909 rang

Solution The result is shown in Figure lrmI-2 (left panel)

Electron Density Functional Theory Page 23

copy Roi Baer

Figure lrmI-2 (Left panel) The ratio = (Right panel) the correlation ratio ( ) (

)

vs interaction strengths where = (

)

f) The electron density is a much simpler object

than the wave function

The complexity of the electronic wave function is overwhelming It includes

all the information we can have on the electrons in a molecule in a certain

state However all these intricacies and details are often uninteresting for us

in many cases we simply have no direct use for them Take for instance the

electronic energy - our primary target in the Born-Oppenheimer picture It

depends only on the relative distance of pairs of particles This is because the

electron-electron Coulomb repulsion is a pair-wise interaction

One interesting quantity besides energy that can be extracted from the elec-

tronic wave function is the electron density 119899(119955) This 3D function tells us the

expectation value of the density of electrons That is 119899(119955)119889 119903 is the expectation

value of the number of electrons at a small volume 119889 119903 around point r Thus

we can write

119899(119955) = ⟨120595|(119955)|120595⟩ (If1)

We use the notation

00 01 02 03 04 05

04

05

06

07

08

09

10

r

00 01 02 03 04 050

5

10

15

cc

2

Electron Density Functional Theory Page 24

copy Roi Baer

⟨120595|120601⟩ equiv int120595 (119955 119955 hellip 119955 )120601(119955 119955 hellip 119955

)119889 119903 119889 119903 hellip119889 119903

(If2)

Here (119955) is the operator corresponding to the electron number density Since

electrons are point particles and the position operator for the ith electron is 119955

this operator is defined by

(119955) = sum120575( minus 119955)

(If3)

We used the definition of a 120575-function according to which

int120575(119955 minus 119955)119891(119955 )119889 119903 = 119891(119955)

(If4)

The function 120575( minus 119955) is the density of electron 119894 at 119955

Exercise lrmI-3

Calculate the expectation value of the electron number density and show that

119899(119955) = 119873 int120595(119955 119955 hellip 119955 )120595(119955 119955 hellip 119955

)119889 119903 hellip119889 119903

(If5)

Looking at Eq lrm(If5) we see that 119899(119955) involves integrating out a huge amount

of wave-function details It is as if only the data concerning the density distri-

bution of a single electron remains This is multiplied by 119873 in Eq lrm(If5) so

119899(119955) accounts for the combined density of all electrons Indeed integrating

over the entire space one obtains from lrm(If5)

int119899(119955)119889 119903 = 119873 (If6)

Expressing the fact that the total number of electrons is 119873

Exercise lrmI-4

Calculate and plot the 1D electron density of the triplet ground state from Ex-

ercise lrmI-2 as a function of the interaction parameter

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 10: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 10

copy Roi Baer

120579 (119905) = 119894 int ⟨120595 (119877[120591])| (119877[120591])⟩119889120591

= 119894 int sum⟨120595 [119877]|120597

120597119877 120595 [119877]⟩

119889120591

= 119894 int sum120591 (119877)

119889119877

( )

( )

The dynamical phase for example is not a line integral and its value depends

not only on the path itself but also on the velocity taken along the way This

observation makes the non-dynamical phase a special quantity Berry has

shown that this quantity can give us information on the way the Hamiltonian

is dependent on its parameters For a real Hamiltonian for example 119890

around a closed path always equals either 1 or -1 If there is an even number

of degeneracies enclosed by the path it is 1 and if an odd number ndash it is -1

iv Motivation for the Born-Oppenheimer approximation classical nuclei

In a molecule we can think of the nuclei (having coordinates 119877) as heavy and

therefore classical particles which are slowly moving The electrons are light

and therefore quantum They have coordinates 119903 (119903 includes spin) and they

119877

119877

A path in parameter

ldquospacerdquo 119877 minus 119877

Electron Density Functional Theory Page 11

copy Roi Baer

feel the effect of slowly moving nuclei The electron dynamics is controlled by

the Hamiltonian

[119877] = + 119881(119903 119877)

The potential 119881(119903 119877) describes the Coulombic interactions between electrons

and nuclei (in atomic units)

119881(119903 119877) = minussum119885

| minus 119903 |

+1

2sum

1

119903

+1

2sum

119885 119885

119877

In general one wants to assume that the total energy of the molecule in this

classical approximation is

119864 = 119879 + ⟨120595 [119877]| [119877(119905)]|120595 [119877]⟩

Where 120595 [119877] is the ground state of the electrons at nuclear configuration 119877

The adiabatic theorem states that this is reasonable as long as nuclei move

slowly Thus the adiabatic theorem allows us to write

119864 = 119879 + 119881 (119877)

where 119881 (119877) is the ground state energy of the electrons at nuclear configura-

tion 119877 This energy is a usual classical mechanical energy and the Newton

equations of motion apply

119872 = 119865 = minus120597

120597119877 119881 (119877)

We see that the adiabatic theorem allows us to consider the nuclei as moving

in a potential well which is essentially the eigenvalues of the electrons This in

essence is the BO approximation when the nuclei are classical

Electron Density Functional Theory Page 12

copy Roi Baer

v The Born-Oppenheimer approximation in quantum nuclear case

The classical approach motivates a quantum treatment We are expecting that

nuclei will not excite electrons very efficiently That is the motivation for the

BO approximation

The Born and Oppenheimer development is similar to that of the adiabatic

theorem although there are no ldquoexternalrdquo fields Suppose we have a system of

fast particles with coordinates 119903 = (119903 119903 hellip ) and slow particles with coordi-

nates 119877 = (119877 119877 hellip ) The Hamiltonian can be written as

= + + 119881(119903 119877)

The Schroumldinger equation is

120595 (119903 119877) = 119864 120595 (119903 119877) (Id4)

Note that we can assume these wave functions are orthogonal

⟨120595 |120595 ⟩ = intint120595 (119903 119877)120595 (119903 119877)119889119903 119889119877 = 120575

Now to proceed let us first consider the fast r-Hamiltonian

[119877] = + 119881(119903 119877) (Id5)

In this ldquofast Hamiltonianrdquo [119877] the slow variables are simply parameters (it

contains no derivatives with respect to 119877) Thus it depends on 119877 parametrical-

ly The fast eigenvalue problem is

[119877]120601 (119903 119877) = 119882 (119877)120601 (119903 119877)

The eigenvalues are functions of the parameters 119877 They are called the adia-

batic (or Born-Oppenheimer) surfaces The notation with the semicolon be-

tween 119903 and 119877 is designed to emphasize that 120601 are wave functions in 119903 but

they depend only parametrically on 119877 This means for example that the over-

lap of the fast eigenstates (called adiabatic states) involves integration only

Electron Density Functional Theory Page 13

copy Roi Baer

over 119903 the dynamical coordinates while 119877 is held fixed since it is only a pa-

rameter

⟨120601 (119877)|120601 (119877)⟩ = int120601 (119903 119877)120601 (119903 119877)119889119903 = 120575

We cannot in fact say anything really meaningful about ⟨120601 (119877)|120601 (119877 )⟩ when

119877 ne 119877 (except when 119877 minus 119877 is infinitesimal but we will not go into this issue

here) Now we can expand the ldquorealrdquo wave-function as a linear combination

of the adiabatic functions

120595 (119903 119877) = sum120601 (119903 119877)Φ (119877)

(Id6)

We can do this because for any give 119877 120601 (119903 119877) span the space of wave func-

tions dependent on 119903 In fact the expansion coefficients Φ (119877) are given by

Φ (119877) = int120601 (119903 119877)120595 (119903 119877)119889119903

Now let us plug Eq lrm(Id6) into the SE lrm(Id4)

119864 120595 (119903 119877) = ( + )120595 (119903 119877)

= sum [120601 (119903 119877)Φ (119877)]

+ sum119882 (119877)120601 (119903 119877)Φ (119877)

(Id7)

Note that since = sum

where = minus119894ℏ

We have

[120601 (119903 119877)Φ (119877)]

= Φ (119877) [120601 (119903 119877)]

+ sum1

119872 ( 120601 (119903 119877)) ( Φ (119877))

+ 120601 (119903 119877) Φ (119877)

Multiplying Eq lrm(Id7) by 120601 (119903 119877) and integrating over 119903 gives

Electron Density Functional Theory Page 14

copy Roi Baer

sum(A + )Φ (119877)

+ ( + 119882 (119877))Φ (119877) = 119864 Φ (119877) (Id8)

Where

A = int120601 (119903 119877) [ 120601 (119903 119877)]119889119903 = ⟨120601 | 120601 ⟩

= sum

int120601 (119903 119877) [ 120601 (119903 119877)]119889119903

119872

= sum⟨120601 | 120601 ⟩

119872

= minussumℏ

119872 ⟨120601 |

120597

120597119877 120601 ⟩

120597

120597119877

= minussumℏ

119872 120591

120597

120597119877

The matrices 120591 = ⟨120601 |

120601 ⟩ are the non-adiabatic couplings in the ldquofastrdquo

system These are exactly the non-adiabatic coefficients in the adiabatic theory

(Eq lrm(Id3))

It is possible to show that

sum(A + )Φ (119877)

+ Φ (119877)

= sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

Thus Eq lrm(Id8) becomes

sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

This is a Schroumldinger ndashlike equation which determines the coefficients Φ (119877)

These are called the ldquoslow eigenfunctionsrdquo Once they are computed one has

an exact solution to the Schroumldinger equation However we do not really

want to solve this infinite set of coupled differential equations Thus we as-

sume that the quantities 120591 for 119895 ne 119896 can be neglected Note that here there is

no which can be taken as small as needed to make the effect of 120591 as mi-

Electron Density Functional Theory Page 15

copy Roi Baer

nute as we need Still we can hope that the fact that we chose 119877 to be slow de-

grees of freedom allow us to make just this approximation This was the idea

of Born and Oppenheimer who neglected 120591

summinusℏ

2119872

120597

120597119877 Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

The resulting equation is a Schroumldinger equation for the slow degrees of free-

dom which move under the force of electrons derived from a potential 119882 (119877)

When applied to molecules the slow degrees of freedom are usually the nuclei

and the fast - the electrons There is a problem with neglecting 120591 because of

their non-dynamical effect Taking them into account results in treating them

as a magnetic field

sumℏ

2119872 (119894

120597

120597119877 minus 120591 )

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877) (Id9)

The BO approximation breaks the molecular SE into two consecutive tasks

First the electronic SE Eq lrm(Id5) must be solved for any relevant clamped po-

sition of nuclei 119877 Then the nuclear equation lrm(Id9)

Further reading on this subject M Baer Beyond Born-Oppenheimer elec-

tronic non-adiabatic coupling terms and conical intersections (Wiley Hobo-

ken NJ 2006)

e) Electron correlation

vi The electronic wave function of two non-interacting electrons

We have stressed that the electronic structure problem is difficult because of

the complexity of the many-body wave function when electrons interact In

order to appreciate this complexity of the electronic wave function let us first

study a simple system of two non-interacting electrons in a 1D atomic well

Electron Density Functional Theory Page 16

copy Roi Baer

We consider an atomic well given by the potential 119907 (119909) and we place in it

an electron The Hamiltonian is

ℎ = minusℏ

2119898 nabla + 119907 (119909)

(Ie1)

Now consider a 2-electron problem Assume we have two electrons Fermi-

ons which are non-interacting in the well The Hamiltonian is

= ℎ(1) + ℎ(2) (Ie2)

The notation ℎ(119894) means the Hamiltonian of Eq lrm(Ie1) with the coordinate of

the i-th electron

What are the eigenstates in this case First since each electron can have a

spin we must decide on the spin of the state For now let us assume the state

is spin-polarized ie that the total spine is 1 both electrons are in spin-up or-

bitals We try the following form as a wave function

Ψ(119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]120572(1)120572(2) (Ie3)

Notice that the spatial part is anti-symmetric while the spin part is symmetric

This renders the entire wave-function anti-symmetric in accordance with the

Pauli principle The notation 120572(1)120572(2) means both electrons have spin projec-

tion up (120572) We do not yet know if and under what conditions this wave func-

tion can actually describe eigenstates of the two electrons the

an lrm(Ie2) We assume that 120595 (119909) and 120595 (119909) are orthonormal This makes the

state normalized since

Electron Density Functional Theory Page 17

copy Roi Baer

intΨ(119909 119909 ) 119889119909 119889119909 =

1

2int(120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 ))

119889119909 119889119909

=1

2int [(120595 (119909 )120595 (119909 ))

+ (120595 (119909 )120595 (119909 ))

minus 2120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 )] 119889119909 119889119909

(Ie4)

The first and second terms are both equal to

int(120595 (119909 )120595 (119909 )) 119889119909 119889119909 =int120595 (119909 )

119889119909 int120595 (119909 ) 119889119909 =1 times 1

= 1 (Ie5)

The third term in lrm(Ie4) is

int120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 ) 119889119909 119889119909

=int120595 (119909 )120595 (119909 ) 119889119909 int120595 (119909 )120595 (119909 ) 119889119909 = 0 times 0 = 0 (Ie6)

Indeed the entire wave function is orthonormal (thanks to the factor 1radic2

in lrm(Ie3))

Now let us see under what condition the wave function in Eq (Ie3) is an ei-

genstate of the Hamiltonian in lrm(Ie2)

Ψ(119909 119909 ) = (ℎ(1) + ℎ(2))Ψ(119909 119909 )

=1

radic2120595 (119909 )ℎ(1)120595 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

minus1

radic2120595 (119909 )ℎ(1)120601 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

(Ie7)

This should be equated to

119864Ψ(119909 119909 ) =1

radic2119864120595 (119909 )120595 (119909 ) minus 119864120595 (119909 )120595 (119909 ) (Ie8)

If we choose the orbitals 120595 (119909) and 120595 (119909) to be eigenstates of h (which are

orthogonal so that is compatible with our previous assumption)

Electron Density Functional Theory Page 18

copy Roi Baer

ℎ120595 (119909) = 120598 120595 (119909) 119894 = 01 hellip (Ie9)

Thus

Ψ(119909 119909 ) =1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

minus1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

= (120598 + 120598 )Ψ(119909 119909 )

(Ie10)

And we see that indeed Ψ(x x ) is an eigenstate with energy E = ϵ + ϵ

vii Correlation in action a wave function of 2 interacting particles in an

Harmonic trap

We now build a simple electronic structure model that will allow us to study

in some detail the most basic concepts For this we suppose that the electrons

are in a harmonic atom that is the potential well

119907 (119909) =1

2120583120596 119909 (Ie11)

The two lowest eigenstates of a Harmonic oscillator are

120595 (119909) = 119873 119890

120595 (119909) = 119873 radic120583120596

ℏ119909119890

(Ie12)

The normalization constants are

119873 = (120583120596

120587ℏ)

119873 = (4120583120596

120587ℏ)

(Ie13)

And the eigenvalues are

120598 = (119899 +1

2)ℏ120596 119899 = 01 hellip (Ie14)

The groundstate energy of the 2-electron system in the triplet state describes

one electron in 120595 (119909) and another in 120595 (119909)

Electron Density Functional Theory Page 19

copy Roi Baer

119864 = 120598 + 120598 = 2ℏ120596 (Ie15)

As discussed above singlet and triplet two-electron ground state wave func-

tions composed of two orbitals must be space-symmetric or antisymmetric

respectively We consider below 3 wave functions The first Ψ is the ground

state singlet where both electrons are in 120595 The second and third are a singlet

and a triplet made from one electron in 120595 and the other in 120595

Ψ (119909 119909 ) = 120595 (119909 )120595 (119909 ) = 119873 119890

ℏ(

)

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) + 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 + 119909 )

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 minus 119909 )

(Ie16)

The normalization factors are

119873 = 119873 = radic

120583120596

120587ℏ 119873 =

1

radic2radic120583120596

ℏ119873 119873 =

120583120596

120587ℏ

(Ie17)

Eq lrm(Ie16) describes the distribution of positions of both electrons in their

corresponding states We now want to ask how much are the electrons in this

state aware of each other Do they correlate their motion in some way One

way to measure correlation is to consider the quantity lang119909 119909 rang minus lang119909 ranglang119909 rang If

electrons are completely unaware of each other this quantity called the posi-

tion-autocorrelation function is zero because then the average of the product

of their position must decompose to the product of the average Any deviance

from zero indicates some degree of correlation For the triplet wave function

in Eq lrm(Ie16) we find

Electron Density Functional Theory Page 20

copy Roi Baer

lang119909 rang = ⟨Ψ |119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩)

=1

2(⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩) = 0

(Ie18)

Of-course the same result would be obtained if we calculate ⟨119909 ⟩ because the

electrons are equivalent Furthermore

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩)

= minus|⟨120595 |119909|120595 ⟩| = minus

1

2

120583120596

(Ie19)

We thus find that

lang119909 119909 rang minus lang119909 ranglang119909 rang = minus1

2

120583120596

(Ie20)

The position autocorrelation functional is non-zero even though the electrons

are non-interacting The correlation is due to the Pauli principle It is negative

because the Pauli principle pushes the electrons apart and since most of the

time they both want to be near the origin this gives preference to them being

placed on opposite sides (when one electron has positive x coordinate the

other has negative and vice versa) Letrsquos see what happens in the singlet wave

function Ψ Here too lang119909 rang = lang119909 rang = 0 Then

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ = ⟨120595 120595 |119909 119909 |120595 120595 ⟩

= ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ = 0 (Ie21)

Electron Density Functional Theory Page 21

copy Roi Baer

The singlet ground state shows no correlation because the Pauli principle is

applied here to the spins not the spatial part of the wave function Note how-

ever this does not mean that all singlet wave functions of 2 electrons have no

correlation Indeed this is the situation in Ψ where the development is

very similar to Eq lrm(Ie19) except that the minus sign is now a plus sign so

lang119909 119909 rang minus lang119909 ranglang119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ minus 0

=1

2⟨120595 120595 + 120595 120595 |119909 119909 |120595 120595 + 120595 120595 ⟩ = |⟨120595 |119909|120595 ⟩|

=1

2

120583120596

(Ie22)

Here we find positive autocorrelation indicative of the fact that spin-opposite

non-interacting pairs of electrons want to stick together (when one has posi-

tive x the other wants to have as well and when one has negative the other

one wants negative as well) like non-interacting bosons

Since there is no interaction between the electrons the correlation in these

wave functions arises only from the Pauli principle ie because we impose

the fact that electrons are Fermions This is called Fermi correlation Our les-

son is this

1) Wavefunctions that are mere products of singe-particle orbitals have no correlation 2) If the products are symmetrized like in the case of the excited singlet the correlation

lang119909 119909 rang is positive indicating that the particles ldquolike to be togetherrdquo ie both on the right or both on the left of the origin

3) If the products are anti-symmetrized like in the case of the triplet the correlation lang119909 119909 rang is negative indicating that the particles ldquolike to be separatedrdquo

Up to now we assumed no e-e interaction So now letrsquos include it and add to

the Hamiltonian an interaction term

= ℎ(1) + ℎ(2) + 119881(119909 119909 ) (Ie23)

Electron Density Functional Theory Page 22

copy Roi Baer

Let us take one case of coupling which is simple enough to yield to analytical

analysis

119881(119909 119909 ) = 120583120574 119909 119909 (Ie24)

With 120574 lt 120596 In this case the Hamiltonian is

=119901

2120583+

119901

2120583+

1

2120583120596 119909

+1

2120583120596 119909

+ 120583120574 119909 119909 (Ie25)

The interaction seems strange at first sight because it does not seem to depend

on the distance between the particles as we are used to from electrostatics

yet it does describe a repulsion We can see this by writing the Hamiltonian

as

=119901

2120583+

119901

2120583+

1

2120583(120596 + 120574 )119909

+1

2120583(120596 + 120574 )119909

minus1

2120583120574 (119909 minus 119909 )

(Ie26)

We now see that the Hamiltonian describes two particles in a Harmonic po-

tential 119881 (119909) =

120583(120596 + 120574 )119909 interacting by a Harmonic potential

minus

120583120574 (119909 minus 119909 )

The potential drops as |119909 minus 119909 | grows ie it is repulsive

Exercise lrmI-1

Find the eigenvalues and eigenfunctions of this Hamiltonian (You can define

new coordinates 119883 and 119909 by

radic = 119909 and

radic = 119909 )

Exercise lrmI-2

Write the ground-state energy and wave function for the triplet state of the

system in the previous exercise Determine the effect of interaction on the en-

ergy by calculating 119903 =

and on the correlation function 119888 = lang119909 119909 rang minus

lang119909 ranglang119909 rang

Solution The result is shown in Figure lrmI-2 (left panel)

Electron Density Functional Theory Page 23

copy Roi Baer

Figure lrmI-2 (Left panel) The ratio = (Right panel) the correlation ratio ( ) (

)

vs interaction strengths where = (

)

f) The electron density is a much simpler object

than the wave function

The complexity of the electronic wave function is overwhelming It includes

all the information we can have on the electrons in a molecule in a certain

state However all these intricacies and details are often uninteresting for us

in many cases we simply have no direct use for them Take for instance the

electronic energy - our primary target in the Born-Oppenheimer picture It

depends only on the relative distance of pairs of particles This is because the

electron-electron Coulomb repulsion is a pair-wise interaction

One interesting quantity besides energy that can be extracted from the elec-

tronic wave function is the electron density 119899(119955) This 3D function tells us the

expectation value of the density of electrons That is 119899(119955)119889 119903 is the expectation

value of the number of electrons at a small volume 119889 119903 around point r Thus

we can write

119899(119955) = ⟨120595|(119955)|120595⟩ (If1)

We use the notation

00 01 02 03 04 05

04

05

06

07

08

09

10

r

00 01 02 03 04 050

5

10

15

cc

2

Electron Density Functional Theory Page 24

copy Roi Baer

⟨120595|120601⟩ equiv int120595 (119955 119955 hellip 119955 )120601(119955 119955 hellip 119955

)119889 119903 119889 119903 hellip119889 119903

(If2)

Here (119955) is the operator corresponding to the electron number density Since

electrons are point particles and the position operator for the ith electron is 119955

this operator is defined by

(119955) = sum120575( minus 119955)

(If3)

We used the definition of a 120575-function according to which

int120575(119955 minus 119955)119891(119955 )119889 119903 = 119891(119955)

(If4)

The function 120575( minus 119955) is the density of electron 119894 at 119955

Exercise lrmI-3

Calculate the expectation value of the electron number density and show that

119899(119955) = 119873 int120595(119955 119955 hellip 119955 )120595(119955 119955 hellip 119955

)119889 119903 hellip119889 119903

(If5)

Looking at Eq lrm(If5) we see that 119899(119955) involves integrating out a huge amount

of wave-function details It is as if only the data concerning the density distri-

bution of a single electron remains This is multiplied by 119873 in Eq lrm(If5) so

119899(119955) accounts for the combined density of all electrons Indeed integrating

over the entire space one obtains from lrm(If5)

int119899(119955)119889 119903 = 119873 (If6)

Expressing the fact that the total number of electrons is 119873

Exercise lrmI-4

Calculate and plot the 1D electron density of the triplet ground state from Ex-

ercise lrmI-2 as a function of the interaction parameter

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 11: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 11

copy Roi Baer

feel the effect of slowly moving nuclei The electron dynamics is controlled by

the Hamiltonian

[119877] = + 119881(119903 119877)

The potential 119881(119903 119877) describes the Coulombic interactions between electrons

and nuclei (in atomic units)

119881(119903 119877) = minussum119885

| minus 119903 |

+1

2sum

1

119903

+1

2sum

119885 119885

119877

In general one wants to assume that the total energy of the molecule in this

classical approximation is

119864 = 119879 + ⟨120595 [119877]| [119877(119905)]|120595 [119877]⟩

Where 120595 [119877] is the ground state of the electrons at nuclear configuration 119877

The adiabatic theorem states that this is reasonable as long as nuclei move

slowly Thus the adiabatic theorem allows us to write

119864 = 119879 + 119881 (119877)

where 119881 (119877) is the ground state energy of the electrons at nuclear configura-

tion 119877 This energy is a usual classical mechanical energy and the Newton

equations of motion apply

119872 = 119865 = minus120597

120597119877 119881 (119877)

We see that the adiabatic theorem allows us to consider the nuclei as moving

in a potential well which is essentially the eigenvalues of the electrons This in

essence is the BO approximation when the nuclei are classical

Electron Density Functional Theory Page 12

copy Roi Baer

v The Born-Oppenheimer approximation in quantum nuclear case

The classical approach motivates a quantum treatment We are expecting that

nuclei will not excite electrons very efficiently That is the motivation for the

BO approximation

The Born and Oppenheimer development is similar to that of the adiabatic

theorem although there are no ldquoexternalrdquo fields Suppose we have a system of

fast particles with coordinates 119903 = (119903 119903 hellip ) and slow particles with coordi-

nates 119877 = (119877 119877 hellip ) The Hamiltonian can be written as

= + + 119881(119903 119877)

The Schroumldinger equation is

120595 (119903 119877) = 119864 120595 (119903 119877) (Id4)

Note that we can assume these wave functions are orthogonal

⟨120595 |120595 ⟩ = intint120595 (119903 119877)120595 (119903 119877)119889119903 119889119877 = 120575

Now to proceed let us first consider the fast r-Hamiltonian

[119877] = + 119881(119903 119877) (Id5)

In this ldquofast Hamiltonianrdquo [119877] the slow variables are simply parameters (it

contains no derivatives with respect to 119877) Thus it depends on 119877 parametrical-

ly The fast eigenvalue problem is

[119877]120601 (119903 119877) = 119882 (119877)120601 (119903 119877)

The eigenvalues are functions of the parameters 119877 They are called the adia-

batic (or Born-Oppenheimer) surfaces The notation with the semicolon be-

tween 119903 and 119877 is designed to emphasize that 120601 are wave functions in 119903 but

they depend only parametrically on 119877 This means for example that the over-

lap of the fast eigenstates (called adiabatic states) involves integration only

Electron Density Functional Theory Page 13

copy Roi Baer

over 119903 the dynamical coordinates while 119877 is held fixed since it is only a pa-

rameter

⟨120601 (119877)|120601 (119877)⟩ = int120601 (119903 119877)120601 (119903 119877)119889119903 = 120575

We cannot in fact say anything really meaningful about ⟨120601 (119877)|120601 (119877 )⟩ when

119877 ne 119877 (except when 119877 minus 119877 is infinitesimal but we will not go into this issue

here) Now we can expand the ldquorealrdquo wave-function as a linear combination

of the adiabatic functions

120595 (119903 119877) = sum120601 (119903 119877)Φ (119877)

(Id6)

We can do this because for any give 119877 120601 (119903 119877) span the space of wave func-

tions dependent on 119903 In fact the expansion coefficients Φ (119877) are given by

Φ (119877) = int120601 (119903 119877)120595 (119903 119877)119889119903

Now let us plug Eq lrm(Id6) into the SE lrm(Id4)

119864 120595 (119903 119877) = ( + )120595 (119903 119877)

= sum [120601 (119903 119877)Φ (119877)]

+ sum119882 (119877)120601 (119903 119877)Φ (119877)

(Id7)

Note that since = sum

where = minus119894ℏ

We have

[120601 (119903 119877)Φ (119877)]

= Φ (119877) [120601 (119903 119877)]

+ sum1

119872 ( 120601 (119903 119877)) ( Φ (119877))

+ 120601 (119903 119877) Φ (119877)

Multiplying Eq lrm(Id7) by 120601 (119903 119877) and integrating over 119903 gives

Electron Density Functional Theory Page 14

copy Roi Baer

sum(A + )Φ (119877)

+ ( + 119882 (119877))Φ (119877) = 119864 Φ (119877) (Id8)

Where

A = int120601 (119903 119877) [ 120601 (119903 119877)]119889119903 = ⟨120601 | 120601 ⟩

= sum

int120601 (119903 119877) [ 120601 (119903 119877)]119889119903

119872

= sum⟨120601 | 120601 ⟩

119872

= minussumℏ

119872 ⟨120601 |

120597

120597119877 120601 ⟩

120597

120597119877

= minussumℏ

119872 120591

120597

120597119877

The matrices 120591 = ⟨120601 |

120601 ⟩ are the non-adiabatic couplings in the ldquofastrdquo

system These are exactly the non-adiabatic coefficients in the adiabatic theory

(Eq lrm(Id3))

It is possible to show that

sum(A + )Φ (119877)

+ Φ (119877)

= sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

Thus Eq lrm(Id8) becomes

sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

This is a Schroumldinger ndashlike equation which determines the coefficients Φ (119877)

These are called the ldquoslow eigenfunctionsrdquo Once they are computed one has

an exact solution to the Schroumldinger equation However we do not really

want to solve this infinite set of coupled differential equations Thus we as-

sume that the quantities 120591 for 119895 ne 119896 can be neglected Note that here there is

no which can be taken as small as needed to make the effect of 120591 as mi-

Electron Density Functional Theory Page 15

copy Roi Baer

nute as we need Still we can hope that the fact that we chose 119877 to be slow de-

grees of freedom allow us to make just this approximation This was the idea

of Born and Oppenheimer who neglected 120591

summinusℏ

2119872

120597

120597119877 Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

The resulting equation is a Schroumldinger equation for the slow degrees of free-

dom which move under the force of electrons derived from a potential 119882 (119877)

When applied to molecules the slow degrees of freedom are usually the nuclei

and the fast - the electrons There is a problem with neglecting 120591 because of

their non-dynamical effect Taking them into account results in treating them

as a magnetic field

sumℏ

2119872 (119894

120597

120597119877 minus 120591 )

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877) (Id9)

The BO approximation breaks the molecular SE into two consecutive tasks

First the electronic SE Eq lrm(Id5) must be solved for any relevant clamped po-

sition of nuclei 119877 Then the nuclear equation lrm(Id9)

Further reading on this subject M Baer Beyond Born-Oppenheimer elec-

tronic non-adiabatic coupling terms and conical intersections (Wiley Hobo-

ken NJ 2006)

e) Electron correlation

vi The electronic wave function of two non-interacting electrons

We have stressed that the electronic structure problem is difficult because of

the complexity of the many-body wave function when electrons interact In

order to appreciate this complexity of the electronic wave function let us first

study a simple system of two non-interacting electrons in a 1D atomic well

Electron Density Functional Theory Page 16

copy Roi Baer

We consider an atomic well given by the potential 119907 (119909) and we place in it

an electron The Hamiltonian is

ℎ = minusℏ

2119898 nabla + 119907 (119909)

(Ie1)

Now consider a 2-electron problem Assume we have two electrons Fermi-

ons which are non-interacting in the well The Hamiltonian is

= ℎ(1) + ℎ(2) (Ie2)

The notation ℎ(119894) means the Hamiltonian of Eq lrm(Ie1) with the coordinate of

the i-th electron

What are the eigenstates in this case First since each electron can have a

spin we must decide on the spin of the state For now let us assume the state

is spin-polarized ie that the total spine is 1 both electrons are in spin-up or-

bitals We try the following form as a wave function

Ψ(119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]120572(1)120572(2) (Ie3)

Notice that the spatial part is anti-symmetric while the spin part is symmetric

This renders the entire wave-function anti-symmetric in accordance with the

Pauli principle The notation 120572(1)120572(2) means both electrons have spin projec-

tion up (120572) We do not yet know if and under what conditions this wave func-

tion can actually describe eigenstates of the two electrons the

an lrm(Ie2) We assume that 120595 (119909) and 120595 (119909) are orthonormal This makes the

state normalized since

Electron Density Functional Theory Page 17

copy Roi Baer

intΨ(119909 119909 ) 119889119909 119889119909 =

1

2int(120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 ))

119889119909 119889119909

=1

2int [(120595 (119909 )120595 (119909 ))

+ (120595 (119909 )120595 (119909 ))

minus 2120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 )] 119889119909 119889119909

(Ie4)

The first and second terms are both equal to

int(120595 (119909 )120595 (119909 )) 119889119909 119889119909 =int120595 (119909 )

119889119909 int120595 (119909 ) 119889119909 =1 times 1

= 1 (Ie5)

The third term in lrm(Ie4) is

int120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 ) 119889119909 119889119909

=int120595 (119909 )120595 (119909 ) 119889119909 int120595 (119909 )120595 (119909 ) 119889119909 = 0 times 0 = 0 (Ie6)

Indeed the entire wave function is orthonormal (thanks to the factor 1radic2

in lrm(Ie3))

Now let us see under what condition the wave function in Eq (Ie3) is an ei-

genstate of the Hamiltonian in lrm(Ie2)

Ψ(119909 119909 ) = (ℎ(1) + ℎ(2))Ψ(119909 119909 )

=1

radic2120595 (119909 )ℎ(1)120595 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

minus1

radic2120595 (119909 )ℎ(1)120601 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

(Ie7)

This should be equated to

119864Ψ(119909 119909 ) =1

radic2119864120595 (119909 )120595 (119909 ) minus 119864120595 (119909 )120595 (119909 ) (Ie8)

If we choose the orbitals 120595 (119909) and 120595 (119909) to be eigenstates of h (which are

orthogonal so that is compatible with our previous assumption)

Electron Density Functional Theory Page 18

copy Roi Baer

ℎ120595 (119909) = 120598 120595 (119909) 119894 = 01 hellip (Ie9)

Thus

Ψ(119909 119909 ) =1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

minus1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

= (120598 + 120598 )Ψ(119909 119909 )

(Ie10)

And we see that indeed Ψ(x x ) is an eigenstate with energy E = ϵ + ϵ

vii Correlation in action a wave function of 2 interacting particles in an

Harmonic trap

We now build a simple electronic structure model that will allow us to study

in some detail the most basic concepts For this we suppose that the electrons

are in a harmonic atom that is the potential well

119907 (119909) =1

2120583120596 119909 (Ie11)

The two lowest eigenstates of a Harmonic oscillator are

120595 (119909) = 119873 119890

120595 (119909) = 119873 radic120583120596

ℏ119909119890

(Ie12)

The normalization constants are

119873 = (120583120596

120587ℏ)

119873 = (4120583120596

120587ℏ)

(Ie13)

And the eigenvalues are

120598 = (119899 +1

2)ℏ120596 119899 = 01 hellip (Ie14)

The groundstate energy of the 2-electron system in the triplet state describes

one electron in 120595 (119909) and another in 120595 (119909)

Electron Density Functional Theory Page 19

copy Roi Baer

119864 = 120598 + 120598 = 2ℏ120596 (Ie15)

As discussed above singlet and triplet two-electron ground state wave func-

tions composed of two orbitals must be space-symmetric or antisymmetric

respectively We consider below 3 wave functions The first Ψ is the ground

state singlet where both electrons are in 120595 The second and third are a singlet

and a triplet made from one electron in 120595 and the other in 120595

Ψ (119909 119909 ) = 120595 (119909 )120595 (119909 ) = 119873 119890

ℏ(

)

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) + 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 + 119909 )

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 minus 119909 )

(Ie16)

The normalization factors are

119873 = 119873 = radic

120583120596

120587ℏ 119873 =

1

radic2radic120583120596

ℏ119873 119873 =

120583120596

120587ℏ

(Ie17)

Eq lrm(Ie16) describes the distribution of positions of both electrons in their

corresponding states We now want to ask how much are the electrons in this

state aware of each other Do they correlate their motion in some way One

way to measure correlation is to consider the quantity lang119909 119909 rang minus lang119909 ranglang119909 rang If

electrons are completely unaware of each other this quantity called the posi-

tion-autocorrelation function is zero because then the average of the product

of their position must decompose to the product of the average Any deviance

from zero indicates some degree of correlation For the triplet wave function

in Eq lrm(Ie16) we find

Electron Density Functional Theory Page 20

copy Roi Baer

lang119909 rang = ⟨Ψ |119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩)

=1

2(⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩) = 0

(Ie18)

Of-course the same result would be obtained if we calculate ⟨119909 ⟩ because the

electrons are equivalent Furthermore

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩)

= minus|⟨120595 |119909|120595 ⟩| = minus

1

2

120583120596

(Ie19)

We thus find that

lang119909 119909 rang minus lang119909 ranglang119909 rang = minus1

2

120583120596

(Ie20)

The position autocorrelation functional is non-zero even though the electrons

are non-interacting The correlation is due to the Pauli principle It is negative

because the Pauli principle pushes the electrons apart and since most of the

time they both want to be near the origin this gives preference to them being

placed on opposite sides (when one electron has positive x coordinate the

other has negative and vice versa) Letrsquos see what happens in the singlet wave

function Ψ Here too lang119909 rang = lang119909 rang = 0 Then

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ = ⟨120595 120595 |119909 119909 |120595 120595 ⟩

= ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ = 0 (Ie21)

Electron Density Functional Theory Page 21

copy Roi Baer

The singlet ground state shows no correlation because the Pauli principle is

applied here to the spins not the spatial part of the wave function Note how-

ever this does not mean that all singlet wave functions of 2 electrons have no

correlation Indeed this is the situation in Ψ where the development is

very similar to Eq lrm(Ie19) except that the minus sign is now a plus sign so

lang119909 119909 rang minus lang119909 ranglang119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ minus 0

=1

2⟨120595 120595 + 120595 120595 |119909 119909 |120595 120595 + 120595 120595 ⟩ = |⟨120595 |119909|120595 ⟩|

=1

2

120583120596

(Ie22)

Here we find positive autocorrelation indicative of the fact that spin-opposite

non-interacting pairs of electrons want to stick together (when one has posi-

tive x the other wants to have as well and when one has negative the other

one wants negative as well) like non-interacting bosons

Since there is no interaction between the electrons the correlation in these

wave functions arises only from the Pauli principle ie because we impose

the fact that electrons are Fermions This is called Fermi correlation Our les-

son is this

1) Wavefunctions that are mere products of singe-particle orbitals have no correlation 2) If the products are symmetrized like in the case of the excited singlet the correlation

lang119909 119909 rang is positive indicating that the particles ldquolike to be togetherrdquo ie both on the right or both on the left of the origin

3) If the products are anti-symmetrized like in the case of the triplet the correlation lang119909 119909 rang is negative indicating that the particles ldquolike to be separatedrdquo

Up to now we assumed no e-e interaction So now letrsquos include it and add to

the Hamiltonian an interaction term

= ℎ(1) + ℎ(2) + 119881(119909 119909 ) (Ie23)

Electron Density Functional Theory Page 22

copy Roi Baer

Let us take one case of coupling which is simple enough to yield to analytical

analysis

119881(119909 119909 ) = 120583120574 119909 119909 (Ie24)

With 120574 lt 120596 In this case the Hamiltonian is

=119901

2120583+

119901

2120583+

1

2120583120596 119909

+1

2120583120596 119909

+ 120583120574 119909 119909 (Ie25)

The interaction seems strange at first sight because it does not seem to depend

on the distance between the particles as we are used to from electrostatics

yet it does describe a repulsion We can see this by writing the Hamiltonian

as

=119901

2120583+

119901

2120583+

1

2120583(120596 + 120574 )119909

+1

2120583(120596 + 120574 )119909

minus1

2120583120574 (119909 minus 119909 )

(Ie26)

We now see that the Hamiltonian describes two particles in a Harmonic po-

tential 119881 (119909) =

120583(120596 + 120574 )119909 interacting by a Harmonic potential

minus

120583120574 (119909 minus 119909 )

The potential drops as |119909 minus 119909 | grows ie it is repulsive

Exercise lrmI-1

Find the eigenvalues and eigenfunctions of this Hamiltonian (You can define

new coordinates 119883 and 119909 by

radic = 119909 and

radic = 119909 )

Exercise lrmI-2

Write the ground-state energy and wave function for the triplet state of the

system in the previous exercise Determine the effect of interaction on the en-

ergy by calculating 119903 =

and on the correlation function 119888 = lang119909 119909 rang minus

lang119909 ranglang119909 rang

Solution The result is shown in Figure lrmI-2 (left panel)

Electron Density Functional Theory Page 23

copy Roi Baer

Figure lrmI-2 (Left panel) The ratio = (Right panel) the correlation ratio ( ) (

)

vs interaction strengths where = (

)

f) The electron density is a much simpler object

than the wave function

The complexity of the electronic wave function is overwhelming It includes

all the information we can have on the electrons in a molecule in a certain

state However all these intricacies and details are often uninteresting for us

in many cases we simply have no direct use for them Take for instance the

electronic energy - our primary target in the Born-Oppenheimer picture It

depends only on the relative distance of pairs of particles This is because the

electron-electron Coulomb repulsion is a pair-wise interaction

One interesting quantity besides energy that can be extracted from the elec-

tronic wave function is the electron density 119899(119955) This 3D function tells us the

expectation value of the density of electrons That is 119899(119955)119889 119903 is the expectation

value of the number of electrons at a small volume 119889 119903 around point r Thus

we can write

119899(119955) = ⟨120595|(119955)|120595⟩ (If1)

We use the notation

00 01 02 03 04 05

04

05

06

07

08

09

10

r

00 01 02 03 04 050

5

10

15

cc

2

Electron Density Functional Theory Page 24

copy Roi Baer

⟨120595|120601⟩ equiv int120595 (119955 119955 hellip 119955 )120601(119955 119955 hellip 119955

)119889 119903 119889 119903 hellip119889 119903

(If2)

Here (119955) is the operator corresponding to the electron number density Since

electrons are point particles and the position operator for the ith electron is 119955

this operator is defined by

(119955) = sum120575( minus 119955)

(If3)

We used the definition of a 120575-function according to which

int120575(119955 minus 119955)119891(119955 )119889 119903 = 119891(119955)

(If4)

The function 120575( minus 119955) is the density of electron 119894 at 119955

Exercise lrmI-3

Calculate the expectation value of the electron number density and show that

119899(119955) = 119873 int120595(119955 119955 hellip 119955 )120595(119955 119955 hellip 119955

)119889 119903 hellip119889 119903

(If5)

Looking at Eq lrm(If5) we see that 119899(119955) involves integrating out a huge amount

of wave-function details It is as if only the data concerning the density distri-

bution of a single electron remains This is multiplied by 119873 in Eq lrm(If5) so

119899(119955) accounts for the combined density of all electrons Indeed integrating

over the entire space one obtains from lrm(If5)

int119899(119955)119889 119903 = 119873 (If6)

Expressing the fact that the total number of electrons is 119873

Exercise lrmI-4

Calculate and plot the 1D electron density of the triplet ground state from Ex-

ercise lrmI-2 as a function of the interaction parameter

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 12: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 12

copy Roi Baer

v The Born-Oppenheimer approximation in quantum nuclear case

The classical approach motivates a quantum treatment We are expecting that

nuclei will not excite electrons very efficiently That is the motivation for the

BO approximation

The Born and Oppenheimer development is similar to that of the adiabatic

theorem although there are no ldquoexternalrdquo fields Suppose we have a system of

fast particles with coordinates 119903 = (119903 119903 hellip ) and slow particles with coordi-

nates 119877 = (119877 119877 hellip ) The Hamiltonian can be written as

= + + 119881(119903 119877)

The Schroumldinger equation is

120595 (119903 119877) = 119864 120595 (119903 119877) (Id4)

Note that we can assume these wave functions are orthogonal

⟨120595 |120595 ⟩ = intint120595 (119903 119877)120595 (119903 119877)119889119903 119889119877 = 120575

Now to proceed let us first consider the fast r-Hamiltonian

[119877] = + 119881(119903 119877) (Id5)

In this ldquofast Hamiltonianrdquo [119877] the slow variables are simply parameters (it

contains no derivatives with respect to 119877) Thus it depends on 119877 parametrical-

ly The fast eigenvalue problem is

[119877]120601 (119903 119877) = 119882 (119877)120601 (119903 119877)

The eigenvalues are functions of the parameters 119877 They are called the adia-

batic (or Born-Oppenheimer) surfaces The notation with the semicolon be-

tween 119903 and 119877 is designed to emphasize that 120601 are wave functions in 119903 but

they depend only parametrically on 119877 This means for example that the over-

lap of the fast eigenstates (called adiabatic states) involves integration only

Electron Density Functional Theory Page 13

copy Roi Baer

over 119903 the dynamical coordinates while 119877 is held fixed since it is only a pa-

rameter

⟨120601 (119877)|120601 (119877)⟩ = int120601 (119903 119877)120601 (119903 119877)119889119903 = 120575

We cannot in fact say anything really meaningful about ⟨120601 (119877)|120601 (119877 )⟩ when

119877 ne 119877 (except when 119877 minus 119877 is infinitesimal but we will not go into this issue

here) Now we can expand the ldquorealrdquo wave-function as a linear combination

of the adiabatic functions

120595 (119903 119877) = sum120601 (119903 119877)Φ (119877)

(Id6)

We can do this because for any give 119877 120601 (119903 119877) span the space of wave func-

tions dependent on 119903 In fact the expansion coefficients Φ (119877) are given by

Φ (119877) = int120601 (119903 119877)120595 (119903 119877)119889119903

Now let us plug Eq lrm(Id6) into the SE lrm(Id4)

119864 120595 (119903 119877) = ( + )120595 (119903 119877)

= sum [120601 (119903 119877)Φ (119877)]

+ sum119882 (119877)120601 (119903 119877)Φ (119877)

(Id7)

Note that since = sum

where = minus119894ℏ

We have

[120601 (119903 119877)Φ (119877)]

= Φ (119877) [120601 (119903 119877)]

+ sum1

119872 ( 120601 (119903 119877)) ( Φ (119877))

+ 120601 (119903 119877) Φ (119877)

Multiplying Eq lrm(Id7) by 120601 (119903 119877) and integrating over 119903 gives

Electron Density Functional Theory Page 14

copy Roi Baer

sum(A + )Φ (119877)

+ ( + 119882 (119877))Φ (119877) = 119864 Φ (119877) (Id8)

Where

A = int120601 (119903 119877) [ 120601 (119903 119877)]119889119903 = ⟨120601 | 120601 ⟩

= sum

int120601 (119903 119877) [ 120601 (119903 119877)]119889119903

119872

= sum⟨120601 | 120601 ⟩

119872

= minussumℏ

119872 ⟨120601 |

120597

120597119877 120601 ⟩

120597

120597119877

= minussumℏ

119872 120591

120597

120597119877

The matrices 120591 = ⟨120601 |

120601 ⟩ are the non-adiabatic couplings in the ldquofastrdquo

system These are exactly the non-adiabatic coefficients in the adiabatic theory

(Eq lrm(Id3))

It is possible to show that

sum(A + )Φ (119877)

+ Φ (119877)

= sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

Thus Eq lrm(Id8) becomes

sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

This is a Schroumldinger ndashlike equation which determines the coefficients Φ (119877)

These are called the ldquoslow eigenfunctionsrdquo Once they are computed one has

an exact solution to the Schroumldinger equation However we do not really

want to solve this infinite set of coupled differential equations Thus we as-

sume that the quantities 120591 for 119895 ne 119896 can be neglected Note that here there is

no which can be taken as small as needed to make the effect of 120591 as mi-

Electron Density Functional Theory Page 15

copy Roi Baer

nute as we need Still we can hope that the fact that we chose 119877 to be slow de-

grees of freedom allow us to make just this approximation This was the idea

of Born and Oppenheimer who neglected 120591

summinusℏ

2119872

120597

120597119877 Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

The resulting equation is a Schroumldinger equation for the slow degrees of free-

dom which move under the force of electrons derived from a potential 119882 (119877)

When applied to molecules the slow degrees of freedom are usually the nuclei

and the fast - the electrons There is a problem with neglecting 120591 because of

their non-dynamical effect Taking them into account results in treating them

as a magnetic field

sumℏ

2119872 (119894

120597

120597119877 minus 120591 )

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877) (Id9)

The BO approximation breaks the molecular SE into two consecutive tasks

First the electronic SE Eq lrm(Id5) must be solved for any relevant clamped po-

sition of nuclei 119877 Then the nuclear equation lrm(Id9)

Further reading on this subject M Baer Beyond Born-Oppenheimer elec-

tronic non-adiabatic coupling terms and conical intersections (Wiley Hobo-

ken NJ 2006)

e) Electron correlation

vi The electronic wave function of two non-interacting electrons

We have stressed that the electronic structure problem is difficult because of

the complexity of the many-body wave function when electrons interact In

order to appreciate this complexity of the electronic wave function let us first

study a simple system of two non-interacting electrons in a 1D atomic well

Electron Density Functional Theory Page 16

copy Roi Baer

We consider an atomic well given by the potential 119907 (119909) and we place in it

an electron The Hamiltonian is

ℎ = minusℏ

2119898 nabla + 119907 (119909)

(Ie1)

Now consider a 2-electron problem Assume we have two electrons Fermi-

ons which are non-interacting in the well The Hamiltonian is

= ℎ(1) + ℎ(2) (Ie2)

The notation ℎ(119894) means the Hamiltonian of Eq lrm(Ie1) with the coordinate of

the i-th electron

What are the eigenstates in this case First since each electron can have a

spin we must decide on the spin of the state For now let us assume the state

is spin-polarized ie that the total spine is 1 both electrons are in spin-up or-

bitals We try the following form as a wave function

Ψ(119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]120572(1)120572(2) (Ie3)

Notice that the spatial part is anti-symmetric while the spin part is symmetric

This renders the entire wave-function anti-symmetric in accordance with the

Pauli principle The notation 120572(1)120572(2) means both electrons have spin projec-

tion up (120572) We do not yet know if and under what conditions this wave func-

tion can actually describe eigenstates of the two electrons the

an lrm(Ie2) We assume that 120595 (119909) and 120595 (119909) are orthonormal This makes the

state normalized since

Electron Density Functional Theory Page 17

copy Roi Baer

intΨ(119909 119909 ) 119889119909 119889119909 =

1

2int(120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 ))

119889119909 119889119909

=1

2int [(120595 (119909 )120595 (119909 ))

+ (120595 (119909 )120595 (119909 ))

minus 2120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 )] 119889119909 119889119909

(Ie4)

The first and second terms are both equal to

int(120595 (119909 )120595 (119909 )) 119889119909 119889119909 =int120595 (119909 )

119889119909 int120595 (119909 ) 119889119909 =1 times 1

= 1 (Ie5)

The third term in lrm(Ie4) is

int120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 ) 119889119909 119889119909

=int120595 (119909 )120595 (119909 ) 119889119909 int120595 (119909 )120595 (119909 ) 119889119909 = 0 times 0 = 0 (Ie6)

Indeed the entire wave function is orthonormal (thanks to the factor 1radic2

in lrm(Ie3))

Now let us see under what condition the wave function in Eq (Ie3) is an ei-

genstate of the Hamiltonian in lrm(Ie2)

Ψ(119909 119909 ) = (ℎ(1) + ℎ(2))Ψ(119909 119909 )

=1

radic2120595 (119909 )ℎ(1)120595 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

minus1

radic2120595 (119909 )ℎ(1)120601 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

(Ie7)

This should be equated to

119864Ψ(119909 119909 ) =1

radic2119864120595 (119909 )120595 (119909 ) minus 119864120595 (119909 )120595 (119909 ) (Ie8)

If we choose the orbitals 120595 (119909) and 120595 (119909) to be eigenstates of h (which are

orthogonal so that is compatible with our previous assumption)

Electron Density Functional Theory Page 18

copy Roi Baer

ℎ120595 (119909) = 120598 120595 (119909) 119894 = 01 hellip (Ie9)

Thus

Ψ(119909 119909 ) =1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

minus1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

= (120598 + 120598 )Ψ(119909 119909 )

(Ie10)

And we see that indeed Ψ(x x ) is an eigenstate with energy E = ϵ + ϵ

vii Correlation in action a wave function of 2 interacting particles in an

Harmonic trap

We now build a simple electronic structure model that will allow us to study

in some detail the most basic concepts For this we suppose that the electrons

are in a harmonic atom that is the potential well

119907 (119909) =1

2120583120596 119909 (Ie11)

The two lowest eigenstates of a Harmonic oscillator are

120595 (119909) = 119873 119890

120595 (119909) = 119873 radic120583120596

ℏ119909119890

(Ie12)

The normalization constants are

119873 = (120583120596

120587ℏ)

119873 = (4120583120596

120587ℏ)

(Ie13)

And the eigenvalues are

120598 = (119899 +1

2)ℏ120596 119899 = 01 hellip (Ie14)

The groundstate energy of the 2-electron system in the triplet state describes

one electron in 120595 (119909) and another in 120595 (119909)

Electron Density Functional Theory Page 19

copy Roi Baer

119864 = 120598 + 120598 = 2ℏ120596 (Ie15)

As discussed above singlet and triplet two-electron ground state wave func-

tions composed of two orbitals must be space-symmetric or antisymmetric

respectively We consider below 3 wave functions The first Ψ is the ground

state singlet where both electrons are in 120595 The second and third are a singlet

and a triplet made from one electron in 120595 and the other in 120595

Ψ (119909 119909 ) = 120595 (119909 )120595 (119909 ) = 119873 119890

ℏ(

)

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) + 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 + 119909 )

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 minus 119909 )

(Ie16)

The normalization factors are

119873 = 119873 = radic

120583120596

120587ℏ 119873 =

1

radic2radic120583120596

ℏ119873 119873 =

120583120596

120587ℏ

(Ie17)

Eq lrm(Ie16) describes the distribution of positions of both electrons in their

corresponding states We now want to ask how much are the electrons in this

state aware of each other Do they correlate their motion in some way One

way to measure correlation is to consider the quantity lang119909 119909 rang minus lang119909 ranglang119909 rang If

electrons are completely unaware of each other this quantity called the posi-

tion-autocorrelation function is zero because then the average of the product

of their position must decompose to the product of the average Any deviance

from zero indicates some degree of correlation For the triplet wave function

in Eq lrm(Ie16) we find

Electron Density Functional Theory Page 20

copy Roi Baer

lang119909 rang = ⟨Ψ |119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩)

=1

2(⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩) = 0

(Ie18)

Of-course the same result would be obtained if we calculate ⟨119909 ⟩ because the

electrons are equivalent Furthermore

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩)

= minus|⟨120595 |119909|120595 ⟩| = minus

1

2

120583120596

(Ie19)

We thus find that

lang119909 119909 rang minus lang119909 ranglang119909 rang = minus1

2

120583120596

(Ie20)

The position autocorrelation functional is non-zero even though the electrons

are non-interacting The correlation is due to the Pauli principle It is negative

because the Pauli principle pushes the electrons apart and since most of the

time they both want to be near the origin this gives preference to them being

placed on opposite sides (when one electron has positive x coordinate the

other has negative and vice versa) Letrsquos see what happens in the singlet wave

function Ψ Here too lang119909 rang = lang119909 rang = 0 Then

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ = ⟨120595 120595 |119909 119909 |120595 120595 ⟩

= ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ = 0 (Ie21)

Electron Density Functional Theory Page 21

copy Roi Baer

The singlet ground state shows no correlation because the Pauli principle is

applied here to the spins not the spatial part of the wave function Note how-

ever this does not mean that all singlet wave functions of 2 electrons have no

correlation Indeed this is the situation in Ψ where the development is

very similar to Eq lrm(Ie19) except that the minus sign is now a plus sign so

lang119909 119909 rang minus lang119909 ranglang119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ minus 0

=1

2⟨120595 120595 + 120595 120595 |119909 119909 |120595 120595 + 120595 120595 ⟩ = |⟨120595 |119909|120595 ⟩|

=1

2

120583120596

(Ie22)

Here we find positive autocorrelation indicative of the fact that spin-opposite

non-interacting pairs of electrons want to stick together (when one has posi-

tive x the other wants to have as well and when one has negative the other

one wants negative as well) like non-interacting bosons

Since there is no interaction between the electrons the correlation in these

wave functions arises only from the Pauli principle ie because we impose

the fact that electrons are Fermions This is called Fermi correlation Our les-

son is this

1) Wavefunctions that are mere products of singe-particle orbitals have no correlation 2) If the products are symmetrized like in the case of the excited singlet the correlation

lang119909 119909 rang is positive indicating that the particles ldquolike to be togetherrdquo ie both on the right or both on the left of the origin

3) If the products are anti-symmetrized like in the case of the triplet the correlation lang119909 119909 rang is negative indicating that the particles ldquolike to be separatedrdquo

Up to now we assumed no e-e interaction So now letrsquos include it and add to

the Hamiltonian an interaction term

= ℎ(1) + ℎ(2) + 119881(119909 119909 ) (Ie23)

Electron Density Functional Theory Page 22

copy Roi Baer

Let us take one case of coupling which is simple enough to yield to analytical

analysis

119881(119909 119909 ) = 120583120574 119909 119909 (Ie24)

With 120574 lt 120596 In this case the Hamiltonian is

=119901

2120583+

119901

2120583+

1

2120583120596 119909

+1

2120583120596 119909

+ 120583120574 119909 119909 (Ie25)

The interaction seems strange at first sight because it does not seem to depend

on the distance between the particles as we are used to from electrostatics

yet it does describe a repulsion We can see this by writing the Hamiltonian

as

=119901

2120583+

119901

2120583+

1

2120583(120596 + 120574 )119909

+1

2120583(120596 + 120574 )119909

minus1

2120583120574 (119909 minus 119909 )

(Ie26)

We now see that the Hamiltonian describes two particles in a Harmonic po-

tential 119881 (119909) =

120583(120596 + 120574 )119909 interacting by a Harmonic potential

minus

120583120574 (119909 minus 119909 )

The potential drops as |119909 minus 119909 | grows ie it is repulsive

Exercise lrmI-1

Find the eigenvalues and eigenfunctions of this Hamiltonian (You can define

new coordinates 119883 and 119909 by

radic = 119909 and

radic = 119909 )

Exercise lrmI-2

Write the ground-state energy and wave function for the triplet state of the

system in the previous exercise Determine the effect of interaction on the en-

ergy by calculating 119903 =

and on the correlation function 119888 = lang119909 119909 rang minus

lang119909 ranglang119909 rang

Solution The result is shown in Figure lrmI-2 (left panel)

Electron Density Functional Theory Page 23

copy Roi Baer

Figure lrmI-2 (Left panel) The ratio = (Right panel) the correlation ratio ( ) (

)

vs interaction strengths where = (

)

f) The electron density is a much simpler object

than the wave function

The complexity of the electronic wave function is overwhelming It includes

all the information we can have on the electrons in a molecule in a certain

state However all these intricacies and details are often uninteresting for us

in many cases we simply have no direct use for them Take for instance the

electronic energy - our primary target in the Born-Oppenheimer picture It

depends only on the relative distance of pairs of particles This is because the

electron-electron Coulomb repulsion is a pair-wise interaction

One interesting quantity besides energy that can be extracted from the elec-

tronic wave function is the electron density 119899(119955) This 3D function tells us the

expectation value of the density of electrons That is 119899(119955)119889 119903 is the expectation

value of the number of electrons at a small volume 119889 119903 around point r Thus

we can write

119899(119955) = ⟨120595|(119955)|120595⟩ (If1)

We use the notation

00 01 02 03 04 05

04

05

06

07

08

09

10

r

00 01 02 03 04 050

5

10

15

cc

2

Electron Density Functional Theory Page 24

copy Roi Baer

⟨120595|120601⟩ equiv int120595 (119955 119955 hellip 119955 )120601(119955 119955 hellip 119955

)119889 119903 119889 119903 hellip119889 119903

(If2)

Here (119955) is the operator corresponding to the electron number density Since

electrons are point particles and the position operator for the ith electron is 119955

this operator is defined by

(119955) = sum120575( minus 119955)

(If3)

We used the definition of a 120575-function according to which

int120575(119955 minus 119955)119891(119955 )119889 119903 = 119891(119955)

(If4)

The function 120575( minus 119955) is the density of electron 119894 at 119955

Exercise lrmI-3

Calculate the expectation value of the electron number density and show that

119899(119955) = 119873 int120595(119955 119955 hellip 119955 )120595(119955 119955 hellip 119955

)119889 119903 hellip119889 119903

(If5)

Looking at Eq lrm(If5) we see that 119899(119955) involves integrating out a huge amount

of wave-function details It is as if only the data concerning the density distri-

bution of a single electron remains This is multiplied by 119873 in Eq lrm(If5) so

119899(119955) accounts for the combined density of all electrons Indeed integrating

over the entire space one obtains from lrm(If5)

int119899(119955)119889 119903 = 119873 (If6)

Expressing the fact that the total number of electrons is 119873

Exercise lrmI-4

Calculate and plot the 1D electron density of the triplet ground state from Ex-

ercise lrmI-2 as a function of the interaction parameter

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 13: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 13

copy Roi Baer

over 119903 the dynamical coordinates while 119877 is held fixed since it is only a pa-

rameter

⟨120601 (119877)|120601 (119877)⟩ = int120601 (119903 119877)120601 (119903 119877)119889119903 = 120575

We cannot in fact say anything really meaningful about ⟨120601 (119877)|120601 (119877 )⟩ when

119877 ne 119877 (except when 119877 minus 119877 is infinitesimal but we will not go into this issue

here) Now we can expand the ldquorealrdquo wave-function as a linear combination

of the adiabatic functions

120595 (119903 119877) = sum120601 (119903 119877)Φ (119877)

(Id6)

We can do this because for any give 119877 120601 (119903 119877) span the space of wave func-

tions dependent on 119903 In fact the expansion coefficients Φ (119877) are given by

Φ (119877) = int120601 (119903 119877)120595 (119903 119877)119889119903

Now let us plug Eq lrm(Id6) into the SE lrm(Id4)

119864 120595 (119903 119877) = ( + )120595 (119903 119877)

= sum [120601 (119903 119877)Φ (119877)]

+ sum119882 (119877)120601 (119903 119877)Φ (119877)

(Id7)

Note that since = sum

where = minus119894ℏ

We have

[120601 (119903 119877)Φ (119877)]

= Φ (119877) [120601 (119903 119877)]

+ sum1

119872 ( 120601 (119903 119877)) ( Φ (119877))

+ 120601 (119903 119877) Φ (119877)

Multiplying Eq lrm(Id7) by 120601 (119903 119877) and integrating over 119903 gives

Electron Density Functional Theory Page 14

copy Roi Baer

sum(A + )Φ (119877)

+ ( + 119882 (119877))Φ (119877) = 119864 Φ (119877) (Id8)

Where

A = int120601 (119903 119877) [ 120601 (119903 119877)]119889119903 = ⟨120601 | 120601 ⟩

= sum

int120601 (119903 119877) [ 120601 (119903 119877)]119889119903

119872

= sum⟨120601 | 120601 ⟩

119872

= minussumℏ

119872 ⟨120601 |

120597

120597119877 120601 ⟩

120597

120597119877

= minussumℏ

119872 120591

120597

120597119877

The matrices 120591 = ⟨120601 |

120601 ⟩ are the non-adiabatic couplings in the ldquofastrdquo

system These are exactly the non-adiabatic coefficients in the adiabatic theory

(Eq lrm(Id3))

It is possible to show that

sum(A + )Φ (119877)

+ Φ (119877)

= sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

Thus Eq lrm(Id8) becomes

sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

This is a Schroumldinger ndashlike equation which determines the coefficients Φ (119877)

These are called the ldquoslow eigenfunctionsrdquo Once they are computed one has

an exact solution to the Schroumldinger equation However we do not really

want to solve this infinite set of coupled differential equations Thus we as-

sume that the quantities 120591 for 119895 ne 119896 can be neglected Note that here there is

no which can be taken as small as needed to make the effect of 120591 as mi-

Electron Density Functional Theory Page 15

copy Roi Baer

nute as we need Still we can hope that the fact that we chose 119877 to be slow de-

grees of freedom allow us to make just this approximation This was the idea

of Born and Oppenheimer who neglected 120591

summinusℏ

2119872

120597

120597119877 Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

The resulting equation is a Schroumldinger equation for the slow degrees of free-

dom which move under the force of electrons derived from a potential 119882 (119877)

When applied to molecules the slow degrees of freedom are usually the nuclei

and the fast - the electrons There is a problem with neglecting 120591 because of

their non-dynamical effect Taking them into account results in treating them

as a magnetic field

sumℏ

2119872 (119894

120597

120597119877 minus 120591 )

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877) (Id9)

The BO approximation breaks the molecular SE into two consecutive tasks

First the electronic SE Eq lrm(Id5) must be solved for any relevant clamped po-

sition of nuclei 119877 Then the nuclear equation lrm(Id9)

Further reading on this subject M Baer Beyond Born-Oppenheimer elec-

tronic non-adiabatic coupling terms and conical intersections (Wiley Hobo-

ken NJ 2006)

e) Electron correlation

vi The electronic wave function of two non-interacting electrons

We have stressed that the electronic structure problem is difficult because of

the complexity of the many-body wave function when electrons interact In

order to appreciate this complexity of the electronic wave function let us first

study a simple system of two non-interacting electrons in a 1D atomic well

Electron Density Functional Theory Page 16

copy Roi Baer

We consider an atomic well given by the potential 119907 (119909) and we place in it

an electron The Hamiltonian is

ℎ = minusℏ

2119898 nabla + 119907 (119909)

(Ie1)

Now consider a 2-electron problem Assume we have two electrons Fermi-

ons which are non-interacting in the well The Hamiltonian is

= ℎ(1) + ℎ(2) (Ie2)

The notation ℎ(119894) means the Hamiltonian of Eq lrm(Ie1) with the coordinate of

the i-th electron

What are the eigenstates in this case First since each electron can have a

spin we must decide on the spin of the state For now let us assume the state

is spin-polarized ie that the total spine is 1 both electrons are in spin-up or-

bitals We try the following form as a wave function

Ψ(119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]120572(1)120572(2) (Ie3)

Notice that the spatial part is anti-symmetric while the spin part is symmetric

This renders the entire wave-function anti-symmetric in accordance with the

Pauli principle The notation 120572(1)120572(2) means both electrons have spin projec-

tion up (120572) We do not yet know if and under what conditions this wave func-

tion can actually describe eigenstates of the two electrons the

an lrm(Ie2) We assume that 120595 (119909) and 120595 (119909) are orthonormal This makes the

state normalized since

Electron Density Functional Theory Page 17

copy Roi Baer

intΨ(119909 119909 ) 119889119909 119889119909 =

1

2int(120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 ))

119889119909 119889119909

=1

2int [(120595 (119909 )120595 (119909 ))

+ (120595 (119909 )120595 (119909 ))

minus 2120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 )] 119889119909 119889119909

(Ie4)

The first and second terms are both equal to

int(120595 (119909 )120595 (119909 )) 119889119909 119889119909 =int120595 (119909 )

119889119909 int120595 (119909 ) 119889119909 =1 times 1

= 1 (Ie5)

The third term in lrm(Ie4) is

int120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 ) 119889119909 119889119909

=int120595 (119909 )120595 (119909 ) 119889119909 int120595 (119909 )120595 (119909 ) 119889119909 = 0 times 0 = 0 (Ie6)

Indeed the entire wave function is orthonormal (thanks to the factor 1radic2

in lrm(Ie3))

Now let us see under what condition the wave function in Eq (Ie3) is an ei-

genstate of the Hamiltonian in lrm(Ie2)

Ψ(119909 119909 ) = (ℎ(1) + ℎ(2))Ψ(119909 119909 )

=1

radic2120595 (119909 )ℎ(1)120595 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

minus1

radic2120595 (119909 )ℎ(1)120601 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

(Ie7)

This should be equated to

119864Ψ(119909 119909 ) =1

radic2119864120595 (119909 )120595 (119909 ) minus 119864120595 (119909 )120595 (119909 ) (Ie8)

If we choose the orbitals 120595 (119909) and 120595 (119909) to be eigenstates of h (which are

orthogonal so that is compatible with our previous assumption)

Electron Density Functional Theory Page 18

copy Roi Baer

ℎ120595 (119909) = 120598 120595 (119909) 119894 = 01 hellip (Ie9)

Thus

Ψ(119909 119909 ) =1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

minus1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

= (120598 + 120598 )Ψ(119909 119909 )

(Ie10)

And we see that indeed Ψ(x x ) is an eigenstate with energy E = ϵ + ϵ

vii Correlation in action a wave function of 2 interacting particles in an

Harmonic trap

We now build a simple electronic structure model that will allow us to study

in some detail the most basic concepts For this we suppose that the electrons

are in a harmonic atom that is the potential well

119907 (119909) =1

2120583120596 119909 (Ie11)

The two lowest eigenstates of a Harmonic oscillator are

120595 (119909) = 119873 119890

120595 (119909) = 119873 radic120583120596

ℏ119909119890

(Ie12)

The normalization constants are

119873 = (120583120596

120587ℏ)

119873 = (4120583120596

120587ℏ)

(Ie13)

And the eigenvalues are

120598 = (119899 +1

2)ℏ120596 119899 = 01 hellip (Ie14)

The groundstate energy of the 2-electron system in the triplet state describes

one electron in 120595 (119909) and another in 120595 (119909)

Electron Density Functional Theory Page 19

copy Roi Baer

119864 = 120598 + 120598 = 2ℏ120596 (Ie15)

As discussed above singlet and triplet two-electron ground state wave func-

tions composed of two orbitals must be space-symmetric or antisymmetric

respectively We consider below 3 wave functions The first Ψ is the ground

state singlet where both electrons are in 120595 The second and third are a singlet

and a triplet made from one electron in 120595 and the other in 120595

Ψ (119909 119909 ) = 120595 (119909 )120595 (119909 ) = 119873 119890

ℏ(

)

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) + 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 + 119909 )

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 minus 119909 )

(Ie16)

The normalization factors are

119873 = 119873 = radic

120583120596

120587ℏ 119873 =

1

radic2radic120583120596

ℏ119873 119873 =

120583120596

120587ℏ

(Ie17)

Eq lrm(Ie16) describes the distribution of positions of both electrons in their

corresponding states We now want to ask how much are the electrons in this

state aware of each other Do they correlate their motion in some way One

way to measure correlation is to consider the quantity lang119909 119909 rang minus lang119909 ranglang119909 rang If

electrons are completely unaware of each other this quantity called the posi-

tion-autocorrelation function is zero because then the average of the product

of their position must decompose to the product of the average Any deviance

from zero indicates some degree of correlation For the triplet wave function

in Eq lrm(Ie16) we find

Electron Density Functional Theory Page 20

copy Roi Baer

lang119909 rang = ⟨Ψ |119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩)

=1

2(⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩) = 0

(Ie18)

Of-course the same result would be obtained if we calculate ⟨119909 ⟩ because the

electrons are equivalent Furthermore

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩)

= minus|⟨120595 |119909|120595 ⟩| = minus

1

2

120583120596

(Ie19)

We thus find that

lang119909 119909 rang minus lang119909 ranglang119909 rang = minus1

2

120583120596

(Ie20)

The position autocorrelation functional is non-zero even though the electrons

are non-interacting The correlation is due to the Pauli principle It is negative

because the Pauli principle pushes the electrons apart and since most of the

time they both want to be near the origin this gives preference to them being

placed on opposite sides (when one electron has positive x coordinate the

other has negative and vice versa) Letrsquos see what happens in the singlet wave

function Ψ Here too lang119909 rang = lang119909 rang = 0 Then

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ = ⟨120595 120595 |119909 119909 |120595 120595 ⟩

= ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ = 0 (Ie21)

Electron Density Functional Theory Page 21

copy Roi Baer

The singlet ground state shows no correlation because the Pauli principle is

applied here to the spins not the spatial part of the wave function Note how-

ever this does not mean that all singlet wave functions of 2 electrons have no

correlation Indeed this is the situation in Ψ where the development is

very similar to Eq lrm(Ie19) except that the minus sign is now a plus sign so

lang119909 119909 rang minus lang119909 ranglang119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ minus 0

=1

2⟨120595 120595 + 120595 120595 |119909 119909 |120595 120595 + 120595 120595 ⟩ = |⟨120595 |119909|120595 ⟩|

=1

2

120583120596

(Ie22)

Here we find positive autocorrelation indicative of the fact that spin-opposite

non-interacting pairs of electrons want to stick together (when one has posi-

tive x the other wants to have as well and when one has negative the other

one wants negative as well) like non-interacting bosons

Since there is no interaction between the electrons the correlation in these

wave functions arises only from the Pauli principle ie because we impose

the fact that electrons are Fermions This is called Fermi correlation Our les-

son is this

1) Wavefunctions that are mere products of singe-particle orbitals have no correlation 2) If the products are symmetrized like in the case of the excited singlet the correlation

lang119909 119909 rang is positive indicating that the particles ldquolike to be togetherrdquo ie both on the right or both on the left of the origin

3) If the products are anti-symmetrized like in the case of the triplet the correlation lang119909 119909 rang is negative indicating that the particles ldquolike to be separatedrdquo

Up to now we assumed no e-e interaction So now letrsquos include it and add to

the Hamiltonian an interaction term

= ℎ(1) + ℎ(2) + 119881(119909 119909 ) (Ie23)

Electron Density Functional Theory Page 22

copy Roi Baer

Let us take one case of coupling which is simple enough to yield to analytical

analysis

119881(119909 119909 ) = 120583120574 119909 119909 (Ie24)

With 120574 lt 120596 In this case the Hamiltonian is

=119901

2120583+

119901

2120583+

1

2120583120596 119909

+1

2120583120596 119909

+ 120583120574 119909 119909 (Ie25)

The interaction seems strange at first sight because it does not seem to depend

on the distance between the particles as we are used to from electrostatics

yet it does describe a repulsion We can see this by writing the Hamiltonian

as

=119901

2120583+

119901

2120583+

1

2120583(120596 + 120574 )119909

+1

2120583(120596 + 120574 )119909

minus1

2120583120574 (119909 minus 119909 )

(Ie26)

We now see that the Hamiltonian describes two particles in a Harmonic po-

tential 119881 (119909) =

120583(120596 + 120574 )119909 interacting by a Harmonic potential

minus

120583120574 (119909 minus 119909 )

The potential drops as |119909 minus 119909 | grows ie it is repulsive

Exercise lrmI-1

Find the eigenvalues and eigenfunctions of this Hamiltonian (You can define

new coordinates 119883 and 119909 by

radic = 119909 and

radic = 119909 )

Exercise lrmI-2

Write the ground-state energy and wave function for the triplet state of the

system in the previous exercise Determine the effect of interaction on the en-

ergy by calculating 119903 =

and on the correlation function 119888 = lang119909 119909 rang minus

lang119909 ranglang119909 rang

Solution The result is shown in Figure lrmI-2 (left panel)

Electron Density Functional Theory Page 23

copy Roi Baer

Figure lrmI-2 (Left panel) The ratio = (Right panel) the correlation ratio ( ) (

)

vs interaction strengths where = (

)

f) The electron density is a much simpler object

than the wave function

The complexity of the electronic wave function is overwhelming It includes

all the information we can have on the electrons in a molecule in a certain

state However all these intricacies and details are often uninteresting for us

in many cases we simply have no direct use for them Take for instance the

electronic energy - our primary target in the Born-Oppenheimer picture It

depends only on the relative distance of pairs of particles This is because the

electron-electron Coulomb repulsion is a pair-wise interaction

One interesting quantity besides energy that can be extracted from the elec-

tronic wave function is the electron density 119899(119955) This 3D function tells us the

expectation value of the density of electrons That is 119899(119955)119889 119903 is the expectation

value of the number of electrons at a small volume 119889 119903 around point r Thus

we can write

119899(119955) = ⟨120595|(119955)|120595⟩ (If1)

We use the notation

00 01 02 03 04 05

04

05

06

07

08

09

10

r

00 01 02 03 04 050

5

10

15

cc

2

Electron Density Functional Theory Page 24

copy Roi Baer

⟨120595|120601⟩ equiv int120595 (119955 119955 hellip 119955 )120601(119955 119955 hellip 119955

)119889 119903 119889 119903 hellip119889 119903

(If2)

Here (119955) is the operator corresponding to the electron number density Since

electrons are point particles and the position operator for the ith electron is 119955

this operator is defined by

(119955) = sum120575( minus 119955)

(If3)

We used the definition of a 120575-function according to which

int120575(119955 minus 119955)119891(119955 )119889 119903 = 119891(119955)

(If4)

The function 120575( minus 119955) is the density of electron 119894 at 119955

Exercise lrmI-3

Calculate the expectation value of the electron number density and show that

119899(119955) = 119873 int120595(119955 119955 hellip 119955 )120595(119955 119955 hellip 119955

)119889 119903 hellip119889 119903

(If5)

Looking at Eq lrm(If5) we see that 119899(119955) involves integrating out a huge amount

of wave-function details It is as if only the data concerning the density distri-

bution of a single electron remains This is multiplied by 119873 in Eq lrm(If5) so

119899(119955) accounts for the combined density of all electrons Indeed integrating

over the entire space one obtains from lrm(If5)

int119899(119955)119889 119903 = 119873 (If6)

Expressing the fact that the total number of electrons is 119873

Exercise lrmI-4

Calculate and plot the 1D electron density of the triplet ground state from Ex-

ercise lrmI-2 as a function of the interaction parameter

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 14: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 14

copy Roi Baer

sum(A + )Φ (119877)

+ ( + 119882 (119877))Φ (119877) = 119864 Φ (119877) (Id8)

Where

A = int120601 (119903 119877) [ 120601 (119903 119877)]119889119903 = ⟨120601 | 120601 ⟩

= sum

int120601 (119903 119877) [ 120601 (119903 119877)]119889119903

119872

= sum⟨120601 | 120601 ⟩

119872

= minussumℏ

119872 ⟨120601 |

120597

120597119877 120601 ⟩

120597

120597119877

= minussumℏ

119872 120591

120597

120597119877

The matrices 120591 = ⟨120601 |

120601 ⟩ are the non-adiabatic couplings in the ldquofastrdquo

system These are exactly the non-adiabatic coefficients in the adiabatic theory

(Eq lrm(Id3))

It is possible to show that

sum(A + )Φ (119877)

+ Φ (119877)

= sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

Thus Eq lrm(Id8) becomes

sumsumminusℏ

2119872 (

120597

120597119877 + 120591

)

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

This is a Schroumldinger ndashlike equation which determines the coefficients Φ (119877)

These are called the ldquoslow eigenfunctionsrdquo Once they are computed one has

an exact solution to the Schroumldinger equation However we do not really

want to solve this infinite set of coupled differential equations Thus we as-

sume that the quantities 120591 for 119895 ne 119896 can be neglected Note that here there is

no which can be taken as small as needed to make the effect of 120591 as mi-

Electron Density Functional Theory Page 15

copy Roi Baer

nute as we need Still we can hope that the fact that we chose 119877 to be slow de-

grees of freedom allow us to make just this approximation This was the idea

of Born and Oppenheimer who neglected 120591

summinusℏ

2119872

120597

120597119877 Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

The resulting equation is a Schroumldinger equation for the slow degrees of free-

dom which move under the force of electrons derived from a potential 119882 (119877)

When applied to molecules the slow degrees of freedom are usually the nuclei

and the fast - the electrons There is a problem with neglecting 120591 because of

their non-dynamical effect Taking them into account results in treating them

as a magnetic field

sumℏ

2119872 (119894

120597

120597119877 minus 120591 )

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877) (Id9)

The BO approximation breaks the molecular SE into two consecutive tasks

First the electronic SE Eq lrm(Id5) must be solved for any relevant clamped po-

sition of nuclei 119877 Then the nuclear equation lrm(Id9)

Further reading on this subject M Baer Beyond Born-Oppenheimer elec-

tronic non-adiabatic coupling terms and conical intersections (Wiley Hobo-

ken NJ 2006)

e) Electron correlation

vi The electronic wave function of two non-interacting electrons

We have stressed that the electronic structure problem is difficult because of

the complexity of the many-body wave function when electrons interact In

order to appreciate this complexity of the electronic wave function let us first

study a simple system of two non-interacting electrons in a 1D atomic well

Electron Density Functional Theory Page 16

copy Roi Baer

We consider an atomic well given by the potential 119907 (119909) and we place in it

an electron The Hamiltonian is

ℎ = minusℏ

2119898 nabla + 119907 (119909)

(Ie1)

Now consider a 2-electron problem Assume we have two electrons Fermi-

ons which are non-interacting in the well The Hamiltonian is

= ℎ(1) + ℎ(2) (Ie2)

The notation ℎ(119894) means the Hamiltonian of Eq lrm(Ie1) with the coordinate of

the i-th electron

What are the eigenstates in this case First since each electron can have a

spin we must decide on the spin of the state For now let us assume the state

is spin-polarized ie that the total spine is 1 both electrons are in spin-up or-

bitals We try the following form as a wave function

Ψ(119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]120572(1)120572(2) (Ie3)

Notice that the spatial part is anti-symmetric while the spin part is symmetric

This renders the entire wave-function anti-symmetric in accordance with the

Pauli principle The notation 120572(1)120572(2) means both electrons have spin projec-

tion up (120572) We do not yet know if and under what conditions this wave func-

tion can actually describe eigenstates of the two electrons the

an lrm(Ie2) We assume that 120595 (119909) and 120595 (119909) are orthonormal This makes the

state normalized since

Electron Density Functional Theory Page 17

copy Roi Baer

intΨ(119909 119909 ) 119889119909 119889119909 =

1

2int(120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 ))

119889119909 119889119909

=1

2int [(120595 (119909 )120595 (119909 ))

+ (120595 (119909 )120595 (119909 ))

minus 2120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 )] 119889119909 119889119909

(Ie4)

The first and second terms are both equal to

int(120595 (119909 )120595 (119909 )) 119889119909 119889119909 =int120595 (119909 )

119889119909 int120595 (119909 ) 119889119909 =1 times 1

= 1 (Ie5)

The third term in lrm(Ie4) is

int120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 ) 119889119909 119889119909

=int120595 (119909 )120595 (119909 ) 119889119909 int120595 (119909 )120595 (119909 ) 119889119909 = 0 times 0 = 0 (Ie6)

Indeed the entire wave function is orthonormal (thanks to the factor 1radic2

in lrm(Ie3))

Now let us see under what condition the wave function in Eq (Ie3) is an ei-

genstate of the Hamiltonian in lrm(Ie2)

Ψ(119909 119909 ) = (ℎ(1) + ℎ(2))Ψ(119909 119909 )

=1

radic2120595 (119909 )ℎ(1)120595 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

minus1

radic2120595 (119909 )ℎ(1)120601 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

(Ie7)

This should be equated to

119864Ψ(119909 119909 ) =1

radic2119864120595 (119909 )120595 (119909 ) minus 119864120595 (119909 )120595 (119909 ) (Ie8)

If we choose the orbitals 120595 (119909) and 120595 (119909) to be eigenstates of h (which are

orthogonal so that is compatible with our previous assumption)

Electron Density Functional Theory Page 18

copy Roi Baer

ℎ120595 (119909) = 120598 120595 (119909) 119894 = 01 hellip (Ie9)

Thus

Ψ(119909 119909 ) =1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

minus1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

= (120598 + 120598 )Ψ(119909 119909 )

(Ie10)

And we see that indeed Ψ(x x ) is an eigenstate with energy E = ϵ + ϵ

vii Correlation in action a wave function of 2 interacting particles in an

Harmonic trap

We now build a simple electronic structure model that will allow us to study

in some detail the most basic concepts For this we suppose that the electrons

are in a harmonic atom that is the potential well

119907 (119909) =1

2120583120596 119909 (Ie11)

The two lowest eigenstates of a Harmonic oscillator are

120595 (119909) = 119873 119890

120595 (119909) = 119873 radic120583120596

ℏ119909119890

(Ie12)

The normalization constants are

119873 = (120583120596

120587ℏ)

119873 = (4120583120596

120587ℏ)

(Ie13)

And the eigenvalues are

120598 = (119899 +1

2)ℏ120596 119899 = 01 hellip (Ie14)

The groundstate energy of the 2-electron system in the triplet state describes

one electron in 120595 (119909) and another in 120595 (119909)

Electron Density Functional Theory Page 19

copy Roi Baer

119864 = 120598 + 120598 = 2ℏ120596 (Ie15)

As discussed above singlet and triplet two-electron ground state wave func-

tions composed of two orbitals must be space-symmetric or antisymmetric

respectively We consider below 3 wave functions The first Ψ is the ground

state singlet where both electrons are in 120595 The second and third are a singlet

and a triplet made from one electron in 120595 and the other in 120595

Ψ (119909 119909 ) = 120595 (119909 )120595 (119909 ) = 119873 119890

ℏ(

)

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) + 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 + 119909 )

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 minus 119909 )

(Ie16)

The normalization factors are

119873 = 119873 = radic

120583120596

120587ℏ 119873 =

1

radic2radic120583120596

ℏ119873 119873 =

120583120596

120587ℏ

(Ie17)

Eq lrm(Ie16) describes the distribution of positions of both electrons in their

corresponding states We now want to ask how much are the electrons in this

state aware of each other Do they correlate their motion in some way One

way to measure correlation is to consider the quantity lang119909 119909 rang minus lang119909 ranglang119909 rang If

electrons are completely unaware of each other this quantity called the posi-

tion-autocorrelation function is zero because then the average of the product

of their position must decompose to the product of the average Any deviance

from zero indicates some degree of correlation For the triplet wave function

in Eq lrm(Ie16) we find

Electron Density Functional Theory Page 20

copy Roi Baer

lang119909 rang = ⟨Ψ |119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩)

=1

2(⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩) = 0

(Ie18)

Of-course the same result would be obtained if we calculate ⟨119909 ⟩ because the

electrons are equivalent Furthermore

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩)

= minus|⟨120595 |119909|120595 ⟩| = minus

1

2

120583120596

(Ie19)

We thus find that

lang119909 119909 rang minus lang119909 ranglang119909 rang = minus1

2

120583120596

(Ie20)

The position autocorrelation functional is non-zero even though the electrons

are non-interacting The correlation is due to the Pauli principle It is negative

because the Pauli principle pushes the electrons apart and since most of the

time they both want to be near the origin this gives preference to them being

placed on opposite sides (when one electron has positive x coordinate the

other has negative and vice versa) Letrsquos see what happens in the singlet wave

function Ψ Here too lang119909 rang = lang119909 rang = 0 Then

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ = ⟨120595 120595 |119909 119909 |120595 120595 ⟩

= ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ = 0 (Ie21)

Electron Density Functional Theory Page 21

copy Roi Baer

The singlet ground state shows no correlation because the Pauli principle is

applied here to the spins not the spatial part of the wave function Note how-

ever this does not mean that all singlet wave functions of 2 electrons have no

correlation Indeed this is the situation in Ψ where the development is

very similar to Eq lrm(Ie19) except that the minus sign is now a plus sign so

lang119909 119909 rang minus lang119909 ranglang119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ minus 0

=1

2⟨120595 120595 + 120595 120595 |119909 119909 |120595 120595 + 120595 120595 ⟩ = |⟨120595 |119909|120595 ⟩|

=1

2

120583120596

(Ie22)

Here we find positive autocorrelation indicative of the fact that spin-opposite

non-interacting pairs of electrons want to stick together (when one has posi-

tive x the other wants to have as well and when one has negative the other

one wants negative as well) like non-interacting bosons

Since there is no interaction between the electrons the correlation in these

wave functions arises only from the Pauli principle ie because we impose

the fact that electrons are Fermions This is called Fermi correlation Our les-

son is this

1) Wavefunctions that are mere products of singe-particle orbitals have no correlation 2) If the products are symmetrized like in the case of the excited singlet the correlation

lang119909 119909 rang is positive indicating that the particles ldquolike to be togetherrdquo ie both on the right or both on the left of the origin

3) If the products are anti-symmetrized like in the case of the triplet the correlation lang119909 119909 rang is negative indicating that the particles ldquolike to be separatedrdquo

Up to now we assumed no e-e interaction So now letrsquos include it and add to

the Hamiltonian an interaction term

= ℎ(1) + ℎ(2) + 119881(119909 119909 ) (Ie23)

Electron Density Functional Theory Page 22

copy Roi Baer

Let us take one case of coupling which is simple enough to yield to analytical

analysis

119881(119909 119909 ) = 120583120574 119909 119909 (Ie24)

With 120574 lt 120596 In this case the Hamiltonian is

=119901

2120583+

119901

2120583+

1

2120583120596 119909

+1

2120583120596 119909

+ 120583120574 119909 119909 (Ie25)

The interaction seems strange at first sight because it does not seem to depend

on the distance between the particles as we are used to from electrostatics

yet it does describe a repulsion We can see this by writing the Hamiltonian

as

=119901

2120583+

119901

2120583+

1

2120583(120596 + 120574 )119909

+1

2120583(120596 + 120574 )119909

minus1

2120583120574 (119909 minus 119909 )

(Ie26)

We now see that the Hamiltonian describes two particles in a Harmonic po-

tential 119881 (119909) =

120583(120596 + 120574 )119909 interacting by a Harmonic potential

minus

120583120574 (119909 minus 119909 )

The potential drops as |119909 minus 119909 | grows ie it is repulsive

Exercise lrmI-1

Find the eigenvalues and eigenfunctions of this Hamiltonian (You can define

new coordinates 119883 and 119909 by

radic = 119909 and

radic = 119909 )

Exercise lrmI-2

Write the ground-state energy and wave function for the triplet state of the

system in the previous exercise Determine the effect of interaction on the en-

ergy by calculating 119903 =

and on the correlation function 119888 = lang119909 119909 rang minus

lang119909 ranglang119909 rang

Solution The result is shown in Figure lrmI-2 (left panel)

Electron Density Functional Theory Page 23

copy Roi Baer

Figure lrmI-2 (Left panel) The ratio = (Right panel) the correlation ratio ( ) (

)

vs interaction strengths where = (

)

f) The electron density is a much simpler object

than the wave function

The complexity of the electronic wave function is overwhelming It includes

all the information we can have on the electrons in a molecule in a certain

state However all these intricacies and details are often uninteresting for us

in many cases we simply have no direct use for them Take for instance the

electronic energy - our primary target in the Born-Oppenheimer picture It

depends only on the relative distance of pairs of particles This is because the

electron-electron Coulomb repulsion is a pair-wise interaction

One interesting quantity besides energy that can be extracted from the elec-

tronic wave function is the electron density 119899(119955) This 3D function tells us the

expectation value of the density of electrons That is 119899(119955)119889 119903 is the expectation

value of the number of electrons at a small volume 119889 119903 around point r Thus

we can write

119899(119955) = ⟨120595|(119955)|120595⟩ (If1)

We use the notation

00 01 02 03 04 05

04

05

06

07

08

09

10

r

00 01 02 03 04 050

5

10

15

cc

2

Electron Density Functional Theory Page 24

copy Roi Baer

⟨120595|120601⟩ equiv int120595 (119955 119955 hellip 119955 )120601(119955 119955 hellip 119955

)119889 119903 119889 119903 hellip119889 119903

(If2)

Here (119955) is the operator corresponding to the electron number density Since

electrons are point particles and the position operator for the ith electron is 119955

this operator is defined by

(119955) = sum120575( minus 119955)

(If3)

We used the definition of a 120575-function according to which

int120575(119955 minus 119955)119891(119955 )119889 119903 = 119891(119955)

(If4)

The function 120575( minus 119955) is the density of electron 119894 at 119955

Exercise lrmI-3

Calculate the expectation value of the electron number density and show that

119899(119955) = 119873 int120595(119955 119955 hellip 119955 )120595(119955 119955 hellip 119955

)119889 119903 hellip119889 119903

(If5)

Looking at Eq lrm(If5) we see that 119899(119955) involves integrating out a huge amount

of wave-function details It is as if only the data concerning the density distri-

bution of a single electron remains This is multiplied by 119873 in Eq lrm(If5) so

119899(119955) accounts for the combined density of all electrons Indeed integrating

over the entire space one obtains from lrm(If5)

int119899(119955)119889 119903 = 119873 (If6)

Expressing the fact that the total number of electrons is 119873

Exercise lrmI-4

Calculate and plot the 1D electron density of the triplet ground state from Ex-

ercise lrmI-2 as a function of the interaction parameter

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 15: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 15

copy Roi Baer

nute as we need Still we can hope that the fact that we chose 119877 to be slow de-

grees of freedom allow us to make just this approximation This was the idea

of Born and Oppenheimer who neglected 120591

summinusℏ

2119872

120597

120597119877 Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877)

The resulting equation is a Schroumldinger equation for the slow degrees of free-

dom which move under the force of electrons derived from a potential 119882 (119877)

When applied to molecules the slow degrees of freedom are usually the nuclei

and the fast - the electrons There is a problem with neglecting 120591 because of

their non-dynamical effect Taking them into account results in treating them

as a magnetic field

sumℏ

2119872 (119894

120597

120597119877 minus 120591 )

Φ (119877)

+ 119882 (119877)Φ (119877) = 119864 Φ (119877) (Id9)

The BO approximation breaks the molecular SE into two consecutive tasks

First the electronic SE Eq lrm(Id5) must be solved for any relevant clamped po-

sition of nuclei 119877 Then the nuclear equation lrm(Id9)

Further reading on this subject M Baer Beyond Born-Oppenheimer elec-

tronic non-adiabatic coupling terms and conical intersections (Wiley Hobo-

ken NJ 2006)

e) Electron correlation

vi The electronic wave function of two non-interacting electrons

We have stressed that the electronic structure problem is difficult because of

the complexity of the many-body wave function when electrons interact In

order to appreciate this complexity of the electronic wave function let us first

study a simple system of two non-interacting electrons in a 1D atomic well

Electron Density Functional Theory Page 16

copy Roi Baer

We consider an atomic well given by the potential 119907 (119909) and we place in it

an electron The Hamiltonian is

ℎ = minusℏ

2119898 nabla + 119907 (119909)

(Ie1)

Now consider a 2-electron problem Assume we have two electrons Fermi-

ons which are non-interacting in the well The Hamiltonian is

= ℎ(1) + ℎ(2) (Ie2)

The notation ℎ(119894) means the Hamiltonian of Eq lrm(Ie1) with the coordinate of

the i-th electron

What are the eigenstates in this case First since each electron can have a

spin we must decide on the spin of the state For now let us assume the state

is spin-polarized ie that the total spine is 1 both electrons are in spin-up or-

bitals We try the following form as a wave function

Ψ(119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]120572(1)120572(2) (Ie3)

Notice that the spatial part is anti-symmetric while the spin part is symmetric

This renders the entire wave-function anti-symmetric in accordance with the

Pauli principle The notation 120572(1)120572(2) means both electrons have spin projec-

tion up (120572) We do not yet know if and under what conditions this wave func-

tion can actually describe eigenstates of the two electrons the

an lrm(Ie2) We assume that 120595 (119909) and 120595 (119909) are orthonormal This makes the

state normalized since

Electron Density Functional Theory Page 17

copy Roi Baer

intΨ(119909 119909 ) 119889119909 119889119909 =

1

2int(120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 ))

119889119909 119889119909

=1

2int [(120595 (119909 )120595 (119909 ))

+ (120595 (119909 )120595 (119909 ))

minus 2120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 )] 119889119909 119889119909

(Ie4)

The first and second terms are both equal to

int(120595 (119909 )120595 (119909 )) 119889119909 119889119909 =int120595 (119909 )

119889119909 int120595 (119909 ) 119889119909 =1 times 1

= 1 (Ie5)

The third term in lrm(Ie4) is

int120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 ) 119889119909 119889119909

=int120595 (119909 )120595 (119909 ) 119889119909 int120595 (119909 )120595 (119909 ) 119889119909 = 0 times 0 = 0 (Ie6)

Indeed the entire wave function is orthonormal (thanks to the factor 1radic2

in lrm(Ie3))

Now let us see under what condition the wave function in Eq (Ie3) is an ei-

genstate of the Hamiltonian in lrm(Ie2)

Ψ(119909 119909 ) = (ℎ(1) + ℎ(2))Ψ(119909 119909 )

=1

radic2120595 (119909 )ℎ(1)120595 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

minus1

radic2120595 (119909 )ℎ(1)120601 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

(Ie7)

This should be equated to

119864Ψ(119909 119909 ) =1

radic2119864120595 (119909 )120595 (119909 ) minus 119864120595 (119909 )120595 (119909 ) (Ie8)

If we choose the orbitals 120595 (119909) and 120595 (119909) to be eigenstates of h (which are

orthogonal so that is compatible with our previous assumption)

Electron Density Functional Theory Page 18

copy Roi Baer

ℎ120595 (119909) = 120598 120595 (119909) 119894 = 01 hellip (Ie9)

Thus

Ψ(119909 119909 ) =1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

minus1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

= (120598 + 120598 )Ψ(119909 119909 )

(Ie10)

And we see that indeed Ψ(x x ) is an eigenstate with energy E = ϵ + ϵ

vii Correlation in action a wave function of 2 interacting particles in an

Harmonic trap

We now build a simple electronic structure model that will allow us to study

in some detail the most basic concepts For this we suppose that the electrons

are in a harmonic atom that is the potential well

119907 (119909) =1

2120583120596 119909 (Ie11)

The two lowest eigenstates of a Harmonic oscillator are

120595 (119909) = 119873 119890

120595 (119909) = 119873 radic120583120596

ℏ119909119890

(Ie12)

The normalization constants are

119873 = (120583120596

120587ℏ)

119873 = (4120583120596

120587ℏ)

(Ie13)

And the eigenvalues are

120598 = (119899 +1

2)ℏ120596 119899 = 01 hellip (Ie14)

The groundstate energy of the 2-electron system in the triplet state describes

one electron in 120595 (119909) and another in 120595 (119909)

Electron Density Functional Theory Page 19

copy Roi Baer

119864 = 120598 + 120598 = 2ℏ120596 (Ie15)

As discussed above singlet and triplet two-electron ground state wave func-

tions composed of two orbitals must be space-symmetric or antisymmetric

respectively We consider below 3 wave functions The first Ψ is the ground

state singlet where both electrons are in 120595 The second and third are a singlet

and a triplet made from one electron in 120595 and the other in 120595

Ψ (119909 119909 ) = 120595 (119909 )120595 (119909 ) = 119873 119890

ℏ(

)

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) + 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 + 119909 )

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 minus 119909 )

(Ie16)

The normalization factors are

119873 = 119873 = radic

120583120596

120587ℏ 119873 =

1

radic2radic120583120596

ℏ119873 119873 =

120583120596

120587ℏ

(Ie17)

Eq lrm(Ie16) describes the distribution of positions of both electrons in their

corresponding states We now want to ask how much are the electrons in this

state aware of each other Do they correlate their motion in some way One

way to measure correlation is to consider the quantity lang119909 119909 rang minus lang119909 ranglang119909 rang If

electrons are completely unaware of each other this quantity called the posi-

tion-autocorrelation function is zero because then the average of the product

of their position must decompose to the product of the average Any deviance

from zero indicates some degree of correlation For the triplet wave function

in Eq lrm(Ie16) we find

Electron Density Functional Theory Page 20

copy Roi Baer

lang119909 rang = ⟨Ψ |119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩)

=1

2(⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩) = 0

(Ie18)

Of-course the same result would be obtained if we calculate ⟨119909 ⟩ because the

electrons are equivalent Furthermore

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩)

= minus|⟨120595 |119909|120595 ⟩| = minus

1

2

120583120596

(Ie19)

We thus find that

lang119909 119909 rang minus lang119909 ranglang119909 rang = minus1

2

120583120596

(Ie20)

The position autocorrelation functional is non-zero even though the electrons

are non-interacting The correlation is due to the Pauli principle It is negative

because the Pauli principle pushes the electrons apart and since most of the

time they both want to be near the origin this gives preference to them being

placed on opposite sides (when one electron has positive x coordinate the

other has negative and vice versa) Letrsquos see what happens in the singlet wave

function Ψ Here too lang119909 rang = lang119909 rang = 0 Then

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ = ⟨120595 120595 |119909 119909 |120595 120595 ⟩

= ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ = 0 (Ie21)

Electron Density Functional Theory Page 21

copy Roi Baer

The singlet ground state shows no correlation because the Pauli principle is

applied here to the spins not the spatial part of the wave function Note how-

ever this does not mean that all singlet wave functions of 2 electrons have no

correlation Indeed this is the situation in Ψ where the development is

very similar to Eq lrm(Ie19) except that the minus sign is now a plus sign so

lang119909 119909 rang minus lang119909 ranglang119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ minus 0

=1

2⟨120595 120595 + 120595 120595 |119909 119909 |120595 120595 + 120595 120595 ⟩ = |⟨120595 |119909|120595 ⟩|

=1

2

120583120596

(Ie22)

Here we find positive autocorrelation indicative of the fact that spin-opposite

non-interacting pairs of electrons want to stick together (when one has posi-

tive x the other wants to have as well and when one has negative the other

one wants negative as well) like non-interacting bosons

Since there is no interaction between the electrons the correlation in these

wave functions arises only from the Pauli principle ie because we impose

the fact that electrons are Fermions This is called Fermi correlation Our les-

son is this

1) Wavefunctions that are mere products of singe-particle orbitals have no correlation 2) If the products are symmetrized like in the case of the excited singlet the correlation

lang119909 119909 rang is positive indicating that the particles ldquolike to be togetherrdquo ie both on the right or both on the left of the origin

3) If the products are anti-symmetrized like in the case of the triplet the correlation lang119909 119909 rang is negative indicating that the particles ldquolike to be separatedrdquo

Up to now we assumed no e-e interaction So now letrsquos include it and add to

the Hamiltonian an interaction term

= ℎ(1) + ℎ(2) + 119881(119909 119909 ) (Ie23)

Electron Density Functional Theory Page 22

copy Roi Baer

Let us take one case of coupling which is simple enough to yield to analytical

analysis

119881(119909 119909 ) = 120583120574 119909 119909 (Ie24)

With 120574 lt 120596 In this case the Hamiltonian is

=119901

2120583+

119901

2120583+

1

2120583120596 119909

+1

2120583120596 119909

+ 120583120574 119909 119909 (Ie25)

The interaction seems strange at first sight because it does not seem to depend

on the distance between the particles as we are used to from electrostatics

yet it does describe a repulsion We can see this by writing the Hamiltonian

as

=119901

2120583+

119901

2120583+

1

2120583(120596 + 120574 )119909

+1

2120583(120596 + 120574 )119909

minus1

2120583120574 (119909 minus 119909 )

(Ie26)

We now see that the Hamiltonian describes two particles in a Harmonic po-

tential 119881 (119909) =

120583(120596 + 120574 )119909 interacting by a Harmonic potential

minus

120583120574 (119909 minus 119909 )

The potential drops as |119909 minus 119909 | grows ie it is repulsive

Exercise lrmI-1

Find the eigenvalues and eigenfunctions of this Hamiltonian (You can define

new coordinates 119883 and 119909 by

radic = 119909 and

radic = 119909 )

Exercise lrmI-2

Write the ground-state energy and wave function for the triplet state of the

system in the previous exercise Determine the effect of interaction on the en-

ergy by calculating 119903 =

and on the correlation function 119888 = lang119909 119909 rang minus

lang119909 ranglang119909 rang

Solution The result is shown in Figure lrmI-2 (left panel)

Electron Density Functional Theory Page 23

copy Roi Baer

Figure lrmI-2 (Left panel) The ratio = (Right panel) the correlation ratio ( ) (

)

vs interaction strengths where = (

)

f) The electron density is a much simpler object

than the wave function

The complexity of the electronic wave function is overwhelming It includes

all the information we can have on the electrons in a molecule in a certain

state However all these intricacies and details are often uninteresting for us

in many cases we simply have no direct use for them Take for instance the

electronic energy - our primary target in the Born-Oppenheimer picture It

depends only on the relative distance of pairs of particles This is because the

electron-electron Coulomb repulsion is a pair-wise interaction

One interesting quantity besides energy that can be extracted from the elec-

tronic wave function is the electron density 119899(119955) This 3D function tells us the

expectation value of the density of electrons That is 119899(119955)119889 119903 is the expectation

value of the number of electrons at a small volume 119889 119903 around point r Thus

we can write

119899(119955) = ⟨120595|(119955)|120595⟩ (If1)

We use the notation

00 01 02 03 04 05

04

05

06

07

08

09

10

r

00 01 02 03 04 050

5

10

15

cc

2

Electron Density Functional Theory Page 24

copy Roi Baer

⟨120595|120601⟩ equiv int120595 (119955 119955 hellip 119955 )120601(119955 119955 hellip 119955

)119889 119903 119889 119903 hellip119889 119903

(If2)

Here (119955) is the operator corresponding to the electron number density Since

electrons are point particles and the position operator for the ith electron is 119955

this operator is defined by

(119955) = sum120575( minus 119955)

(If3)

We used the definition of a 120575-function according to which

int120575(119955 minus 119955)119891(119955 )119889 119903 = 119891(119955)

(If4)

The function 120575( minus 119955) is the density of electron 119894 at 119955

Exercise lrmI-3

Calculate the expectation value of the electron number density and show that

119899(119955) = 119873 int120595(119955 119955 hellip 119955 )120595(119955 119955 hellip 119955

)119889 119903 hellip119889 119903

(If5)

Looking at Eq lrm(If5) we see that 119899(119955) involves integrating out a huge amount

of wave-function details It is as if only the data concerning the density distri-

bution of a single electron remains This is multiplied by 119873 in Eq lrm(If5) so

119899(119955) accounts for the combined density of all electrons Indeed integrating

over the entire space one obtains from lrm(If5)

int119899(119955)119889 119903 = 119873 (If6)

Expressing the fact that the total number of electrons is 119873

Exercise lrmI-4

Calculate and plot the 1D electron density of the triplet ground state from Ex-

ercise lrmI-2 as a function of the interaction parameter

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 16: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 16

copy Roi Baer

We consider an atomic well given by the potential 119907 (119909) and we place in it

an electron The Hamiltonian is

ℎ = minusℏ

2119898 nabla + 119907 (119909)

(Ie1)

Now consider a 2-electron problem Assume we have two electrons Fermi-

ons which are non-interacting in the well The Hamiltonian is

= ℎ(1) + ℎ(2) (Ie2)

The notation ℎ(119894) means the Hamiltonian of Eq lrm(Ie1) with the coordinate of

the i-th electron

What are the eigenstates in this case First since each electron can have a

spin we must decide on the spin of the state For now let us assume the state

is spin-polarized ie that the total spine is 1 both electrons are in spin-up or-

bitals We try the following form as a wave function

Ψ(119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]120572(1)120572(2) (Ie3)

Notice that the spatial part is anti-symmetric while the spin part is symmetric

This renders the entire wave-function anti-symmetric in accordance with the

Pauli principle The notation 120572(1)120572(2) means both electrons have spin projec-

tion up (120572) We do not yet know if and under what conditions this wave func-

tion can actually describe eigenstates of the two electrons the

an lrm(Ie2) We assume that 120595 (119909) and 120595 (119909) are orthonormal This makes the

state normalized since

Electron Density Functional Theory Page 17

copy Roi Baer

intΨ(119909 119909 ) 119889119909 119889119909 =

1

2int(120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 ))

119889119909 119889119909

=1

2int [(120595 (119909 )120595 (119909 ))

+ (120595 (119909 )120595 (119909 ))

minus 2120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 )] 119889119909 119889119909

(Ie4)

The first and second terms are both equal to

int(120595 (119909 )120595 (119909 )) 119889119909 119889119909 =int120595 (119909 )

119889119909 int120595 (119909 ) 119889119909 =1 times 1

= 1 (Ie5)

The third term in lrm(Ie4) is

int120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 ) 119889119909 119889119909

=int120595 (119909 )120595 (119909 ) 119889119909 int120595 (119909 )120595 (119909 ) 119889119909 = 0 times 0 = 0 (Ie6)

Indeed the entire wave function is orthonormal (thanks to the factor 1radic2

in lrm(Ie3))

Now let us see under what condition the wave function in Eq (Ie3) is an ei-

genstate of the Hamiltonian in lrm(Ie2)

Ψ(119909 119909 ) = (ℎ(1) + ℎ(2))Ψ(119909 119909 )

=1

radic2120595 (119909 )ℎ(1)120595 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

minus1

radic2120595 (119909 )ℎ(1)120601 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

(Ie7)

This should be equated to

119864Ψ(119909 119909 ) =1

radic2119864120595 (119909 )120595 (119909 ) minus 119864120595 (119909 )120595 (119909 ) (Ie8)

If we choose the orbitals 120595 (119909) and 120595 (119909) to be eigenstates of h (which are

orthogonal so that is compatible with our previous assumption)

Electron Density Functional Theory Page 18

copy Roi Baer

ℎ120595 (119909) = 120598 120595 (119909) 119894 = 01 hellip (Ie9)

Thus

Ψ(119909 119909 ) =1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

minus1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

= (120598 + 120598 )Ψ(119909 119909 )

(Ie10)

And we see that indeed Ψ(x x ) is an eigenstate with energy E = ϵ + ϵ

vii Correlation in action a wave function of 2 interacting particles in an

Harmonic trap

We now build a simple electronic structure model that will allow us to study

in some detail the most basic concepts For this we suppose that the electrons

are in a harmonic atom that is the potential well

119907 (119909) =1

2120583120596 119909 (Ie11)

The two lowest eigenstates of a Harmonic oscillator are

120595 (119909) = 119873 119890

120595 (119909) = 119873 radic120583120596

ℏ119909119890

(Ie12)

The normalization constants are

119873 = (120583120596

120587ℏ)

119873 = (4120583120596

120587ℏ)

(Ie13)

And the eigenvalues are

120598 = (119899 +1

2)ℏ120596 119899 = 01 hellip (Ie14)

The groundstate energy of the 2-electron system in the triplet state describes

one electron in 120595 (119909) and another in 120595 (119909)

Electron Density Functional Theory Page 19

copy Roi Baer

119864 = 120598 + 120598 = 2ℏ120596 (Ie15)

As discussed above singlet and triplet two-electron ground state wave func-

tions composed of two orbitals must be space-symmetric or antisymmetric

respectively We consider below 3 wave functions The first Ψ is the ground

state singlet where both electrons are in 120595 The second and third are a singlet

and a triplet made from one electron in 120595 and the other in 120595

Ψ (119909 119909 ) = 120595 (119909 )120595 (119909 ) = 119873 119890

ℏ(

)

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) + 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 + 119909 )

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 minus 119909 )

(Ie16)

The normalization factors are

119873 = 119873 = radic

120583120596

120587ℏ 119873 =

1

radic2radic120583120596

ℏ119873 119873 =

120583120596

120587ℏ

(Ie17)

Eq lrm(Ie16) describes the distribution of positions of both electrons in their

corresponding states We now want to ask how much are the electrons in this

state aware of each other Do they correlate their motion in some way One

way to measure correlation is to consider the quantity lang119909 119909 rang minus lang119909 ranglang119909 rang If

electrons are completely unaware of each other this quantity called the posi-

tion-autocorrelation function is zero because then the average of the product

of their position must decompose to the product of the average Any deviance

from zero indicates some degree of correlation For the triplet wave function

in Eq lrm(Ie16) we find

Electron Density Functional Theory Page 20

copy Roi Baer

lang119909 rang = ⟨Ψ |119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩)

=1

2(⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩) = 0

(Ie18)

Of-course the same result would be obtained if we calculate ⟨119909 ⟩ because the

electrons are equivalent Furthermore

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩)

= minus|⟨120595 |119909|120595 ⟩| = minus

1

2

120583120596

(Ie19)

We thus find that

lang119909 119909 rang minus lang119909 ranglang119909 rang = minus1

2

120583120596

(Ie20)

The position autocorrelation functional is non-zero even though the electrons

are non-interacting The correlation is due to the Pauli principle It is negative

because the Pauli principle pushes the electrons apart and since most of the

time they both want to be near the origin this gives preference to them being

placed on opposite sides (when one electron has positive x coordinate the

other has negative and vice versa) Letrsquos see what happens in the singlet wave

function Ψ Here too lang119909 rang = lang119909 rang = 0 Then

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ = ⟨120595 120595 |119909 119909 |120595 120595 ⟩

= ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ = 0 (Ie21)

Electron Density Functional Theory Page 21

copy Roi Baer

The singlet ground state shows no correlation because the Pauli principle is

applied here to the spins not the spatial part of the wave function Note how-

ever this does not mean that all singlet wave functions of 2 electrons have no

correlation Indeed this is the situation in Ψ where the development is

very similar to Eq lrm(Ie19) except that the minus sign is now a plus sign so

lang119909 119909 rang minus lang119909 ranglang119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ minus 0

=1

2⟨120595 120595 + 120595 120595 |119909 119909 |120595 120595 + 120595 120595 ⟩ = |⟨120595 |119909|120595 ⟩|

=1

2

120583120596

(Ie22)

Here we find positive autocorrelation indicative of the fact that spin-opposite

non-interacting pairs of electrons want to stick together (when one has posi-

tive x the other wants to have as well and when one has negative the other

one wants negative as well) like non-interacting bosons

Since there is no interaction between the electrons the correlation in these

wave functions arises only from the Pauli principle ie because we impose

the fact that electrons are Fermions This is called Fermi correlation Our les-

son is this

1) Wavefunctions that are mere products of singe-particle orbitals have no correlation 2) If the products are symmetrized like in the case of the excited singlet the correlation

lang119909 119909 rang is positive indicating that the particles ldquolike to be togetherrdquo ie both on the right or both on the left of the origin

3) If the products are anti-symmetrized like in the case of the triplet the correlation lang119909 119909 rang is negative indicating that the particles ldquolike to be separatedrdquo

Up to now we assumed no e-e interaction So now letrsquos include it and add to

the Hamiltonian an interaction term

= ℎ(1) + ℎ(2) + 119881(119909 119909 ) (Ie23)

Electron Density Functional Theory Page 22

copy Roi Baer

Let us take one case of coupling which is simple enough to yield to analytical

analysis

119881(119909 119909 ) = 120583120574 119909 119909 (Ie24)

With 120574 lt 120596 In this case the Hamiltonian is

=119901

2120583+

119901

2120583+

1

2120583120596 119909

+1

2120583120596 119909

+ 120583120574 119909 119909 (Ie25)

The interaction seems strange at first sight because it does not seem to depend

on the distance between the particles as we are used to from electrostatics

yet it does describe a repulsion We can see this by writing the Hamiltonian

as

=119901

2120583+

119901

2120583+

1

2120583(120596 + 120574 )119909

+1

2120583(120596 + 120574 )119909

minus1

2120583120574 (119909 minus 119909 )

(Ie26)

We now see that the Hamiltonian describes two particles in a Harmonic po-

tential 119881 (119909) =

120583(120596 + 120574 )119909 interacting by a Harmonic potential

minus

120583120574 (119909 minus 119909 )

The potential drops as |119909 minus 119909 | grows ie it is repulsive

Exercise lrmI-1

Find the eigenvalues and eigenfunctions of this Hamiltonian (You can define

new coordinates 119883 and 119909 by

radic = 119909 and

radic = 119909 )

Exercise lrmI-2

Write the ground-state energy and wave function for the triplet state of the

system in the previous exercise Determine the effect of interaction on the en-

ergy by calculating 119903 =

and on the correlation function 119888 = lang119909 119909 rang minus

lang119909 ranglang119909 rang

Solution The result is shown in Figure lrmI-2 (left panel)

Electron Density Functional Theory Page 23

copy Roi Baer

Figure lrmI-2 (Left panel) The ratio = (Right panel) the correlation ratio ( ) (

)

vs interaction strengths where = (

)

f) The electron density is a much simpler object

than the wave function

The complexity of the electronic wave function is overwhelming It includes

all the information we can have on the electrons in a molecule in a certain

state However all these intricacies and details are often uninteresting for us

in many cases we simply have no direct use for them Take for instance the

electronic energy - our primary target in the Born-Oppenheimer picture It

depends only on the relative distance of pairs of particles This is because the

electron-electron Coulomb repulsion is a pair-wise interaction

One interesting quantity besides energy that can be extracted from the elec-

tronic wave function is the electron density 119899(119955) This 3D function tells us the

expectation value of the density of electrons That is 119899(119955)119889 119903 is the expectation

value of the number of electrons at a small volume 119889 119903 around point r Thus

we can write

119899(119955) = ⟨120595|(119955)|120595⟩ (If1)

We use the notation

00 01 02 03 04 05

04

05

06

07

08

09

10

r

00 01 02 03 04 050

5

10

15

cc

2

Electron Density Functional Theory Page 24

copy Roi Baer

⟨120595|120601⟩ equiv int120595 (119955 119955 hellip 119955 )120601(119955 119955 hellip 119955

)119889 119903 119889 119903 hellip119889 119903

(If2)

Here (119955) is the operator corresponding to the electron number density Since

electrons are point particles and the position operator for the ith electron is 119955

this operator is defined by

(119955) = sum120575( minus 119955)

(If3)

We used the definition of a 120575-function according to which

int120575(119955 minus 119955)119891(119955 )119889 119903 = 119891(119955)

(If4)

The function 120575( minus 119955) is the density of electron 119894 at 119955

Exercise lrmI-3

Calculate the expectation value of the electron number density and show that

119899(119955) = 119873 int120595(119955 119955 hellip 119955 )120595(119955 119955 hellip 119955

)119889 119903 hellip119889 119903

(If5)

Looking at Eq lrm(If5) we see that 119899(119955) involves integrating out a huge amount

of wave-function details It is as if only the data concerning the density distri-

bution of a single electron remains This is multiplied by 119873 in Eq lrm(If5) so

119899(119955) accounts for the combined density of all electrons Indeed integrating

over the entire space one obtains from lrm(If5)

int119899(119955)119889 119903 = 119873 (If6)

Expressing the fact that the total number of electrons is 119873

Exercise lrmI-4

Calculate and plot the 1D electron density of the triplet ground state from Ex-

ercise lrmI-2 as a function of the interaction parameter

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 17: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 17

copy Roi Baer

intΨ(119909 119909 ) 119889119909 119889119909 =

1

2int(120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 ))

119889119909 119889119909

=1

2int [(120595 (119909 )120595 (119909 ))

+ (120595 (119909 )120595 (119909 ))

minus 2120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 )] 119889119909 119889119909

(Ie4)

The first and second terms are both equal to

int(120595 (119909 )120595 (119909 )) 119889119909 119889119909 =int120595 (119909 )

119889119909 int120595 (119909 ) 119889119909 =1 times 1

= 1 (Ie5)

The third term in lrm(Ie4) is

int120595 (119909 )120595 (119909 )120595 (119909 )120595 (119909 ) 119889119909 119889119909

=int120595 (119909 )120595 (119909 ) 119889119909 int120595 (119909 )120595 (119909 ) 119889119909 = 0 times 0 = 0 (Ie6)

Indeed the entire wave function is orthonormal (thanks to the factor 1radic2

in lrm(Ie3))

Now let us see under what condition the wave function in Eq (Ie3) is an ei-

genstate of the Hamiltonian in lrm(Ie2)

Ψ(119909 119909 ) = (ℎ(1) + ℎ(2))Ψ(119909 119909 )

=1

radic2120595 (119909 )ℎ(1)120595 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

minus1

radic2120595 (119909 )ℎ(1)120601 (119909 ) + 120595 (119909 )ℎ(2)120595 (119909 )

(Ie7)

This should be equated to

119864Ψ(119909 119909 ) =1

radic2119864120595 (119909 )120595 (119909 ) minus 119864120595 (119909 )120595 (119909 ) (Ie8)

If we choose the orbitals 120595 (119909) and 120595 (119909) to be eigenstates of h (which are

orthogonal so that is compatible with our previous assumption)

Electron Density Functional Theory Page 18

copy Roi Baer

ℎ120595 (119909) = 120598 120595 (119909) 119894 = 01 hellip (Ie9)

Thus

Ψ(119909 119909 ) =1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

minus1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

= (120598 + 120598 )Ψ(119909 119909 )

(Ie10)

And we see that indeed Ψ(x x ) is an eigenstate with energy E = ϵ + ϵ

vii Correlation in action a wave function of 2 interacting particles in an

Harmonic trap

We now build a simple electronic structure model that will allow us to study

in some detail the most basic concepts For this we suppose that the electrons

are in a harmonic atom that is the potential well

119907 (119909) =1

2120583120596 119909 (Ie11)

The two lowest eigenstates of a Harmonic oscillator are

120595 (119909) = 119873 119890

120595 (119909) = 119873 radic120583120596

ℏ119909119890

(Ie12)

The normalization constants are

119873 = (120583120596

120587ℏ)

119873 = (4120583120596

120587ℏ)

(Ie13)

And the eigenvalues are

120598 = (119899 +1

2)ℏ120596 119899 = 01 hellip (Ie14)

The groundstate energy of the 2-electron system in the triplet state describes

one electron in 120595 (119909) and another in 120595 (119909)

Electron Density Functional Theory Page 19

copy Roi Baer

119864 = 120598 + 120598 = 2ℏ120596 (Ie15)

As discussed above singlet and triplet two-electron ground state wave func-

tions composed of two orbitals must be space-symmetric or antisymmetric

respectively We consider below 3 wave functions The first Ψ is the ground

state singlet where both electrons are in 120595 The second and third are a singlet

and a triplet made from one electron in 120595 and the other in 120595

Ψ (119909 119909 ) = 120595 (119909 )120595 (119909 ) = 119873 119890

ℏ(

)

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) + 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 + 119909 )

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 minus 119909 )

(Ie16)

The normalization factors are

119873 = 119873 = radic

120583120596

120587ℏ 119873 =

1

radic2radic120583120596

ℏ119873 119873 =

120583120596

120587ℏ

(Ie17)

Eq lrm(Ie16) describes the distribution of positions of both electrons in their

corresponding states We now want to ask how much are the electrons in this

state aware of each other Do they correlate their motion in some way One

way to measure correlation is to consider the quantity lang119909 119909 rang minus lang119909 ranglang119909 rang If

electrons are completely unaware of each other this quantity called the posi-

tion-autocorrelation function is zero because then the average of the product

of their position must decompose to the product of the average Any deviance

from zero indicates some degree of correlation For the triplet wave function

in Eq lrm(Ie16) we find

Electron Density Functional Theory Page 20

copy Roi Baer

lang119909 rang = ⟨Ψ |119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩)

=1

2(⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩) = 0

(Ie18)

Of-course the same result would be obtained if we calculate ⟨119909 ⟩ because the

electrons are equivalent Furthermore

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩)

= minus|⟨120595 |119909|120595 ⟩| = minus

1

2

120583120596

(Ie19)

We thus find that

lang119909 119909 rang minus lang119909 ranglang119909 rang = minus1

2

120583120596

(Ie20)

The position autocorrelation functional is non-zero even though the electrons

are non-interacting The correlation is due to the Pauli principle It is negative

because the Pauli principle pushes the electrons apart and since most of the

time they both want to be near the origin this gives preference to them being

placed on opposite sides (when one electron has positive x coordinate the

other has negative and vice versa) Letrsquos see what happens in the singlet wave

function Ψ Here too lang119909 rang = lang119909 rang = 0 Then

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ = ⟨120595 120595 |119909 119909 |120595 120595 ⟩

= ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ = 0 (Ie21)

Electron Density Functional Theory Page 21

copy Roi Baer

The singlet ground state shows no correlation because the Pauli principle is

applied here to the spins not the spatial part of the wave function Note how-

ever this does not mean that all singlet wave functions of 2 electrons have no

correlation Indeed this is the situation in Ψ where the development is

very similar to Eq lrm(Ie19) except that the minus sign is now a plus sign so

lang119909 119909 rang minus lang119909 ranglang119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ minus 0

=1

2⟨120595 120595 + 120595 120595 |119909 119909 |120595 120595 + 120595 120595 ⟩ = |⟨120595 |119909|120595 ⟩|

=1

2

120583120596

(Ie22)

Here we find positive autocorrelation indicative of the fact that spin-opposite

non-interacting pairs of electrons want to stick together (when one has posi-

tive x the other wants to have as well and when one has negative the other

one wants negative as well) like non-interacting bosons

Since there is no interaction between the electrons the correlation in these

wave functions arises only from the Pauli principle ie because we impose

the fact that electrons are Fermions This is called Fermi correlation Our les-

son is this

1) Wavefunctions that are mere products of singe-particle orbitals have no correlation 2) If the products are symmetrized like in the case of the excited singlet the correlation

lang119909 119909 rang is positive indicating that the particles ldquolike to be togetherrdquo ie both on the right or both on the left of the origin

3) If the products are anti-symmetrized like in the case of the triplet the correlation lang119909 119909 rang is negative indicating that the particles ldquolike to be separatedrdquo

Up to now we assumed no e-e interaction So now letrsquos include it and add to

the Hamiltonian an interaction term

= ℎ(1) + ℎ(2) + 119881(119909 119909 ) (Ie23)

Electron Density Functional Theory Page 22

copy Roi Baer

Let us take one case of coupling which is simple enough to yield to analytical

analysis

119881(119909 119909 ) = 120583120574 119909 119909 (Ie24)

With 120574 lt 120596 In this case the Hamiltonian is

=119901

2120583+

119901

2120583+

1

2120583120596 119909

+1

2120583120596 119909

+ 120583120574 119909 119909 (Ie25)

The interaction seems strange at first sight because it does not seem to depend

on the distance between the particles as we are used to from electrostatics

yet it does describe a repulsion We can see this by writing the Hamiltonian

as

=119901

2120583+

119901

2120583+

1

2120583(120596 + 120574 )119909

+1

2120583(120596 + 120574 )119909

minus1

2120583120574 (119909 minus 119909 )

(Ie26)

We now see that the Hamiltonian describes two particles in a Harmonic po-

tential 119881 (119909) =

120583(120596 + 120574 )119909 interacting by a Harmonic potential

minus

120583120574 (119909 minus 119909 )

The potential drops as |119909 minus 119909 | grows ie it is repulsive

Exercise lrmI-1

Find the eigenvalues and eigenfunctions of this Hamiltonian (You can define

new coordinates 119883 and 119909 by

radic = 119909 and

radic = 119909 )

Exercise lrmI-2

Write the ground-state energy and wave function for the triplet state of the

system in the previous exercise Determine the effect of interaction on the en-

ergy by calculating 119903 =

and on the correlation function 119888 = lang119909 119909 rang minus

lang119909 ranglang119909 rang

Solution The result is shown in Figure lrmI-2 (left panel)

Electron Density Functional Theory Page 23

copy Roi Baer

Figure lrmI-2 (Left panel) The ratio = (Right panel) the correlation ratio ( ) (

)

vs interaction strengths where = (

)

f) The electron density is a much simpler object

than the wave function

The complexity of the electronic wave function is overwhelming It includes

all the information we can have on the electrons in a molecule in a certain

state However all these intricacies and details are often uninteresting for us

in many cases we simply have no direct use for them Take for instance the

electronic energy - our primary target in the Born-Oppenheimer picture It

depends only on the relative distance of pairs of particles This is because the

electron-electron Coulomb repulsion is a pair-wise interaction

One interesting quantity besides energy that can be extracted from the elec-

tronic wave function is the electron density 119899(119955) This 3D function tells us the

expectation value of the density of electrons That is 119899(119955)119889 119903 is the expectation

value of the number of electrons at a small volume 119889 119903 around point r Thus

we can write

119899(119955) = ⟨120595|(119955)|120595⟩ (If1)

We use the notation

00 01 02 03 04 05

04

05

06

07

08

09

10

r

00 01 02 03 04 050

5

10

15

cc

2

Electron Density Functional Theory Page 24

copy Roi Baer

⟨120595|120601⟩ equiv int120595 (119955 119955 hellip 119955 )120601(119955 119955 hellip 119955

)119889 119903 119889 119903 hellip119889 119903

(If2)

Here (119955) is the operator corresponding to the electron number density Since

electrons are point particles and the position operator for the ith electron is 119955

this operator is defined by

(119955) = sum120575( minus 119955)

(If3)

We used the definition of a 120575-function according to which

int120575(119955 minus 119955)119891(119955 )119889 119903 = 119891(119955)

(If4)

The function 120575( minus 119955) is the density of electron 119894 at 119955

Exercise lrmI-3

Calculate the expectation value of the electron number density and show that

119899(119955) = 119873 int120595(119955 119955 hellip 119955 )120595(119955 119955 hellip 119955

)119889 119903 hellip119889 119903

(If5)

Looking at Eq lrm(If5) we see that 119899(119955) involves integrating out a huge amount

of wave-function details It is as if only the data concerning the density distri-

bution of a single electron remains This is multiplied by 119873 in Eq lrm(If5) so

119899(119955) accounts for the combined density of all electrons Indeed integrating

over the entire space one obtains from lrm(If5)

int119899(119955)119889 119903 = 119873 (If6)

Expressing the fact that the total number of electrons is 119873

Exercise lrmI-4

Calculate and plot the 1D electron density of the triplet ground state from Ex-

ercise lrmI-2 as a function of the interaction parameter

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 18: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 18

copy Roi Baer

ℎ120595 (119909) = 120598 120595 (119909) 119894 = 01 hellip (Ie9)

Thus

Ψ(119909 119909 ) =1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

minus1

radic2120598 120595 (119909 )120595 (119909 ) + 120598 120595 (119909 )120595 (119909 )

= (120598 + 120598 )Ψ(119909 119909 )

(Ie10)

And we see that indeed Ψ(x x ) is an eigenstate with energy E = ϵ + ϵ

vii Correlation in action a wave function of 2 interacting particles in an

Harmonic trap

We now build a simple electronic structure model that will allow us to study

in some detail the most basic concepts For this we suppose that the electrons

are in a harmonic atom that is the potential well

119907 (119909) =1

2120583120596 119909 (Ie11)

The two lowest eigenstates of a Harmonic oscillator are

120595 (119909) = 119873 119890

120595 (119909) = 119873 radic120583120596

ℏ119909119890

(Ie12)

The normalization constants are

119873 = (120583120596

120587ℏ)

119873 = (4120583120596

120587ℏ)

(Ie13)

And the eigenvalues are

120598 = (119899 +1

2)ℏ120596 119899 = 01 hellip (Ie14)

The groundstate energy of the 2-electron system in the triplet state describes

one electron in 120595 (119909) and another in 120595 (119909)

Electron Density Functional Theory Page 19

copy Roi Baer

119864 = 120598 + 120598 = 2ℏ120596 (Ie15)

As discussed above singlet and triplet two-electron ground state wave func-

tions composed of two orbitals must be space-symmetric or antisymmetric

respectively We consider below 3 wave functions The first Ψ is the ground

state singlet where both electrons are in 120595 The second and third are a singlet

and a triplet made from one electron in 120595 and the other in 120595

Ψ (119909 119909 ) = 120595 (119909 )120595 (119909 ) = 119873 119890

ℏ(

)

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) + 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 + 119909 )

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 minus 119909 )

(Ie16)

The normalization factors are

119873 = 119873 = radic

120583120596

120587ℏ 119873 =

1

radic2radic120583120596

ℏ119873 119873 =

120583120596

120587ℏ

(Ie17)

Eq lrm(Ie16) describes the distribution of positions of both electrons in their

corresponding states We now want to ask how much are the electrons in this

state aware of each other Do they correlate their motion in some way One

way to measure correlation is to consider the quantity lang119909 119909 rang minus lang119909 ranglang119909 rang If

electrons are completely unaware of each other this quantity called the posi-

tion-autocorrelation function is zero because then the average of the product

of their position must decompose to the product of the average Any deviance

from zero indicates some degree of correlation For the triplet wave function

in Eq lrm(Ie16) we find

Electron Density Functional Theory Page 20

copy Roi Baer

lang119909 rang = ⟨Ψ |119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩)

=1

2(⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩) = 0

(Ie18)

Of-course the same result would be obtained if we calculate ⟨119909 ⟩ because the

electrons are equivalent Furthermore

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩)

= minus|⟨120595 |119909|120595 ⟩| = minus

1

2

120583120596

(Ie19)

We thus find that

lang119909 119909 rang minus lang119909 ranglang119909 rang = minus1

2

120583120596

(Ie20)

The position autocorrelation functional is non-zero even though the electrons

are non-interacting The correlation is due to the Pauli principle It is negative

because the Pauli principle pushes the electrons apart and since most of the

time they both want to be near the origin this gives preference to them being

placed on opposite sides (when one electron has positive x coordinate the

other has negative and vice versa) Letrsquos see what happens in the singlet wave

function Ψ Here too lang119909 rang = lang119909 rang = 0 Then

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ = ⟨120595 120595 |119909 119909 |120595 120595 ⟩

= ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ = 0 (Ie21)

Electron Density Functional Theory Page 21

copy Roi Baer

The singlet ground state shows no correlation because the Pauli principle is

applied here to the spins not the spatial part of the wave function Note how-

ever this does not mean that all singlet wave functions of 2 electrons have no

correlation Indeed this is the situation in Ψ where the development is

very similar to Eq lrm(Ie19) except that the minus sign is now a plus sign so

lang119909 119909 rang minus lang119909 ranglang119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ minus 0

=1

2⟨120595 120595 + 120595 120595 |119909 119909 |120595 120595 + 120595 120595 ⟩ = |⟨120595 |119909|120595 ⟩|

=1

2

120583120596

(Ie22)

Here we find positive autocorrelation indicative of the fact that spin-opposite

non-interacting pairs of electrons want to stick together (when one has posi-

tive x the other wants to have as well and when one has negative the other

one wants negative as well) like non-interacting bosons

Since there is no interaction between the electrons the correlation in these

wave functions arises only from the Pauli principle ie because we impose

the fact that electrons are Fermions This is called Fermi correlation Our les-

son is this

1) Wavefunctions that are mere products of singe-particle orbitals have no correlation 2) If the products are symmetrized like in the case of the excited singlet the correlation

lang119909 119909 rang is positive indicating that the particles ldquolike to be togetherrdquo ie both on the right or both on the left of the origin

3) If the products are anti-symmetrized like in the case of the triplet the correlation lang119909 119909 rang is negative indicating that the particles ldquolike to be separatedrdquo

Up to now we assumed no e-e interaction So now letrsquos include it and add to

the Hamiltonian an interaction term

= ℎ(1) + ℎ(2) + 119881(119909 119909 ) (Ie23)

Electron Density Functional Theory Page 22

copy Roi Baer

Let us take one case of coupling which is simple enough to yield to analytical

analysis

119881(119909 119909 ) = 120583120574 119909 119909 (Ie24)

With 120574 lt 120596 In this case the Hamiltonian is

=119901

2120583+

119901

2120583+

1

2120583120596 119909

+1

2120583120596 119909

+ 120583120574 119909 119909 (Ie25)

The interaction seems strange at first sight because it does not seem to depend

on the distance between the particles as we are used to from electrostatics

yet it does describe a repulsion We can see this by writing the Hamiltonian

as

=119901

2120583+

119901

2120583+

1

2120583(120596 + 120574 )119909

+1

2120583(120596 + 120574 )119909

minus1

2120583120574 (119909 minus 119909 )

(Ie26)

We now see that the Hamiltonian describes two particles in a Harmonic po-

tential 119881 (119909) =

120583(120596 + 120574 )119909 interacting by a Harmonic potential

minus

120583120574 (119909 minus 119909 )

The potential drops as |119909 minus 119909 | grows ie it is repulsive

Exercise lrmI-1

Find the eigenvalues and eigenfunctions of this Hamiltonian (You can define

new coordinates 119883 and 119909 by

radic = 119909 and

radic = 119909 )

Exercise lrmI-2

Write the ground-state energy and wave function for the triplet state of the

system in the previous exercise Determine the effect of interaction on the en-

ergy by calculating 119903 =

and on the correlation function 119888 = lang119909 119909 rang minus

lang119909 ranglang119909 rang

Solution The result is shown in Figure lrmI-2 (left panel)

Electron Density Functional Theory Page 23

copy Roi Baer

Figure lrmI-2 (Left panel) The ratio = (Right panel) the correlation ratio ( ) (

)

vs interaction strengths where = (

)

f) The electron density is a much simpler object

than the wave function

The complexity of the electronic wave function is overwhelming It includes

all the information we can have on the electrons in a molecule in a certain

state However all these intricacies and details are often uninteresting for us

in many cases we simply have no direct use for them Take for instance the

electronic energy - our primary target in the Born-Oppenheimer picture It

depends only on the relative distance of pairs of particles This is because the

electron-electron Coulomb repulsion is a pair-wise interaction

One interesting quantity besides energy that can be extracted from the elec-

tronic wave function is the electron density 119899(119955) This 3D function tells us the

expectation value of the density of electrons That is 119899(119955)119889 119903 is the expectation

value of the number of electrons at a small volume 119889 119903 around point r Thus

we can write

119899(119955) = ⟨120595|(119955)|120595⟩ (If1)

We use the notation

00 01 02 03 04 05

04

05

06

07

08

09

10

r

00 01 02 03 04 050

5

10

15

cc

2

Electron Density Functional Theory Page 24

copy Roi Baer

⟨120595|120601⟩ equiv int120595 (119955 119955 hellip 119955 )120601(119955 119955 hellip 119955

)119889 119903 119889 119903 hellip119889 119903

(If2)

Here (119955) is the operator corresponding to the electron number density Since

electrons are point particles and the position operator for the ith electron is 119955

this operator is defined by

(119955) = sum120575( minus 119955)

(If3)

We used the definition of a 120575-function according to which

int120575(119955 minus 119955)119891(119955 )119889 119903 = 119891(119955)

(If4)

The function 120575( minus 119955) is the density of electron 119894 at 119955

Exercise lrmI-3

Calculate the expectation value of the electron number density and show that

119899(119955) = 119873 int120595(119955 119955 hellip 119955 )120595(119955 119955 hellip 119955

)119889 119903 hellip119889 119903

(If5)

Looking at Eq lrm(If5) we see that 119899(119955) involves integrating out a huge amount

of wave-function details It is as if only the data concerning the density distri-

bution of a single electron remains This is multiplied by 119873 in Eq lrm(If5) so

119899(119955) accounts for the combined density of all electrons Indeed integrating

over the entire space one obtains from lrm(If5)

int119899(119955)119889 119903 = 119873 (If6)

Expressing the fact that the total number of electrons is 119873

Exercise lrmI-4

Calculate and plot the 1D electron density of the triplet ground state from Ex-

ercise lrmI-2 as a function of the interaction parameter

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 19: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 19

copy Roi Baer

119864 = 120598 + 120598 = 2ℏ120596 (Ie15)

As discussed above singlet and triplet two-electron ground state wave func-

tions composed of two orbitals must be space-symmetric or antisymmetric

respectively We consider below 3 wave functions The first Ψ is the ground

state singlet where both electrons are in 120595 The second and third are a singlet

and a triplet made from one electron in 120595 and the other in 120595

Ψ (119909 119909 ) = 120595 (119909 )120595 (119909 ) = 119873 119890

ℏ(

)

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) + 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 + 119909 )

Ψ (119909 119909 ) =1

radic2[120595 (119909 )120595 (119909 ) minus 120595 (119909 )120595 (119909 )]

= 119873 119890

ℏ(

)(119909 minus 119909 )

(Ie16)

The normalization factors are

119873 = 119873 = radic

120583120596

120587ℏ 119873 =

1

radic2radic120583120596

ℏ119873 119873 =

120583120596

120587ℏ

(Ie17)

Eq lrm(Ie16) describes the distribution of positions of both electrons in their

corresponding states We now want to ask how much are the electrons in this

state aware of each other Do they correlate their motion in some way One

way to measure correlation is to consider the quantity lang119909 119909 rang minus lang119909 ranglang119909 rang If

electrons are completely unaware of each other this quantity called the posi-

tion-autocorrelation function is zero because then the average of the product

of their position must decompose to the product of the average Any deviance

from zero indicates some degree of correlation For the triplet wave function

in Eq lrm(Ie16) we find

Electron Density Functional Theory Page 20

copy Roi Baer

lang119909 rang = ⟨Ψ |119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩)

=1

2(⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩) = 0

(Ie18)

Of-course the same result would be obtained if we calculate ⟨119909 ⟩ because the

electrons are equivalent Furthermore

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩)

= minus|⟨120595 |119909|120595 ⟩| = minus

1

2

120583120596

(Ie19)

We thus find that

lang119909 119909 rang minus lang119909 ranglang119909 rang = minus1

2

120583120596

(Ie20)

The position autocorrelation functional is non-zero even though the electrons

are non-interacting The correlation is due to the Pauli principle It is negative

because the Pauli principle pushes the electrons apart and since most of the

time they both want to be near the origin this gives preference to them being

placed on opposite sides (when one electron has positive x coordinate the

other has negative and vice versa) Letrsquos see what happens in the singlet wave

function Ψ Here too lang119909 rang = lang119909 rang = 0 Then

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ = ⟨120595 120595 |119909 119909 |120595 120595 ⟩

= ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ = 0 (Ie21)

Electron Density Functional Theory Page 21

copy Roi Baer

The singlet ground state shows no correlation because the Pauli principle is

applied here to the spins not the spatial part of the wave function Note how-

ever this does not mean that all singlet wave functions of 2 electrons have no

correlation Indeed this is the situation in Ψ where the development is

very similar to Eq lrm(Ie19) except that the minus sign is now a plus sign so

lang119909 119909 rang minus lang119909 ranglang119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ minus 0

=1

2⟨120595 120595 + 120595 120595 |119909 119909 |120595 120595 + 120595 120595 ⟩ = |⟨120595 |119909|120595 ⟩|

=1

2

120583120596

(Ie22)

Here we find positive autocorrelation indicative of the fact that spin-opposite

non-interacting pairs of electrons want to stick together (when one has posi-

tive x the other wants to have as well and when one has negative the other

one wants negative as well) like non-interacting bosons

Since there is no interaction between the electrons the correlation in these

wave functions arises only from the Pauli principle ie because we impose

the fact that electrons are Fermions This is called Fermi correlation Our les-

son is this

1) Wavefunctions that are mere products of singe-particle orbitals have no correlation 2) If the products are symmetrized like in the case of the excited singlet the correlation

lang119909 119909 rang is positive indicating that the particles ldquolike to be togetherrdquo ie both on the right or both on the left of the origin

3) If the products are anti-symmetrized like in the case of the triplet the correlation lang119909 119909 rang is negative indicating that the particles ldquolike to be separatedrdquo

Up to now we assumed no e-e interaction So now letrsquos include it and add to

the Hamiltonian an interaction term

= ℎ(1) + ℎ(2) + 119881(119909 119909 ) (Ie23)

Electron Density Functional Theory Page 22

copy Roi Baer

Let us take one case of coupling which is simple enough to yield to analytical

analysis

119881(119909 119909 ) = 120583120574 119909 119909 (Ie24)

With 120574 lt 120596 In this case the Hamiltonian is

=119901

2120583+

119901

2120583+

1

2120583120596 119909

+1

2120583120596 119909

+ 120583120574 119909 119909 (Ie25)

The interaction seems strange at first sight because it does not seem to depend

on the distance between the particles as we are used to from electrostatics

yet it does describe a repulsion We can see this by writing the Hamiltonian

as

=119901

2120583+

119901

2120583+

1

2120583(120596 + 120574 )119909

+1

2120583(120596 + 120574 )119909

minus1

2120583120574 (119909 minus 119909 )

(Ie26)

We now see that the Hamiltonian describes two particles in a Harmonic po-

tential 119881 (119909) =

120583(120596 + 120574 )119909 interacting by a Harmonic potential

minus

120583120574 (119909 minus 119909 )

The potential drops as |119909 minus 119909 | grows ie it is repulsive

Exercise lrmI-1

Find the eigenvalues and eigenfunctions of this Hamiltonian (You can define

new coordinates 119883 and 119909 by

radic = 119909 and

radic = 119909 )

Exercise lrmI-2

Write the ground-state energy and wave function for the triplet state of the

system in the previous exercise Determine the effect of interaction on the en-

ergy by calculating 119903 =

and on the correlation function 119888 = lang119909 119909 rang minus

lang119909 ranglang119909 rang

Solution The result is shown in Figure lrmI-2 (left panel)

Electron Density Functional Theory Page 23

copy Roi Baer

Figure lrmI-2 (Left panel) The ratio = (Right panel) the correlation ratio ( ) (

)

vs interaction strengths where = (

)

f) The electron density is a much simpler object

than the wave function

The complexity of the electronic wave function is overwhelming It includes

all the information we can have on the electrons in a molecule in a certain

state However all these intricacies and details are often uninteresting for us

in many cases we simply have no direct use for them Take for instance the

electronic energy - our primary target in the Born-Oppenheimer picture It

depends only on the relative distance of pairs of particles This is because the

electron-electron Coulomb repulsion is a pair-wise interaction

One interesting quantity besides energy that can be extracted from the elec-

tronic wave function is the electron density 119899(119955) This 3D function tells us the

expectation value of the density of electrons That is 119899(119955)119889 119903 is the expectation

value of the number of electrons at a small volume 119889 119903 around point r Thus

we can write

119899(119955) = ⟨120595|(119955)|120595⟩ (If1)

We use the notation

00 01 02 03 04 05

04

05

06

07

08

09

10

r

00 01 02 03 04 050

5

10

15

cc

2

Electron Density Functional Theory Page 24

copy Roi Baer

⟨120595|120601⟩ equiv int120595 (119955 119955 hellip 119955 )120601(119955 119955 hellip 119955

)119889 119903 119889 119903 hellip119889 119903

(If2)

Here (119955) is the operator corresponding to the electron number density Since

electrons are point particles and the position operator for the ith electron is 119955

this operator is defined by

(119955) = sum120575( minus 119955)

(If3)

We used the definition of a 120575-function according to which

int120575(119955 minus 119955)119891(119955 )119889 119903 = 119891(119955)

(If4)

The function 120575( minus 119955) is the density of electron 119894 at 119955

Exercise lrmI-3

Calculate the expectation value of the electron number density and show that

119899(119955) = 119873 int120595(119955 119955 hellip 119955 )120595(119955 119955 hellip 119955

)119889 119903 hellip119889 119903

(If5)

Looking at Eq lrm(If5) we see that 119899(119955) involves integrating out a huge amount

of wave-function details It is as if only the data concerning the density distri-

bution of a single electron remains This is multiplied by 119873 in Eq lrm(If5) so

119899(119955) accounts for the combined density of all electrons Indeed integrating

over the entire space one obtains from lrm(If5)

int119899(119955)119889 119903 = 119873 (If6)

Expressing the fact that the total number of electrons is 119873

Exercise lrmI-4

Calculate and plot the 1D electron density of the triplet ground state from Ex-

ercise lrmI-2 as a function of the interaction parameter

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 20: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 20

copy Roi Baer

lang119909 rang = ⟨Ψ |119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |120595 ⟩)

=1

2(⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩) = 0

(Ie18)

Of-course the same result would be obtained if we calculate ⟨119909 ⟩ because the

electrons are equivalent Furthermore

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ =1

2⟨120595 120595 minus 120595 120595 |119909 119909 |120595 120595 minus 120595 120595 ⟩

=1

2(⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ + ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩

minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ minus ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩)

= minus|⟨120595 |119909|120595 ⟩| = minus

1

2

120583120596

(Ie19)

We thus find that

lang119909 119909 rang minus lang119909 ranglang119909 rang = minus1

2

120583120596

(Ie20)

The position autocorrelation functional is non-zero even though the electrons

are non-interacting The correlation is due to the Pauli principle It is negative

because the Pauli principle pushes the electrons apart and since most of the

time they both want to be near the origin this gives preference to them being

placed on opposite sides (when one electron has positive x coordinate the

other has negative and vice versa) Letrsquos see what happens in the singlet wave

function Ψ Here too lang119909 rang = lang119909 rang = 0 Then

lang119909 119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ = ⟨120595 120595 |119909 119909 |120595 120595 ⟩

= ⟨120595 |119909 |120595 ⟩⟨120595 |119909 |120595 ⟩ = 0 (Ie21)

Electron Density Functional Theory Page 21

copy Roi Baer

The singlet ground state shows no correlation because the Pauli principle is

applied here to the spins not the spatial part of the wave function Note how-

ever this does not mean that all singlet wave functions of 2 electrons have no

correlation Indeed this is the situation in Ψ where the development is

very similar to Eq lrm(Ie19) except that the minus sign is now a plus sign so

lang119909 119909 rang minus lang119909 ranglang119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ minus 0

=1

2⟨120595 120595 + 120595 120595 |119909 119909 |120595 120595 + 120595 120595 ⟩ = |⟨120595 |119909|120595 ⟩|

=1

2

120583120596

(Ie22)

Here we find positive autocorrelation indicative of the fact that spin-opposite

non-interacting pairs of electrons want to stick together (when one has posi-

tive x the other wants to have as well and when one has negative the other

one wants negative as well) like non-interacting bosons

Since there is no interaction between the electrons the correlation in these

wave functions arises only from the Pauli principle ie because we impose

the fact that electrons are Fermions This is called Fermi correlation Our les-

son is this

1) Wavefunctions that are mere products of singe-particle orbitals have no correlation 2) If the products are symmetrized like in the case of the excited singlet the correlation

lang119909 119909 rang is positive indicating that the particles ldquolike to be togetherrdquo ie both on the right or both on the left of the origin

3) If the products are anti-symmetrized like in the case of the triplet the correlation lang119909 119909 rang is negative indicating that the particles ldquolike to be separatedrdquo

Up to now we assumed no e-e interaction So now letrsquos include it and add to

the Hamiltonian an interaction term

= ℎ(1) + ℎ(2) + 119881(119909 119909 ) (Ie23)

Electron Density Functional Theory Page 22

copy Roi Baer

Let us take one case of coupling which is simple enough to yield to analytical

analysis

119881(119909 119909 ) = 120583120574 119909 119909 (Ie24)

With 120574 lt 120596 In this case the Hamiltonian is

=119901

2120583+

119901

2120583+

1

2120583120596 119909

+1

2120583120596 119909

+ 120583120574 119909 119909 (Ie25)

The interaction seems strange at first sight because it does not seem to depend

on the distance between the particles as we are used to from electrostatics

yet it does describe a repulsion We can see this by writing the Hamiltonian

as

=119901

2120583+

119901

2120583+

1

2120583(120596 + 120574 )119909

+1

2120583(120596 + 120574 )119909

minus1

2120583120574 (119909 minus 119909 )

(Ie26)

We now see that the Hamiltonian describes two particles in a Harmonic po-

tential 119881 (119909) =

120583(120596 + 120574 )119909 interacting by a Harmonic potential

minus

120583120574 (119909 minus 119909 )

The potential drops as |119909 minus 119909 | grows ie it is repulsive

Exercise lrmI-1

Find the eigenvalues and eigenfunctions of this Hamiltonian (You can define

new coordinates 119883 and 119909 by

radic = 119909 and

radic = 119909 )

Exercise lrmI-2

Write the ground-state energy and wave function for the triplet state of the

system in the previous exercise Determine the effect of interaction on the en-

ergy by calculating 119903 =

and on the correlation function 119888 = lang119909 119909 rang minus

lang119909 ranglang119909 rang

Solution The result is shown in Figure lrmI-2 (left panel)

Electron Density Functional Theory Page 23

copy Roi Baer

Figure lrmI-2 (Left panel) The ratio = (Right panel) the correlation ratio ( ) (

)

vs interaction strengths where = (

)

f) The electron density is a much simpler object

than the wave function

The complexity of the electronic wave function is overwhelming It includes

all the information we can have on the electrons in a molecule in a certain

state However all these intricacies and details are often uninteresting for us

in many cases we simply have no direct use for them Take for instance the

electronic energy - our primary target in the Born-Oppenheimer picture It

depends only on the relative distance of pairs of particles This is because the

electron-electron Coulomb repulsion is a pair-wise interaction

One interesting quantity besides energy that can be extracted from the elec-

tronic wave function is the electron density 119899(119955) This 3D function tells us the

expectation value of the density of electrons That is 119899(119955)119889 119903 is the expectation

value of the number of electrons at a small volume 119889 119903 around point r Thus

we can write

119899(119955) = ⟨120595|(119955)|120595⟩ (If1)

We use the notation

00 01 02 03 04 05

04

05

06

07

08

09

10

r

00 01 02 03 04 050

5

10

15

cc

2

Electron Density Functional Theory Page 24

copy Roi Baer

⟨120595|120601⟩ equiv int120595 (119955 119955 hellip 119955 )120601(119955 119955 hellip 119955

)119889 119903 119889 119903 hellip119889 119903

(If2)

Here (119955) is the operator corresponding to the electron number density Since

electrons are point particles and the position operator for the ith electron is 119955

this operator is defined by

(119955) = sum120575( minus 119955)

(If3)

We used the definition of a 120575-function according to which

int120575(119955 minus 119955)119891(119955 )119889 119903 = 119891(119955)

(If4)

The function 120575( minus 119955) is the density of electron 119894 at 119955

Exercise lrmI-3

Calculate the expectation value of the electron number density and show that

119899(119955) = 119873 int120595(119955 119955 hellip 119955 )120595(119955 119955 hellip 119955

)119889 119903 hellip119889 119903

(If5)

Looking at Eq lrm(If5) we see that 119899(119955) involves integrating out a huge amount

of wave-function details It is as if only the data concerning the density distri-

bution of a single electron remains This is multiplied by 119873 in Eq lrm(If5) so

119899(119955) accounts for the combined density of all electrons Indeed integrating

over the entire space one obtains from lrm(If5)

int119899(119955)119889 119903 = 119873 (If6)

Expressing the fact that the total number of electrons is 119873

Exercise lrmI-4

Calculate and plot the 1D electron density of the triplet ground state from Ex-

ercise lrmI-2 as a function of the interaction parameter

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 21: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 21

copy Roi Baer

The singlet ground state shows no correlation because the Pauli principle is

applied here to the spins not the spatial part of the wave function Note how-

ever this does not mean that all singlet wave functions of 2 electrons have no

correlation Indeed this is the situation in Ψ where the development is

very similar to Eq lrm(Ie19) except that the minus sign is now a plus sign so

lang119909 119909 rang minus lang119909 ranglang119909 rang = ⟨Ψ |119909 119909 |Ψ ⟩ minus 0

=1

2⟨120595 120595 + 120595 120595 |119909 119909 |120595 120595 + 120595 120595 ⟩ = |⟨120595 |119909|120595 ⟩|

=1

2

120583120596

(Ie22)

Here we find positive autocorrelation indicative of the fact that spin-opposite

non-interacting pairs of electrons want to stick together (when one has posi-

tive x the other wants to have as well and when one has negative the other

one wants negative as well) like non-interacting bosons

Since there is no interaction between the electrons the correlation in these

wave functions arises only from the Pauli principle ie because we impose

the fact that electrons are Fermions This is called Fermi correlation Our les-

son is this

1) Wavefunctions that are mere products of singe-particle orbitals have no correlation 2) If the products are symmetrized like in the case of the excited singlet the correlation

lang119909 119909 rang is positive indicating that the particles ldquolike to be togetherrdquo ie both on the right or both on the left of the origin

3) If the products are anti-symmetrized like in the case of the triplet the correlation lang119909 119909 rang is negative indicating that the particles ldquolike to be separatedrdquo

Up to now we assumed no e-e interaction So now letrsquos include it and add to

the Hamiltonian an interaction term

= ℎ(1) + ℎ(2) + 119881(119909 119909 ) (Ie23)

Electron Density Functional Theory Page 22

copy Roi Baer

Let us take one case of coupling which is simple enough to yield to analytical

analysis

119881(119909 119909 ) = 120583120574 119909 119909 (Ie24)

With 120574 lt 120596 In this case the Hamiltonian is

=119901

2120583+

119901

2120583+

1

2120583120596 119909

+1

2120583120596 119909

+ 120583120574 119909 119909 (Ie25)

The interaction seems strange at first sight because it does not seem to depend

on the distance between the particles as we are used to from electrostatics

yet it does describe a repulsion We can see this by writing the Hamiltonian

as

=119901

2120583+

119901

2120583+

1

2120583(120596 + 120574 )119909

+1

2120583(120596 + 120574 )119909

minus1

2120583120574 (119909 minus 119909 )

(Ie26)

We now see that the Hamiltonian describes two particles in a Harmonic po-

tential 119881 (119909) =

120583(120596 + 120574 )119909 interacting by a Harmonic potential

minus

120583120574 (119909 minus 119909 )

The potential drops as |119909 minus 119909 | grows ie it is repulsive

Exercise lrmI-1

Find the eigenvalues and eigenfunctions of this Hamiltonian (You can define

new coordinates 119883 and 119909 by

radic = 119909 and

radic = 119909 )

Exercise lrmI-2

Write the ground-state energy and wave function for the triplet state of the

system in the previous exercise Determine the effect of interaction on the en-

ergy by calculating 119903 =

and on the correlation function 119888 = lang119909 119909 rang minus

lang119909 ranglang119909 rang

Solution The result is shown in Figure lrmI-2 (left panel)

Electron Density Functional Theory Page 23

copy Roi Baer

Figure lrmI-2 (Left panel) The ratio = (Right panel) the correlation ratio ( ) (

)

vs interaction strengths where = (

)

f) The electron density is a much simpler object

than the wave function

The complexity of the electronic wave function is overwhelming It includes

all the information we can have on the electrons in a molecule in a certain

state However all these intricacies and details are often uninteresting for us

in many cases we simply have no direct use for them Take for instance the

electronic energy - our primary target in the Born-Oppenheimer picture It

depends only on the relative distance of pairs of particles This is because the

electron-electron Coulomb repulsion is a pair-wise interaction

One interesting quantity besides energy that can be extracted from the elec-

tronic wave function is the electron density 119899(119955) This 3D function tells us the

expectation value of the density of electrons That is 119899(119955)119889 119903 is the expectation

value of the number of electrons at a small volume 119889 119903 around point r Thus

we can write

119899(119955) = ⟨120595|(119955)|120595⟩ (If1)

We use the notation

00 01 02 03 04 05

04

05

06

07

08

09

10

r

00 01 02 03 04 050

5

10

15

cc

2

Electron Density Functional Theory Page 24

copy Roi Baer

⟨120595|120601⟩ equiv int120595 (119955 119955 hellip 119955 )120601(119955 119955 hellip 119955

)119889 119903 119889 119903 hellip119889 119903

(If2)

Here (119955) is the operator corresponding to the electron number density Since

electrons are point particles and the position operator for the ith electron is 119955

this operator is defined by

(119955) = sum120575( minus 119955)

(If3)

We used the definition of a 120575-function according to which

int120575(119955 minus 119955)119891(119955 )119889 119903 = 119891(119955)

(If4)

The function 120575( minus 119955) is the density of electron 119894 at 119955

Exercise lrmI-3

Calculate the expectation value of the electron number density and show that

119899(119955) = 119873 int120595(119955 119955 hellip 119955 )120595(119955 119955 hellip 119955

)119889 119903 hellip119889 119903

(If5)

Looking at Eq lrm(If5) we see that 119899(119955) involves integrating out a huge amount

of wave-function details It is as if only the data concerning the density distri-

bution of a single electron remains This is multiplied by 119873 in Eq lrm(If5) so

119899(119955) accounts for the combined density of all electrons Indeed integrating

over the entire space one obtains from lrm(If5)

int119899(119955)119889 119903 = 119873 (If6)

Expressing the fact that the total number of electrons is 119873

Exercise lrmI-4

Calculate and plot the 1D electron density of the triplet ground state from Ex-

ercise lrmI-2 as a function of the interaction parameter

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 22: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 22

copy Roi Baer

Let us take one case of coupling which is simple enough to yield to analytical

analysis

119881(119909 119909 ) = 120583120574 119909 119909 (Ie24)

With 120574 lt 120596 In this case the Hamiltonian is

=119901

2120583+

119901

2120583+

1

2120583120596 119909

+1

2120583120596 119909

+ 120583120574 119909 119909 (Ie25)

The interaction seems strange at first sight because it does not seem to depend

on the distance between the particles as we are used to from electrostatics

yet it does describe a repulsion We can see this by writing the Hamiltonian

as

=119901

2120583+

119901

2120583+

1

2120583(120596 + 120574 )119909

+1

2120583(120596 + 120574 )119909

minus1

2120583120574 (119909 minus 119909 )

(Ie26)

We now see that the Hamiltonian describes two particles in a Harmonic po-

tential 119881 (119909) =

120583(120596 + 120574 )119909 interacting by a Harmonic potential

minus

120583120574 (119909 minus 119909 )

The potential drops as |119909 minus 119909 | grows ie it is repulsive

Exercise lrmI-1

Find the eigenvalues and eigenfunctions of this Hamiltonian (You can define

new coordinates 119883 and 119909 by

radic = 119909 and

radic = 119909 )

Exercise lrmI-2

Write the ground-state energy and wave function for the triplet state of the

system in the previous exercise Determine the effect of interaction on the en-

ergy by calculating 119903 =

and on the correlation function 119888 = lang119909 119909 rang minus

lang119909 ranglang119909 rang

Solution The result is shown in Figure lrmI-2 (left panel)

Electron Density Functional Theory Page 23

copy Roi Baer

Figure lrmI-2 (Left panel) The ratio = (Right panel) the correlation ratio ( ) (

)

vs interaction strengths where = (

)

f) The electron density is a much simpler object

than the wave function

The complexity of the electronic wave function is overwhelming It includes

all the information we can have on the electrons in a molecule in a certain

state However all these intricacies and details are often uninteresting for us

in many cases we simply have no direct use for them Take for instance the

electronic energy - our primary target in the Born-Oppenheimer picture It

depends only on the relative distance of pairs of particles This is because the

electron-electron Coulomb repulsion is a pair-wise interaction

One interesting quantity besides energy that can be extracted from the elec-

tronic wave function is the electron density 119899(119955) This 3D function tells us the

expectation value of the density of electrons That is 119899(119955)119889 119903 is the expectation

value of the number of electrons at a small volume 119889 119903 around point r Thus

we can write

119899(119955) = ⟨120595|(119955)|120595⟩ (If1)

We use the notation

00 01 02 03 04 05

04

05

06

07

08

09

10

r

00 01 02 03 04 050

5

10

15

cc

2

Electron Density Functional Theory Page 24

copy Roi Baer

⟨120595|120601⟩ equiv int120595 (119955 119955 hellip 119955 )120601(119955 119955 hellip 119955

)119889 119903 119889 119903 hellip119889 119903

(If2)

Here (119955) is the operator corresponding to the electron number density Since

electrons are point particles and the position operator for the ith electron is 119955

this operator is defined by

(119955) = sum120575( minus 119955)

(If3)

We used the definition of a 120575-function according to which

int120575(119955 minus 119955)119891(119955 )119889 119903 = 119891(119955)

(If4)

The function 120575( minus 119955) is the density of electron 119894 at 119955

Exercise lrmI-3

Calculate the expectation value of the electron number density and show that

119899(119955) = 119873 int120595(119955 119955 hellip 119955 )120595(119955 119955 hellip 119955

)119889 119903 hellip119889 119903

(If5)

Looking at Eq lrm(If5) we see that 119899(119955) involves integrating out a huge amount

of wave-function details It is as if only the data concerning the density distri-

bution of a single electron remains This is multiplied by 119873 in Eq lrm(If5) so

119899(119955) accounts for the combined density of all electrons Indeed integrating

over the entire space one obtains from lrm(If5)

int119899(119955)119889 119903 = 119873 (If6)

Expressing the fact that the total number of electrons is 119873

Exercise lrmI-4

Calculate and plot the 1D electron density of the triplet ground state from Ex-

ercise lrmI-2 as a function of the interaction parameter

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 23: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 23

copy Roi Baer

Figure lrmI-2 (Left panel) The ratio = (Right panel) the correlation ratio ( ) (

)

vs interaction strengths where = (

)

f) The electron density is a much simpler object

than the wave function

The complexity of the electronic wave function is overwhelming It includes

all the information we can have on the electrons in a molecule in a certain

state However all these intricacies and details are often uninteresting for us

in many cases we simply have no direct use for them Take for instance the

electronic energy - our primary target in the Born-Oppenheimer picture It

depends only on the relative distance of pairs of particles This is because the

electron-electron Coulomb repulsion is a pair-wise interaction

One interesting quantity besides energy that can be extracted from the elec-

tronic wave function is the electron density 119899(119955) This 3D function tells us the

expectation value of the density of electrons That is 119899(119955)119889 119903 is the expectation

value of the number of electrons at a small volume 119889 119903 around point r Thus

we can write

119899(119955) = ⟨120595|(119955)|120595⟩ (If1)

We use the notation

00 01 02 03 04 05

04

05

06

07

08

09

10

r

00 01 02 03 04 050

5

10

15

cc

2

Electron Density Functional Theory Page 24

copy Roi Baer

⟨120595|120601⟩ equiv int120595 (119955 119955 hellip 119955 )120601(119955 119955 hellip 119955

)119889 119903 119889 119903 hellip119889 119903

(If2)

Here (119955) is the operator corresponding to the electron number density Since

electrons are point particles and the position operator for the ith electron is 119955

this operator is defined by

(119955) = sum120575( minus 119955)

(If3)

We used the definition of a 120575-function according to which

int120575(119955 minus 119955)119891(119955 )119889 119903 = 119891(119955)

(If4)

The function 120575( minus 119955) is the density of electron 119894 at 119955

Exercise lrmI-3

Calculate the expectation value of the electron number density and show that

119899(119955) = 119873 int120595(119955 119955 hellip 119955 )120595(119955 119955 hellip 119955

)119889 119903 hellip119889 119903

(If5)

Looking at Eq lrm(If5) we see that 119899(119955) involves integrating out a huge amount

of wave-function details It is as if only the data concerning the density distri-

bution of a single electron remains This is multiplied by 119873 in Eq lrm(If5) so

119899(119955) accounts for the combined density of all electrons Indeed integrating

over the entire space one obtains from lrm(If5)

int119899(119955)119889 119903 = 119873 (If6)

Expressing the fact that the total number of electrons is 119873

Exercise lrmI-4

Calculate and plot the 1D electron density of the triplet ground state from Ex-

ercise lrmI-2 as a function of the interaction parameter

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 24: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 24

copy Roi Baer

⟨120595|120601⟩ equiv int120595 (119955 119955 hellip 119955 )120601(119955 119955 hellip 119955

)119889 119903 119889 119903 hellip119889 119903

(If2)

Here (119955) is the operator corresponding to the electron number density Since

electrons are point particles and the position operator for the ith electron is 119955

this operator is defined by

(119955) = sum120575( minus 119955)

(If3)

We used the definition of a 120575-function according to which

int120575(119955 minus 119955)119891(119955 )119889 119903 = 119891(119955)

(If4)

The function 120575( minus 119955) is the density of electron 119894 at 119955

Exercise lrmI-3

Calculate the expectation value of the electron number density and show that

119899(119955) = 119873 int120595(119955 119955 hellip 119955 )120595(119955 119955 hellip 119955

)119889 119903 hellip119889 119903

(If5)

Looking at Eq lrm(If5) we see that 119899(119955) involves integrating out a huge amount

of wave-function details It is as if only the data concerning the density distri-

bution of a single electron remains This is multiplied by 119873 in Eq lrm(If5) so

119899(119955) accounts for the combined density of all electrons Indeed integrating

over the entire space one obtains from lrm(If5)

int119899(119955)119889 119903 = 119873 (If6)

Expressing the fact that the total number of electrons is 119873

Exercise lrmI-4

Calculate and plot the 1D electron density of the triplet ground state from Ex-

ercise lrmI-2 as a function of the interaction parameter

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 25: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 25

copy Roi Baer

When we look for the ground-state energy of a complicated system with

Hamiltonian the variational principle is often extremely important It says

quite simply that the ground-state energy is a minimum of a functional of

wave functions The functional is

119864[120601] =⟨120601||120601⟩

⟨120601|120601⟩ (If7)

And the variational principle states that

119864 equiv 119864[120601 ] le 119864[120601] (If8)

For all 120601 What this principle allows us is to determine which of the two wave

functions 120601 and 120601 gives an energy closer to 119864 without knowing 119864 Sim-

ple the one with lower energy is closer

We can build a parameterized family of functions 120601 and determine the op-

timized parameter as the one that gives minimum to 119864(120574) equiv 119864[120601 ]

Exercise lrmI-5

Consider the quantum ground-state problem for a particle in a well 119907(119909) =

119896119909 We consider the family of functions

120601 (119909) =119890

radicradic2120587120590 (If9)

Here is a positive parameter What is the best function of this type for rep-

resenting the ground state

g) Dilation relations

Consider the normalized 120595(119909) From it we can define a family of normalized

wavefunctions

120595 (119909) = 120574 120595(120574119909) (Ig1)

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 26: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 26

copy Roi Baer

We can check that each member is indeed normalized

⟨120595 |120595 ⟩ = int|120595 (119909)| 119889119909 = 120574int|120595(120574119909)| 119889119909 = int|120595(119910)| 119889119910

(Ig2)

This operation of stretching the argument of a function is called dilation Di-

lation affects functionals Consider the kinetic energy functional

119879[120595] equiv ⟨120595||120595⟩ = int120595(119909) (minus1

2

119889

119889119909 )120595(119909)119889119909

(Ig3)

Exercise lrmI-6

Prove the dilation relation of 119879[120595 ] and 119879[120595]

119879[120595 ] = 120574 119879[120595] (Ig4)

The potential between all particles is called ldquohomogeneous of order 119899rdquo if

119881(120574119909 hellip 120574119909 ) = 120574 119881(119909 hellip 119909 ) (Ig5)

Examples are 119899 = 2 is for Harmonic potential wells and harmonic interactions

and 119899 = minus1 is for the Coulomb potential wells and Coulombic interactions

Exercise lrmI-7

Prove the dilation relation for homogeneous potentials of order 119899

119881[120595 ] = 120574 119881[120595] (Ig6)

We combine the results from Eqs lrm(Ig4) and lrm(Ig6) and obtain an interesting

property of the total energy for systems with homogeneous interactions

119864[120595 ] = 119879[120595 ] + 119881[120595 ] = 120574 119879[120595] + 120574 119881[120595] (Ig7)

For a molecule the interaction between the particles (electrons and nuclei) is

the Coulomb potential 119881(119955 119929) equiv 119881( 119955 hellip 119955 119929 hellip 119929

) which is homoge-

neous of order 119899 = minus1 one finds the energy of a molecule obeys

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 27: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 27

copy Roi Baer

119894ℏ(119905) = (119905)120601(119905)119864[120595 ] = 119879[120595 ] + 119881[120595 ]

= 120574 119879[120595] + 120574119881[120595] (Coulomb) (Ig8)

viii The concept of the virial in classical mechanics

First let us define the virial For a system with coordinate 119902 collectively de-

noted as 119954 the virial in classical mechanics is the time average of 119954 sdot 119917 where

119917 is the force vector

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ =1

120591int 119954(119905 ) sdot 119917(119905 )119889119905

It can be shown that for bound systems[5]

119907119894119903119894119886119897 = ⟨119954 sdot 119917⟩ = minus2⟨119879⟩

For conservative systems the force is a gradient of the potential 119917 = minusnabla119907(119954)

The viral relates to dilation of the coordinates through

119889

119889120574119907 (119954) = 119954 sdot nabla119907(120574119954) = 119954 sdot nabla119907 (119954) (Ig9)

For homogeneous potentials we have 120574

119907 (119954) = 119899120574 119907(119954) = 119899119907 (119954) thus

119899119907 (119954) = 120574119954 sdot nabla119907 (119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig10)

In particular for 120574 = 1

119899119907(119954) = 119954 sdot nabla119907(119954) (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899) (Ig11)

We will especially be interested in Coulomb systems where 119899 = minus1 then

119894ℏ(119905) = (119905)120601(119905) (Ig12)

119907(119954) + 119954 sdot nabla119907(119954) = 0 (119907 ℎ119900119898119900119892119890119899119890119900119906119904 119900119903119889119890119903 119899 = minus1) (191)

Exercise lrmI-8

Eq lrm(Ig11) is known as Eulerrsquos equation after its discoverer In Thermody-

namics it is extremely useful Thermodynamics stipulates that if you know

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 28: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 28

copy Roi Baer

the energy of a system 119880(119878 119881 ) as a function of its macroscopic parameters

119878 119881 then you have complete thermodynamics knowledge The first and se-

cond laws of thermodynamics state that

119894ℏ(119905) = (119905)120601(119905) (Ig13)

119889119880 = 119879119889119878 minus 119901119889119881 + 120583 sdot 119889 (192)

where 119879 119901 and 120583 are respectively the temperature the pressure and the chem-

ical potentials A second stipulation is that 119880 is a homogeneous function of

order 1

From this show that for all thermodynamical systems

1 119880 = 119879119878 minus 119875119881 + 120583 sdot

2 (

) =

For homogeneous potentials the viral theorem becomes

119894ℏ(119905) = (119905)120601(119905) (Ig14)

119907119894119903119894119886119897 = minus119899⟨119907⟩ = minus2⟨119879⟩ (193)

For Coulomb systems 119899 = minus1

119894ℏ(119905) = (119905)120601(119905) (Ig15)

⟨119907⟩ = minus2⟨119879⟩ (194)

From which

119894ℏ(119905) = (119905)120601(119905) (Ig16)

119864 = ⟨119907⟩ + ⟨119879⟩ = minus⟨119879⟩ =⟨119907⟩

2 (195)

We now show that this relation holds also in quantum mechanics

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 29: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 29

copy Roi Baer

ix The Virial Theorem in quantum mechanics

Now if 120595 is the ground-state energy then 119864[120595 ] obtains the minimum and

therefore

119894ℏ(119905) = (119905)120601(119905) (Ig17)

119889

119889120574119864[120595 ] = 0 (196)

Plugging this into Error Reference source not found one obtains

119894ℏ(119905) = (119905)120601(119905) (Ig18)

0 = 2120574119879[120595] minus 119899120574 119881[120595] (197)

Since we know that the optimal value of 120574 is 1 then (dropping the [120595] symbol

and remembering that all following quantities are ground state expectation

value

119894ℏ(119905) = (119905)120601(119905) (Ig19)

2119879 = 119899119881

119864 = 119879 + 119881 = (1 +2

119899)119879 = (

119899

2+ 1)119881

(198)

These relations show that the virial theorem (Eq lrm(Ig16) holds in quantum me-

chanics provided 120595 is the full molecular eigenstate For molecules 119899 = minus1

and one has

119894ℏ(119905) = (119905)120601(119905) (Ig20)

2119879 = minus119881 (Coulomb)

119864 = 119879 + 119881 = minus119879 =1

2119881

(199)

A subtle issue all energies and potentials in the above expressions are abso-

lute We usually give energy and potentials only to within an additive con-

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 30: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 30

copy Roi Baer

stant However the fact that the potential is homogenous it cannot tolerate

addition of a constant (119909 is homogeneous of order 2 but 119909 + 119886 is not)

x Slaterrsquos theory of the chemical bond using the viral theorem

What happens when 120595 is the electronic eigenstate in the Born Oppenheimer

approximation If we look only at the electronic wave function we do not ex-

pect Eq lrm(Ig19) to be valid Indeed using = + 119881(119955 119929) where = (minus

nabla )

and 119881(119955 119929) =

|119955 119929| (with obvious summation on nuclei and electrons)

[119929]120595[119929](119955) = 119864[119929]120595[119929](119955) (Ig21)

Upon dilation 120595 [119929](119955) = 120574 120595[119929](120574119955) and note that 119929 is not dilated Then

we have for the KE and potential

119879 [119929] equiv ⟨120595 ||120595 ⟩ = int120595 [119929](119955) (minus1

2nabla )120595 [119929](119955)119889 119903

= 120574 int120595[119929](120574119955) (minus1

2nabla )120595[119929](120574119955)119889 119903

= 120574 int120595[119929](119956) (minus1

2nabla )120595[119929](119956)119889 119904 = 120574 119879[119929]

(Ig22)

And we define

119882 [119929 119929 ] equiv ⟨120595 [119929]|[119929 ]|120595 [119929]⟩ = int120595 (119955 119929) 119881(119955 119929 )119889 119903

= 120574 int120595(120574119955 119929) 119881(119955 119929 )119889 119903

= int120595(119956 119929) 119881(120574 119956 119929 )119889 119904

= 120574int120595(119956 119929) 119881(119956 120574119929 )119889 119904 = 120574119882[119929 120574119929 ]

(Ig23)

We have defined 119882 a two 119929 quantity Of course the physical interaction po-

tential is 119881[119929] = 119882[119929119929] The reason we defined 119882 this way is that we can

now write

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 31: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 31

copy Roi Baer

119864 [119929] = 120574 119879 [119929] + 120574119882[119929 120574119929] (Ig24)

Note that we inserted the subscript 119890 to the KE since now it is only the KE of

the electrons

One still has the variational principle ie 119889119864 [119929]119889120574| = 0 (Eq (196))

0 = 2120574119879 [119929] + 119882[119929 120574119929] + 120574(119825 sdot nabla119929 119882)[119929119929 ]119929 119929 (Ig25)

Here we used the fact that

119891(120574119929) = [119929 sdot nabla119891](120574119929) Putting in the solution

120574 = 1 we find

0 = 2119879 [119929] + 119881[119929] + 119825 sdot nabla119929 119882[119929119929 ]119929 119929 (Ig26)

From the Hellmann-Feynman theorem (see section XXX) nabla119929 (119882[119929119929 ])119929 119929 =

nabla 119864(119929) Adding to the electronic energy 119864 the nuclear potential energy

119864 =

sum

|119929 119929 | gives the BO energy 119864 = 119864 + 119864 and since the nuclear

energy is homogeneous of order -1 we can use Eq (191) and

0 = 119864 + sum119825 sdot nabla

119864 (Ig27)

Plugging all this into eq Error Reference source not found we find Slaterrsquos

formula[6]

119879 = minus119864 minus 119825 sdot nabla 119864 (Ig28)

From this using 119864 (119929) = 119879 + 119881 (where 119881 = 119881 + 119864 is the total PE of the

molecule) we find also

119881 = 2119864 + 119825 sdot nabla 119864 (1910)

We point out an interesting property of Eq

Error Reference source not found is that we can obtain electronic properties

such as kinetic energy 119879 or potential energy 119881 directly from the potential sur-

face 119864 This will be important for analyzing properties of DFT quantities

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 32: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 32

copy Roi Baer

Note that at stationary points on the Born Oppenheimer PES where nabla 119864 =0

the usual virial relation eq (198) holds (with neglect of nuclear kinetic ener-

gies)

Slater derived Eq Error Reference source not found in a different manner

not from variational or dilation considerations He used it for analyzing the

general nature of chemical bonding of a diatomic molecule We follow in his

footspteps here Consider a diatomic molecule Suppose the BO curve is give

as a potential of the following form in terms of the inter-nuclear distance 119877

119864 (119877) =119860

119877 minus

119861

119877 + 119864 (Ig29)

Where 119860 and 119861 are positive constants and typically 119898 ≫ 119899For example in the

case the PE curve is given by the Lennard-Jones potential then 119898 = 12 and

119899 = 6 This potential describes long-range attraction and short-range repul-

sion It has a minimum at

119877 = (119861119899

119860119898)

(Ig30)

One then has

119879 =119861(1 minus 119899)

119877 minus

119860(1 minus 119898)

119877 minus 119864 (Ig31)

When 119877 rarr infin we have 119879 asymp minus ( )

minus 119864 Let us consider what happens to

the KE as the distance 119877 is reduced Clearly it initially descends however at

some distance 119877 it reaches a minimum and below this distance it begins to

increase Since

119889

119889119877119879 = minus

119861119899(1 minus 119899)

119877 +

119860119898(1 minus 119898)

119877 (Ig32)

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 33: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 33

copy Roi Baer

We find the minimum kinetic energy is obtained at a distance somewhat larg-

er than 119877

119877 = (119860119898(119898 minus 1)

119861119899(119899 minus 1))

= (119898 minus 1

119899 minus 1)

119877 gt 119877 (Ig33)

For example for 119898 = 12 119899 = 6 we find 119877 = 114119877 The sum of electronic

and nuclear potential energy is

119881 =119860(2 minus 119898)

119877 minus

119861(2 minus 119899)

119877 + 2119864 (Ig34)

At large 119877 119881 + 119864 actually increases as the atoms approach The potential en-

ergy does not rise indefinitely At some inter-nuclear distance larger than the

distance at which the BO potential obtains its minimum it reaches a maxi-

mum and then starts decreasing We have

119860119898(2 minus 119898)

119877 =

119861119899(2 minus 119899)

119877

119877 = (119860119898(119898 minus 2)

119861119899(119899 minus 2))

= (119898 minus 2

119899 minus 2)

119877 gt 119877

(Ig35)

For the Lennard Jones potential we have 119877 = 116119877 For 119877 lt 119877 we find

that the kinetic energy grows sharply as 119877 decreases while the potential en-

ergy decreases This sharp behavior can be interpreted as a condensation of

the electron cloud around each nuclei as such a condensation causes increase

in kinetic energy 119879 and decrease in electronic potential energy 119881 The total BO

energy continues to drop as long as 119877 gt 119877 and then starts rising

As an application of this theory consider the 119883hellip119884 system where 119883 is a neu-

tral atom and 119884 a distant atomic ion (can also be a molecular ion) The ener-

gy at large distance 119877 is that of Eq Error Reference source not found with

119860 = 0 and 119861 = ( )

where 120572(119883) is the polarizability of 119883 and 119899 = 4 This form

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 34: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 34

copy Roi Baer

of the potential surface results from a simple exercise in electrostatics Quan-

tum mechanics is needed only for calculating the value of the polarizability of

119883 120572(119883) Slaterrsquos analysis states that

119879 = minus3120572(119883)

2119877 minus 119864

119881 =120572(119883)

119877 + 2119864

(Ig36)

So the force due to the total potential energy is repulsive while that due to the

KE is attractive The KE attraction outweighs the PE repulsion by a factor of

three halves Slaterrsquos analysis shows that the interaction here is fundamentally

different from that of electrostatics where the force attributed entirely to the

Coulombic attraction

Another application is to the simplest chemical bond that of the 119867 molecule

The text-book analysis of this molecule assumes that the molecular orbital is a

ldquobondingrdquo linear combination of the two atomic orbitals (LCAO) both having

the 1s shape 120595 (119903) = 119890

but each centered on its corresponding nucleus

Quantitatively this picture is very inaccurate While it does show binding its

predicted a bond length is almost 14 Å (while the true bond length is 106Å)

and its estimate of atomization energy is 17 eV while the experimental value

is close to 27eV There are no electron correlation effects here so the only

source of inaccuracy is the limited efficacy of the small basis

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 35: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 35

copy Roi Baer

Figure lrmI-3 The optimal dilation parameter as a function of the inter-nuclear distance 119929

in where the wave function is the sum of 2 1s-type orbitals (localized on each of the

two nuclei)

One can add a dilation factor to the wave function 120595 (119903) = 119890

and perform

a variational determination of 120577 Then one can obtain for each value of the

inter-nuclear distance the optimal 120577 denoted 120577 (119877) At 119877 rarr infin the dilation pa-

rameter should be equal to 1 as then the 1s orbital is exact At 119877 rarr 0 the mole-

cule becomes the He ion and 120577 should approach the exact value of 2 See Fig-

ure lrmI-3 But the optimal value 120577 is not monotonic At large 119877 it becomes small-

er than 1 indicating that the electron cloud expands the kinetic energy drops

and the potential energy rises Thus long-range attraction is driven by the ki-

netic energy At 119877 = 52119886 the value of 120577 is again 1 and then grows monoton-

ically as 119877 is reduced Now the orbital is contracting kinetic energy is growing

and potential energy is decreasing In this range chemical binding is driven

by the potential energies At the distance minimizing the BO energy the value

of 120577 is 120577 asymp 123 ie the orbitals are then considerably contracted (relative to

the free H atom) With such contracted orbitals the calculated bond length of

the LCAO calculation is 106Å The bond energy computed by subtracting the

calculated H2+ energy from that of the hydrogen atom gives the bond energy

of 235eV The first to report this type of calculation is in ref [7] a clear exposi-

tion of the calculation is given in refs [8] These improved theoretical esti-

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 36: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 36

copy Roi Baer

mates with a two-function basis achieved with just a single dilation parame-

ter show that orbital shapes are crucial for a good description of chemical

bonding

This result contradicts our intuition regarding the behavior of electrons in a

covalent bond It seems that a major source of the chemical bond energy in H

is not due to the electrostatic benefit resulting from the placement of electrons

in the middle region separating the two nuclei Rather it is due to the contrac-

tion of the electron wave function near each of the nuclei induced by change

in Coulomb potential near that nucleus due to the presence of the other nu-

clei Around one nucleus of H the nuclear Coulombic potential becomes

deeper and distorted towards the second nucleus As a result a contraction of

the electron density around the nucleus is obtained and the charge placed be-

tween the atoms is not in the ldquomidwayrdquo region of the bond it is between them

alright but very close to each nucleus Each nuclei seems to share the electron

not by placing it midway between them but rather by having the electrons

much closer to each nucleus

Combining this with the above result we find a very general conclusion

Rule 1 of chemical attraction As atoms approach from afar (for 119877 gt 119877 ) the

kinetic energy decreases and the potential energy increases and thus it is the

lowering of kinetic energy which is responsible for the distant attraction between

molecules

Rule 2 of chemical bonding For distances near but larger than the bonding

distance the attraction is more subtle The electronic kinetic energy rises

sharply and the potential energy drops even more sharply Thus a major

source of energy is the contraction of orbitals around their respective nuclei

inspired by the deepening of potential wells due to the presence of close-by

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 37: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 37

copy Roi Baer

nuclei The electronic PE is thus the major glue when atoms are close to each

other It has to offset the nuclear repulsion and the repulsion due to kinetic

energy of electrons

xi Other Dilation Facts

Consider the Schroumldinger equation

minusℏ

2119898120595 (119909) + 119907(119909)120595(119909) = 119864120595(119909) (Ig37)

We now dilate we assume 120595 (119909) = 120595(120582119909) is an eigenstate and find the corre-

sponding Hamiltonian Note that 120595 (119909) = 120582 120595 (120582119909) thus

minusℏ

2119898120582 120595

(119909) + 119907(120582119909)120595 (119909) = 119864120595 (119909) (Ig38)

We see that the required Hamiltonian involves a dilated potential and a scalled

mass by a factor 120582 The energy is left intact Alternatively we can multiply

through by the square of the dilation factor 120582 and obtain

minusℏ

2119898120595

(119909) + 120582 119907(120582119909)120595 (119909) = 120582 119864120595 (119909) (Ig39)

Now the potential is dilated and scaled 119907(119909) rarr 120582 119907(120582119909) when then the wave

function is dilated by 120595(119909) rarr 120595(120582119909) and the total energy is also scaled by

119864 rarr 120582 119864 This is general for basically any Schroumldinger equations and holds for

any number of spatial dimensions

Now consider homogeneous potentials in 3D 119907(120582119955) = 120582 119907(119955) In this case the

potential transform is indeed a multiplicative scaling 119907(119955) rarr 120582 119907(119955) In par-

ticular in Coulomb systems when 119899 = minus1 we find that the potential scaling

is simply by a factor 120582 We can turn around the discussion and state

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 38: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 38

copy Roi Baer

If the Coulomb coupling constant is scaled by a factor 120582 then the total

energy is scaled by a factor 120582 and the wave function is dilated by a fac-

tor 120582 In particular the density is dilated and scaled as 119899(119955) rarr 120582 119899(120582119955)

According to the virial theorem the potential energy and the total energy are

related by

= minus119879 = 119864 thus the expectation value of the potential and kinetic

energies is scaled by a factor 120582 as well

Exercise lrmI-9

An application of the above discussion allows an interesting exact conclusion

concerning the ground-state energy per particle of the homogeneous electron

gas of density 119899 (see section XXX for details) Denote this quantity

120598 (119899 119890 ) and the Coulomb coupling constant 119890 4120587120598 Show that

3119899120597120598

120597119899+ 119890

120597120598

120597119890 = 2120598 (Ig40)

Hint Start from 120598 (120582 119899 λ119890 ) = λ 120598 (119899 119890 ) Then take the derivative with

respect to 120582 Then set 120582 = 1

h) Introduction to Functionals

xii Definition of a functional and some examples

A formula that accepts a function and maps it to a number is called a function-

al We denote the number a functional 119865 maps to a function 119899(119955) by this by

the symbol 119865[119899] Take for example the simplest functional 119865[119899] = 0 This

functional maps any function 119899(119955) to the number zero A more interesting

functional is

119865119955 [119899] = 119899(119955 ) (Ih1)

This functional maps to each function 119899(119955) its value at a certain point 119955

Next consider the functional

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 39: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 39

copy Roi Baer

119868 [119899] =1

119881int119899(119955)119889 119903

(Ih2)

mapping each function its average value in a given volume 119881

Another familiar example is the kinetic energy functional for a particle of

mass 120583 in some wave function 120595(119955)

119879[120595] = minusℏ

2120583int120595 (119955)nabla 120595(119955)119889 119903

(Ih3)

Or the energy for a given potential 119881(119955)

119864[120595] = 119879[120595] + 119881[120595] (Ih4)

Where

119881[120595] = int119881(119955)|120595(119955)| 119889 119903 (Ih5)

xiii Linear functionals

A functional is called linear if for any two functions 119899(119955) and 119898(119955) one has

119865[119899(119955) + 119898(119955)] = 119865[119899(119955)] + 119865[119898(119955)] and if 119865[120572119899(119955)] = 120572119865[119899(119955)] for any num-

ber 120572 The functionals above 119865119955 [119899] and 119868 [119899] are examples of such functionals

please check Any linear functional can be written in the form

119865[119899] = int119891 (119955)119899(119955)119889 119903

(Ih6)

The function 119891 (119955) is called the linear response kernel of 119865 For example the

linear response kernel of 119865119955 is 119891 (119955) = 120575(119955 minus 119955 ) And that of 119868 is 119891 (119955) =

xiv Functional Derivatives

Let us now introduce the concept of a functional derivative Consider the

functional 119865119955 [119899] of Eqlrm(Ih1) A functional derivative denoted

() measures

how 119865[119899] changes when we make a small localized change of 119899(119955) at

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 40: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 40

copy Roi Baer

Let us give a more detailed definition Suppose we change the argument func-

tion in 119865[119899] by making an arbitrarily small change in (119955) rarr 119899(119955) + 120598Φ(119955) We

use an arbitrary function Φ(119955) multiplied by an infinitesimal number 120598 (that

is a number 120598 we intend to make vanishingly small at a later statge) In prin-

ciple for any finite 120598 the change

() in 119865 has a very complicated dependence

on 120598 But since 120598 is assumed small we can Taylor expand with respect to 120598Φ

and then ldquothrow outrdquo all terms beyond the term linear in 120598Φ This makes

sense since we assume that 120598 is a small as we want Thus the linear part of 120575119865

is a functional of 120598Φ and it is a linear functional We denote the linear re-

sponse kernel of this functional by the symbol

() called ldquothe functional de-

rivativerdquo Therefore we can write

120575119865 equiv 119865[119899 + 120598Φ] minus 119865[119899] = int120575119865

120575119899()120598Φ()119889 + 119874(120598 )

(Ih7)

Note that this the linear response kernel

() is independent of Φ(119955) and 120598

One technique to obtain the kernel is taking Φ(119955) = 120575(119955) = 120575(119955 minus ) To

make this notion precise we define

120575119865

120575119899()equiv lim

rarr

119865[119899 + 120598120575] minus 119865[119899]

120598

(Ih8)

Where 120575 = 120575(119955 minus ) is a delta function expressing the fact that we make a

localized change in 119899(119955) at r

Examples

Apply definition lrm(Ih6) to 119865119955 [119899] and 119868 [119899] Find the functional derivatives

and check that Eq lrm(Ih5) holds

Two shortcuts when ldquofunctional integratingrdquo one can often use regular dif-

ferentiation rules All one needs to remember is the following shortcuts

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 41: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 41

copy Roi Baer

120575119899(119909)

120575119899(119909 )rarr 120575(119909 minus 119909 )

120575119899prime(119909)

120575119899(119909 )rarr 120575 ( )

(Ih9)

And use them within ldquochain rulesrdquo ( ( ))

( )rarr 119891 (119899(119909))120575(119909 minus 119909 ) (where

119891 (119899) equiv

119891(119899)) An example of the use of this rule is the previous exercise

120575119868 120575119899()

equiv 119881 int120575119899(119955)

120575119899()119889 119903 = 119881 int120575(119955 minus ) 119889 119903 = 119881

(Ih10)

Another example is the following functional related to the kinetic energy

119869[119891] = int 119891 (119909) 119889119909

(Ih11)

The functional derivative is

120575119869[119891]

120575119891()= int

120575(119891 (119909) )

120575119891()119889119909

= int 2119891 (119909)120575119891 (119909)

120575119891()119889119909

= int 2119891 (119909)120575prime(119909 minus )119889119909

= minus2119891 () (Ih12)

Here we used the identity int 119892(119909)120575 (119909 minus )119889119909 = minus119892 () and the shortcut

( )

( )= 120575prime(119909 minus )

xv Invertible function functionals and their functional derivatives

Consider a ldquocontinuous set of functionalsrdquo which is ldquoindexedrdquo by a continu-

ous variable 119955 in some domain 119863 More precisely 119907[119899](119955) assigns a functional

to each 119955 in 119863 Let us suppose further that 119899 is a function of 119955 defined on the

same domain 119863 An example is the relation between the electrostatic potential

and the charge density

119907 [119899](119955) = int119899(119955 )

|119955 minus 119955 |119889 119903

(Ih13)

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 42: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 42

copy Roi Baer

For any charge distribution 119899(119955 ) the integral forms a the electrostatic poten-

tial 119907 (119955) The potential at a give point 119955 depends in principle on the charge

density at all points in space This is a very non-local dependence

What about the functional derivative If one makes a small change in the

function 119899 at the point 119955 120575119899(119955 ) how will that affect the functional at 119955 We

write this as a definition of the functional derivative

120575119907[119899](119955) = int120575119907[119899](119955)

120575119899(119955 )120575119899(119955 )119889 119903

(Ih14)

In the example above it is immediately noticeable that

120575119907 [119899](119955)

120575119899(119955 )=

1

|119955 minus 119955 |

(Ih15)

An interesting concept can arise when one knows in advance that the relation

119907[119899] is unique ie

119907[119899](119955) = 119907[119899 ](119955) for all 119955 hArr 119899(119955) = 119899 (119955) for all 119955

In this case one can also consider 119899 to be a functional of 119907 Indeed

120575119899[119907](119955) = int120575119899[119907](119955)

120575119907(119955 )120575119907(119955 )119889 119903

Once can combine this equation withlrm(Ih15) and see that

120575119899(119955) = int120575119899[119907](119955)

120575119907(119955 )int

120575119907[119899](119955 )

120575119899(119955 )120575119899(119955 )119889 119903 119889 119903

(Ih16)

This shows that

int120575119899(119955)

120575119907(119955 )

120575119907(119955 )

120575119899(119955 )119889 119903 = 120575(119955 minus 119955 )

(Ih17)

Thus there is an inverse relation between the functional derivatives This is

similar to what happens in usual derivatives with inverse functions When a

function 119892(119909) is invertible we can speak of 119909(119892) The change in 119892 due to a

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 43: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 43

copy Roi Baer

change in 119909 is 119889119892(119909) =

119889119909 Similarly 119889119909 =

119889119892 Therefore (

) =

(( ( )

)

) ( )

xvi Convex functionals

A function 119891(119909) is said to be convex if for any pair of abscisas 119909 and 119909 the

curve (119909 119891(119909)) described by 119891(119909) in the interval 119909 le 119909 le 119909 is always below

the straight line connecting (119909 119891(119909 )) and (119909 119891(119909 )) A more formal defini-

tion is that for any 0 lt 120582 lt 1 we have

119891(120582119909 + (1 minus 120582)119909 ) le 120582119891(119909 ) + (1 minus 120582)119891(119909 ) (Ih18)

A useful interpretation of the above definition for convexity is in terms of av-

erages We can view 120582119909 + (1 minus 120582)119909 as a weighted average of the two points

with positive weights (such an average is usually called a convex average) In

this sense we have convexity obey

119891(⟨119909⟩) le ⟨119891(119909)⟩ (Ih19)

This relation is much more general than just 2 points We can easily show that

for convex functions it holds for any averaging procedure ⟨119909⟩ = sum 119888 119909 and

⟨119891(119909)⟩ = sum 119888 119891(119909 ) with 119888 ge 0 and sum 119888 = 1 Eq lrm(Ih19) is sometimes called

Jensenrsquos inequality

One of the important implications of a convex function is that it cannot have a

local minimum either there is no minimum (for example 119891(119909) = 119890 ) or there

is just one global minimum If a function is known to be convex and to have a

minimum then any ldquodescentrdquo algorithm is surely to find the (global) mini-

mum This is useful

We can Taylor expand 119891(120582119909 + (1 minus 120582)119909 ) = 119891(120582(119909 minus 119909 ) + 119909 ) with respect to

120582 assuming now 120582 ≪ 1 Then

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 44: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 44

copy Roi Baer

120582119891(119909 ) + (1 minus 120582)119891(119909 ) ge 119891(120582(119909 minus 119909 ) + 119909 )

= 119891(119909 ) + nabla119891(119909 ) sdot 120582(119909 minus 119909 ) + 119874(120582 ) (Ih20)

After rearrangement and division by 120582 we set 120582 to zero and obtain

119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih21)

This relation if obeyed for all pairs of points is equivalent to the convexity of

the function By exchanging 119909 and 119909 we obtain also

nabla119891(119909 ) sdot (119909 minus 119909 ) ge 119891(119909 ) minus 119891(119909 ) ge nabla119891(119909 ) sdot (119909 minus 119909 ) (Ih22)

From Error Reference source not found we further have for 120582 rarr 0

119891(119909 ) + 120582nabla119891(119909 ) sdot 119906 le 119891(119909 + 120582119906)

= 119891(119909 ) + 120582nabla119891(119909 ) sdot 120575119909 + 120582 1

2120575119909 sdot nablanabla119891(119909 ) sdot 120575119909

+ 119874(120582 |119906| )

(Ih23)

From this after cancelling and dividing by 120582 we find

119906 sdot nablanabla119891(119909 ) sdot 119906 ge 0 for

arbitrary 119906 which means the Hessian nablanabla119891(119909 ) is positive definite another

characteristic of convex functions

A similar definition of convexity can be applied to functionals

119865[120582119899 + (1 minus 120582)119899 ] le 120582119865[119899 ] + (1 minus 120582)119865[119899 ] (Ih24)

And they too have the equivalent differential property which will be useful

below

119865[119899 ] minus 119865[119899 ] ge int(δ119865

120575119899(119955))

(119899 (119955) minus 119899 (119955))119889 119903

(Ih25)

An example for a convex functional useful for density functional theory is the

so-called Hartree energy functional

119864 [119899] =1

2∬

119899(119955)119899(119955 )

|119955 minus 119955 |119889 119903119889 119903

(Ih26)

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 45: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 45

copy Roi Baer

The second functional derivative of this function is |119955 minus 119955 | which is a mani-

festly positive matrix (this is easily seen by Fourier transforming the function

119907(119955) =

with result (119954) =

which is positive) Another example is the von-

Weizsacker kinetic energy functional

119864 [119899] = minus1

2intradic119899(119955)nabla radic119899(119955)119889 119903

(Ih27)

The physical meaning of this functional is that it gives 119873 times the kinetic en-

ergy of a single-particle with ground state wave function 120595(119955) = radic (119955)

where

119873 = int119899(119955)119889 119903 Again by showing the Hessian is positive definite we can

straightforwardly show this functional is convex First rewrite it as

119864 [119899] =1

2int(nablaradic119899(119955))

119889 119903 =

1

8int

(nabla119899)

119899119889 119903 (Ih28)

We drop the explicit 119955 in the last expression for brevity Then make an arbi-

trary but small perturbation 120598Φ(119955)

119864 [119899 + 120598Φ] =1

8int

(nabla119899 + 120598nablaΦ)

119899 + 120598Φ119889 119903 (Ih29)

And expand the denominator to second order in 120598 (119899 + 120598Φ) rarr

(1 minus 120598

+

120598 (

) + ⋯) Expanding the numerator and performing the multiplication

order by order one can then read off the second order contribution and after

some manipulation bring it to the form

120598 120575 119864 [119899 + 120598Φ] =120598

8int

Φ

119899(nabla119899

119899minus

nablaΦ

Φ)

119889 119903 (Ih30)

Clearly this second order term is absolutely non-negative for all perturba-

tions and at all physical densities This shows that the underlying functional

derivative Hessian is positive definite and thus the functional is convex

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 46: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 46

copy Roi Baer

A similar condition exists for the second functional derivative The Jensen in-

equality holds here as well

119865[⟨119899⟩] le ⟨119865[119899]⟩ (Ih31)

This relation in combination with the fact that 119891(119909) = 119890 is convex is useful

for developing mean field approaches in Statistical mechanics It was used by

Gibbs and later in a quantum context by Peierls[9]

i) Minimizing functionals is an important part

of developing a DFT

xvii Minimization of functions

We start with the problem of minimization of a simple 1D function Given a

function 119891(119909) we want to find a point 119909 such that the function is minimal It

is clear that the slope of 119891(119909) at 119909 = 119909 must be zero If the slope is positive

then we can go left (decrease 119909 ) and reduce 119891(119909) Same logic (but to right) if

it was negative Thus a necessary condition for a minimum is

119891 (119909 ) = 0 (Ii1)

Let us expand in a Taylors series the function around the point 119909 Clearly we

have

119891(119909) = 119891(119909 ) + 119891 (119909 )(119909 minus 119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii2)

However taking Eq Error Reference source not found into account

119891(119909) = 119891(119909 ) +1

2119891 (119909 )(119909 minus 119909 ) + ⋯ (Ii3)

When 119909 is extremely close to 119909 we may neglect high order terms and then we

see that Eq Error Reference source not found is the equation of a parabola

In order that 119891(119909) be a minimum we must have an ascent of 119891(119909) when we

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 47: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 47

copy Roi Baer

move away from 119909 no matter which direction we take (left or right) For

small displacements 119909 rarr 119909 we see from Eq

Error Reference source not found that the change in 119891(119909) is

119891 (119909)(119909 minus

119909 ) Since (119909 minus 119909 ) is always positive no matter the direction we move we

must demand that 119891 (119909 ) gt 0 as well Thus our necessary conditions for a

minimum are

119891 (119909 ) = 0 119886119899119889 119891 (119909 ) gt 0 (Ii4)

Now let us consider the case of functions of two variables 119891(119955) where

119955 = (119909119910) equiv (119909 119910) Notice that we use the transpose symbol 119879 to turn a col-

umn vector into a row vector Let us Taylor expand 119891(119955) around a point 119955

119891(119955) = 119891(119955 ) + nabla119891(119955 ) (119955 minus 119955 ) +1

2(119955 minus 119955 ) nablanabla119891(119955 )(119955 minus 119955 ) + ⋯

= 119891(119955 ) + 119944 (119955 minus 119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯

(Ii5)

Note that we use the notations

119944 = 119944(119955 ) = nabla119891(119955 ) =

(

120575119891

120575119909120575119891

120575119910)

119867 = 119867(119955 ) =

(

120575 119891

120575119909

120575 119891

120575119910120575119909

120575 119891

120575119909120575119910

120575 119891

120575119910 )

119955

(Ii6)

The vector 119944(119955) is called the gradient and the matrix 119867(119955) the Hessian at 119955

Note that the Hessian is a symmetric matrix since the order of differentiation

is immaterial When 119955 is a minimum moving infinitesimally away from it in

any direction will not change the function Why We can show this leads to a

contradiction Let 119941 be an arbitrary direction If the function decreases when

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 48: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 48

copy Roi Baer

moving from 119955 in that direction no matter how small the step size then this

contradicts that 119955 is a minimum Now if 119941 is an ascent direction then minus119941

will be a descent direction - again a contradiction to 119955 being a minimum

Thus we conclude that 119891 cannot change (to first order) or in other words the

gradient 119944 of 119891(119955) at 119955 must be zero Then Eq

Error Reference source not found becomes

119891(119955) = 119891(119955 ) +1

2(119955 minus 119955 ) 119867 (119955 minus 119955 ) + ⋯ (Ii7)

Continuing in order for this function to have a minimum at 119955 the second

term on the right hand side must always be positive Thus Hessian matrix

must be such that for any vector ne 119907 119919119907 gt 0 A matrix having this property

is called positive definite (PD) When a PD matrix is symmetric then its ei-

genvalues must be strictly positive We can summarize the necessary condi-

tions for a minimum at 119955

119944 = nabla119891(119955 ) = 0 119886119899119889 119919 119894119904 119875119900119904 119863119890119891 (Ii8)

As an example let us take the function 119891(119909) = 119909 + 119910 This function is always

non-negative so its minimum is at 119909 = 119910 = 0 where it obtains the value 0

The gradient is indeed zero at (119909 119910) = (00)

(120597119891

part119910)( )

= (2119910)( ) = 0 (Ii9)

And the Hessian is 2119868 where 119868 is the 2times2 identity matrix thus it is trivially

PD

xviii Constrained minimization Lagrange multipliers

Next we discuss constrained minimization Suppose we want to minimize

119891(119909 119910) under a constraint that 119910 = 119892(119909) This is ldquoeasyrdquo since it can be trans-

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 49: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 49

copy Roi Baer

formed into an unconstrained minimization All we have to do is minimize

119865(119909) = 119891(119909 119892(119909)) This will give

119865 (119909) = (120597119891

120597119909)( ( ))

+ (120597119891

120597119910)( ( ))

119892 (119909)

119865 (119909) = (120597 119891

120597119909 )( ( ))

+ 2(120597 119891

120597119909120597119910)( ( ))

119892 (119909) + (120597119891

120597119910)( ( ))

119892primeprime(119909)

+ (120597 119891

120597119910 )( ( ))

119892 (119909)

(Ii10)

And we can then search for 119909 such that 119865 (119909 ) = 0 and 119865 (119909 ) gt 0

But sometimes it is not possible to write the constraint directly as 119910 = 119892(119909) A

more general form is ℎ(119909 119910) = 0 This is also more symmetric Consider the

situation of minimizing 119891(119909 119910) under the constraint ℎ(119909 119910) = 0 Suppose the

point depicted in the following figure is this minimum

This means that moving slightly along the isopleth of ℎ = 0 from 119955 will not

change 119891 if it decreases 119891 this is not a minimum and if it increases 119891 we will

move in the opposite direction and 119891 will decrease

h=1

h=0 h=-1

nabla119891

nabla119891

nablaℎ

nablaℎ

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 50: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 50

copy Roi Baer

Thus at 119955 the gradient of 119891 nabla119891(119955 ) must be normal to the line ℎ = 0 Fur-

thermore by definition the gradient of ℎ is normal to its isopleth Thus the

necessary condition for minimum is that both vectors point to the same direc-

tion and so are proportional We denote the proportionality constant by 120582

nabla119891(119955 ) = 120582 nablaℎ(119955 )

ℎ(119955 ) = 0 (Ii11)

Lagrange noticed that this relation and wrote a Lagrangian for the constrained

problem

119871(119955 120582) = 119891(119955) minus 120582ℎ(119955) (Ii12)

The new variable 120582 is called a Lagrange multiplier The necessary conditions

for a minimum in Eq Error Reference source not found translate into the

condition of finding a stationary point of 119871

nabla119871(119955 120582 ) = 0 (Ii13)

Where nablaequiv (120597 120597 120597 120597 ) The derivative with respect to 120582 gives us back the

constraint As an example consider minimizing 119891(119909 119910) = 119909 + 119910 under the

constraint ℎ(119909 119910) = 119909 + 119910 minus 1 (ie find the point on the unit circle for which

119909 + 119910 is minimal) We have

119871(119955 120582) = (119909 + 119910) minus 120582(119909 + 119910 minus 1) (Ii14)

The necessary condition is

nabla119871 = (1 minus 2120582 119909 1 minus 2120582 119910 minus(119909 + 119910 minus 1)) = (000)

rArr 119909 = 119910 =1

2120582 2 (

1

2120582 )

= 1 (Ii15)

The solution is thus

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 51: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 51

copy Roi Baer

120582 = plusmn1

radic2 119909 = 119910 = plusmn

1

radic2

(Ii16)

Since these are necessary conditions we need to consider them further (they

might correspond to a maximum) The minimum corresponds only to the

negative solution

The method of Lagrange multipliers is often more convenient to work with

than direct replacement The problem is thus that of finding a minimum of

119891(119955 hellip 119955 ) under the 119872 lt 119873 constraints 119945(119955 hellip 119955 ) = or ℎ (119955 hellip 119955 ) = 119888

119894 = 1hellip 119872 is

find 119955 such that 119945(119955 ) = for which ∶ 119891(119955 ) le 119891(119955)

for any 119955 such that 119945(119955) = (Ii17)

To facilitate such a search we formulate the Lagrangian function

119871(119955 120640 ) = 119891(119955) minus sum120582 (ℎ (119955) minus 119888 )

(Ii18)

Our plan is to find the position of 119955 which minimizes 119871 for any choice of 120640 and

then change until 119945(119955) = At 120640 we have 119871 assuming a minimum at a point

119955 A necessary condition for the constrained minimum to be achieved at the

point 119955 and with the Lagrange multipliers 120640 is

nabla 119871( 120524 ) = 0

nabla120524119871(119955 120640 ) = 119945(119955 ) minus = 0

(Ii19)

Note that 120640 is not necessarily a minimizer of 119871 In fact the opposite is tue 120640

is a maximizer of 119871 and the search for constrained minimizations is a search

for saddle points (see ref [10] for a method to solve such a problem on the

large scale)

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 52: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 52

copy Roi Baer

It is interesting now to ask how 119891(119955) changes if we change the value of the

constraint 119888 Indeed when the constraints are changed the optimized point

and Lagrange multiplier can change so the Lagrangian is changed

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 ) (Ii20)

Now because of Eq lrm(Ii19) we find

120575119871(119955 120640 ) = 119871(119955 + 120575119955 120640 + 120575120640 + 120575 ) minus 119871(119955 120640 )

= 119871(119955 120640 + 120575 ) minus 119871(119955 120640 ) (Ii21)

Thus

119889

119889119888 119871(119955 120582 ) =

120575

120575119888 119871(119955 120582 ) = 120640

(Ii22)

Since 119871(119955 120640 ) = 119891(119955 ) we find

119889

119889119888 119891(119955 ) = 120640

119900119903 nabla 119891(119955 ) = 120640

(Ii23)

This equation reveals the ldquomeaningrdquo of the Lagrange multipliers 120582 at the op-

timal point they are equal to the rate at which the optimal value of the mini-

mized function 119891 changes when 119888 the value of the 119894 constraint is changed

This is an important result which we use below whenever we want to give

physical significance to Lagrange multipliers

xix Minimization of functionals

The same considerations for functions apply for functionals Given a func-

tionalI f a necessary condition for its minimum is

120575119868

120575119891(119955)= 0

(Ii24)

For example consider a 1D classical particle of mass 119899 in a potential well 119907(119909)

The action 119878 is a functional of the trajectory of this particle

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 53: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 53

copy Roi Baer

119878[119909] = int [1

2119898(119905) + 119907(119909(119905))] 119889119905

(Ii25)

For any trajectory 119909(119905) between times 119905 and 119905 119871[119909] returns a number La-

grange showed that finding the trajectory that makes 119871[119909] stationary (alt-

hough not necessarily minimal) under the condition that 119909(119905 ) = 119909 and

119909(119905 ) = 119909 are given and thus not varied is equivalent to solving Newtons

equations under these same constraints The functional differentiation of the

kinetic energy is performed with

120575

120575119909(119905)int (119905 ) 119889119905

= 2int (119905 )120575(119905 )

120575119909(119905)119889119905

= 2int (119905 )(119905 minus 119905 )119889119905

= minus2int (119905 )120575(119905 minus 119905 )119889119905

= minus2(119905) (Ii26)

The condition for stationarity of the action under changes in the trajectory

which leave the edges intact give

0 =120575119878

120575119909(119905)= minus119898(119905) minus 119907 (119909(119905))

(Ii27)

From which we obtain Newtons equation 119898(119905) = minus119907 (119909(119905)) This equation

must be solved subject to the given constraints at the endpoints ie 119909(119905 ) = 119909

and 119909(119905 ) = 119909

Minimizing functionals with constraints can again be done using Lagrange

multipliers Then one defines

119871[119891 120582] = 119868[119891] + 120582ℎ[119891] (Ii28)

And the necessary conditions are

120575119871

120575119891(119955)|

= 0 120575119871

120575120582|

= ℎ[119891 ] = 0 (Ii29)

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 54: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 54

copy Roi Baer

For example let us solve the following problem Find the shape in a plane of a

closed contour encircling maximal area under the constraint of a given cir-

cumference 119897 We need to find the contour given by 119903(120579) with 119903(120579 + 2120587) =

119903(120579) which gives zero variation to

119871[119903 120582] = 119878[119903] + 120582(119871[119903] minus 119897 )

Where 119878 is the area and 119871 is the circumference We limit ourselves to curves

that can be uniquely defined A point on the curve is defined by the angle 120579

and the distance from the origin 119903(120579) The vector to a point on the curve and

its derivative is

119955(120579) = 119903(120579)(cos 120579 sin 120579)

119955 (120579) = 119903 (120579)(cos 120579 sin 120579) + 119903(120579)(minussin 120579 cos 120579)

The area of a small arc from 120579 to 120579 + 119889120579 is 119889119878 =

|119955(120579) times 119955(120579 + 119889120579)| =

|119955(120579) times 119955prime(120579)|119889120579 Notice that |119955 times 119955 | = 119903

and so 119889119878 =

119903(120579) 119889120579

The square circumference is 119889119871 = (119955(120579 + 119889120579) minus 119955(120579)) = (119955 (120579))

119889120579 Thus

119889119871 = |119955 (120579)|119889120579 = radic119903(120579) + 119903 (120579) 119889120579 Thus the area is a functional is

119878[119903] = int1

2119903(120579) 119889120579

119871[119903] = int radic119903(120579) + 119903 (120579) 119889120579

The functional derivative of S is easy

120575119878[119903]

120575119903(120579)= 119903(120579)

The functional derivative of 119871 is computed by adding a ldquosmallrdquo function 120598(120579)

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 55: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 55

copy Roi Baer

2 2 2

02

2 2

0

22 2

2 20

2 2

2 21

l r r d l

r r r r d l

r rr r d l

r r

Then we can use 211 1

2O so

2

2 20

r rl d

r r

Integrating by parts the second term (remembering that the end terms drop

because 120579 = 2120587 is the same point as 120579 = 0

120575119897 = int119903120598

radic119903 + 119903 119889120579

minus int (119903

radic119903 + 119903 )

120598119889120579

We thus get

120575119897

120575119903(120579)=

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

We define the Lagrangian

119871[119903] = 119878[119903] minus 120582(119897[119903] minus 119897 )

And get the maximum from

0 =120575119871

120575119903(120579)= 119903 minus 120582 (

119903

radic119903 + 119903 minus (

119903

radic119903 + 119903 )

)

Clearly a solution is 119903(120579) = 120582 a constant since then all derivatives are zero

The curve is then circle of radius 120582 and given the circumference we have

120582 = 119897 2120587

EXERCISES FOR CHAPTER 1

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital

Page 56: (I) Introductory background topics · Electron Density Functional Theory Page 1 © Roi Baer (I) Introductory background topics a) The Schrödinger equation XXX b) Bosons and Fermions

Electron Density Functional Theory Page 56

copy Roi Baer

1) Consider a system of two particles in a harmonic well Each particle repels the other

The potential energy is 119881(119909 119909 ) =

119896119909

+

119896119909

minus

119902(119909 minus 119909 )

We assume that

the system is bound (so 0 lt 2119902 lt 119896) (a) Write down the exact Hamiltonian for this system assuming the mass are 119898 and 119898 (b) Make a transformation separating

the Hamiltonian into two uncoupled particles the center of mass 119883 =

and the reduced mass 119909 = 119909 minus 119909 Write the Hamiltonian of each part and deter-mine the mass of each particle What are the masses when 119898 ≫ 119898 (c) Write down the energy levels of the system and the eigenfunctions (d) Now make a Born-Oppenheimer approximation Assuming 119898 ≫ 119898 determine the Born Oppenhei-mer potential surface (obtained when 119909 is considered a parameter) Solution The Hamiltonian is

119867 = minusℏ

2119898 120597 minus

2119898 120597 +

1

2119896119909

+1

2119896119909

minus1

2119902(119909 minus 119909 )

In the new coordinates

119883 =(119898 119909 + 119898 119909 )

119872 119909 = 119909 minus 119909 119909 = 119883 minus

119898

119872119909 119909 = 119883 +

119898

119872119909

If 119891(119909 119909 ) is any function than we can define the ldquosamerdquo scalar function as

119865(119909 119883) = 119891 (119883 minus

119909 119883 +

119909)

2) Calculate the correlation in the case of a singlet where both electrons are in the ground state orbital and when one is in the ground state orbital and the other in the first excited orbital