UNIVERSITY OF MINNESOTA This is to certify that I …dtraian/tony-thesis.pdfUNIVERSITY OF MINNESOTA...
Transcript of UNIVERSITY OF MINNESOTA This is to certify that I …dtraian/tony-thesis.pdfUNIVERSITY OF MINNESOTA...
UNIVERSITY OF MINNESOTA
This is to certify that I have examined this bound copy of a masters thesis by
Anthony Carlson
and have found that it is complete and satisfactory in all respects,
and that any and all revisions required by the final
examining committee have been made.
Traian Dumitrica
Name of Faculty Adviser
Signature of Faculty Adviser
Date
GRADUATE SCHOOL
An Extended Tight-Binding Approach for Modeling
Supramolecular Interactions of Carbon Nanotubes
A THESIS
SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL
OF THE UNIVERSITY OF MINNESOTA
BY
Anthony Carlson
IN PARTIAL FULFILLMENT OF THE REQUIREMENTS
FOR THE DEGREE OF
Master of Science
Traian Dumitrica, Adviser
October 2006
c© Anthony Carlson October 2006
Contents
Chapter 1 Introduction 1
1.1 Fullerenes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 Production . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.3 Research interest in carbon nanotubes . . . . . . . . . . . . . . . . 6
1.4 Properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.5 CNT-based applications . . . . . . . . . . . . . . . . . . . . . . . . 9
1.6 Role of van der Waal forces . . . . . . . . . . . . . . . . . . . . . . 10
1.7 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
Chapter 2 Nature of bonding and mathematical models 15
2.1 Bonding . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
2.2 Atomistic Forces . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
2.3 Electromagnetic Cohesion . . . . . . . . . . . . . . . . . . . . . . . 18
2.3.1 Repulsive force . . . . . . . . . . . . . . . . . . . . . . . . . 19
2.3.2 Attraction in physical bonding . . . . . . . . . . . . . . . . . 19
i
2.3.3 Van der Waals forces and non-ideal gasses . . . . . . . . . . 19
2.3.4 Classification of van der Waals forces . . . . . . . . . . . . . 20
2.4 Mathematical models . . . . . . . . . . . . . . . . . . . . . . . . . . 23
2.4.1 Empirical . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
2.4.2 Semi-empirical . . . . . . . . . . . . . . . . . . . . . . . . . 24
2.4.3 First principles . . . . . . . . . . . . . . . . . . . . . . . . . 25
Chapter 3 Graphite structure and interlayer properties 26
3.1 Graphene . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
3.2 Graphite stacking patterns . . . . . . . . . . . . . . . . . . . . . . . 28
3.2.1 Corrugation of graphene planes . . . . . . . . . . . . . . . . 30
3.2.2 Graphite unit cell and Brillouin zone . . . . . . . . . . . . . 31
3.3 Graphene as extended molecules . . . . . . . . . . . . . . . . . . . . 31
3.3.1 Experimental exfoliation energy . . . . . . . . . . . . . . . . 33
3.3.1.1 Girifalco method . . . . . . . . . . . . . . . . . . . 33
3.3.1.2 Benedict method . . . . . . . . . . . . . . . . . . . 33
3.3.1.3 Zacharia method . . . . . . . . . . . . . . . . . . . 33
3.3.1.4 Exfoliation energy summary . . . . . . . . . . . . . 34
3.4 Z-axis compressibility . . . . . . . . . . . . . . . . . . . . . . . . . . 34
3.5 Vibrational properties . . . . . . . . . . . . . . . . . . . . . . . . . 35
3.6 Ab-initio studies of graphite . . . . . . . . . . . . . . . . . . . . . . 36
3.6.1 Hartree-Fock treatment of graphite . . . . . . . . . . . . . . 36
ii
3.6.2 Density Functional Theory treatment of graphite . . . . . . 37
3.7 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
Chapter 4 Existing models for dispersion interactions in graphite 41
4.1 Empirical . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
4.2 Semi-empirical and first principles models . . . . . . . . . . . . . . 44
4.3 Motivation for what is to be done . . . . . . . . . . . . . . . . . . . 46
Chapter 5 Modeling electronic and repulsive interaction with a
tight binding formalism 48
5.1 Linear combination of atomic orbitals . . . . . . . . . . . . . . . . . 49
5.2 Schrodinger equation in LCAO approximation . . . . . . . . . . . . 49
5.3 The basis and atomic centered orbitals . . . . . . . . . . . . . . . . 50
5.4 Periodic solids . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
5.5 The Hamiltonian and the two-center approximation . . . . . . . . . 53
5.6 Slater-Koster parametrization . . . . . . . . . . . . . . . . . . . . . 55
5.6.1 Orbital decomposition . . . . . . . . . . . . . . . . . . . . . 55
5.6.2 s-p decomposition . . . . . . . . . . . . . . . . . . . . . . . . 56
5.6.3 Construction of tight binding matrices . . . . . . . . . . . . 59
5.7 Tight Binding Total Energy . . . . . . . . . . . . . . . . . . . . . . 60
5.8 Fitting TB parameters . . . . . . . . . . . . . . . . . . . . . . . . . 62
5.8.1 Density functional based tight binding . . . . . . . . . . . . 63
5.9 Use of Porezag parametrization and current code . . . . . . . . . . 64
iii
Chapter 6 Quantum mechanical origin of dispersion forces 65
6.1 Main assumptions in modeling dispersion interactions . . . . . . . . 66
6.2 Microscopic dispersion theory (London) . . . . . . . . . . . . . . . . 67
6.2.1 System description . . . . . . . . . . . . . . . . . . . . . . . 67
6.2.2 The perturbed system . . . . . . . . . . . . . . . . . . . . . 68
6.2.3 Energy corrections in perturbation theory . . . . . . . . . . 69
6.2.4 Application to Hydrogen . . . . . . . . . . . . . . . . . . . . 70
6.2.5 Higher order corrections . . . . . . . . . . . . . . . . . . . . 73
6.2.6 Applying London’s dispersion theory in Calculating C6 for
graphite . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73
6.2.6.1 London’s form and polarizability . . . . . . . . . . 73
6.2.6.2 Slater-Kirkwood approximation . . . . . . . . . . . 75
6.2.6.3 Kirkwood approximation . . . . . . . . . . . . . . . 76
6.3 Macroscopic dispersion theory (Lifshitz + Hamaker) . . . . . . . . . 76
6.3.1 Lifshitz dispersion theory . . . . . . . . . . . . . . . . . . . . 77
6.3.1.1 Dielectric response and ǫ(iξ) . . . . . . . . . . . . . 77
6.3.1.2 Lifshitz dispersion theory . . . . . . . . . . . . . . 78
6.3.2 Lifshitz theory applied to the evaluation of Hamaker con-
stants for graphite . . . . . . . . . . . . . . . . . . . . . . . 79
6.3.3 Hamaker constant derivation . . . . . . . . . . . . . . . . . . 80
6.3.3.1 Point-Surface Interaction . . . . . . . . . . . . . . 81
iv
6.3.3.2 Surface-Surface Interaction . . . . . . . . . . . . . 82
6.4 Summary of calculated C6 coefficients . . . . . . . . . . . . . . . . . 84
Chapter 7 Damping functions 85
7.1 Hydrogen damping function . . . . . . . . . . . . . . . . . . . . . . 87
7.2 Damping functions in literature . . . . . . . . . . . . . . . . . . . . 87
Chapter 8 Tight-binding plus dispersion parametrization 90
8.1 Total energy definition . . . . . . . . . . . . . . . . . . . . . . . . . 91
8.2 Evaluation of tight-binding energy . . . . . . . . . . . . . . . . . . . 93
8.2.1 Orbital expansion . . . . . . . . . . . . . . . . . . . . . . . . 94
8.3 Parameter fitting procedure . . . . . . . . . . . . . . . . . . . . . . 95
8.3.1 Equilibrium interlayer spacing . . . . . . . . . . . . . . . . . 97
8.3.2 Phonon frequency calculation . . . . . . . . . . . . . . . . . 97
8.3.3 Compressibility calculation . . . . . . . . . . . . . . . . . . . 99
8.4 Optimization results . . . . . . . . . . . . . . . . . . . . . . . . . . 100
8.5 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101
Chapter 9 TBD description of carbon nanotube interactions 107
9.1 Tube-graphene interactions . . . . . . . . . . . . . . . . . . . . . . . 108
9.2 Tube-tube interactions . . . . . . . . . . . . . . . . . . . . . . . . . 109
9.3 Universal binding curve . . . . . . . . . . . . . . . . . . . . . . . . . 114
9.4 Nanotube loops . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116
v
9.5 Nanotube bundles . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116
9.6 Multiwalled tube . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119
9.7 C60 and carbon nanotube interactions . . . . . . . . . . . . . . . . . 120
9.8 Tube-tube MD simulation . . . . . . . . . . . . . . . . . . . . . . . 122
Chapter 10 Conclusion and future work 125
Appendix A Tight-binding + dispersion molecular dynamics overview127
A.1 Calculation of Forces . . . . . . . . . . . . . . . . . . . . . . . . . . 129
A.1.1 Ionic and van der Waal’s forces . . . . . . . . . . . . . . . . 130
A.1.2 Tight-binding band structure forces . . . . . . . . . . . . . . 131
A.2 Periodic boundary conditions . . . . . . . . . . . . . . . . . . . . . 132
Appendix B Quantum Mechanics Overview 135
B.1 Observables and expectation values . . . . . . . . . . . . . . . . . . 136
B.2 Dirac Notation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137
B.3 Time-dependant Schrodinger equation . . . . . . . . . . . . . . . . 137
B.4 Time-independent Schrodinger equation . . . . . . . . . . . . . . . 138
B.5 General Hamiltonian . . . . . . . . . . . . . . . . . . . . . . . . . . 139
B.6 Complexity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 140
B.7 Born-Oppenheimer approximation . . . . . . . . . . . . . . . . . . . 140
B.8 One-Electron Approximation . . . . . . . . . . . . . . . . . . . . . . 141
B.9 Variational Method . . . . . . . . . . . . . . . . . . . . . . . . . . . 142
vi
B.10 Extended example . . . . . . . . . . . . . . . . . . . . . . . . . . . 143
Appendix C Non-orthogonal Hellmann-Feynman forces 146
C.1 Model Review . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 146
C.2 Force derivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 147
Appendix D Multipole Expansion 152
Appendix E Perturbation Theory 156
E.1 First order correction to energy . . . . . . . . . . . . . . . . . . . . 158
E.2 First order correction to the wavefunction . . . . . . . . . . . . . . 159
E.3 Second order correction to the energy . . . . . . . . . . . . . . . . . 160
Appendix F Optimization 163
vii
List of Figures
1.1 Graphite to CNT . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.2 Description of chirality . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.3 Armchair, zigzag, and chiral tubes . . . . . . . . . . . . . . . . . . . 4
1.4 Exponential growth in nanotube papers . . . . . . . . . . . . . . . . 6
1.5 Nanotube twine . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
1.6 Hexagonal lattice of graphene and AFM scan . . . . . . . . . . . . . 11
1.7 CNT bundle cross section . . . . . . . . . . . . . . . . . . . . . . . 12
2.1 Rare gas dimer energy curve . . . . . . . . . . . . . . . . . . . . . . 16
2.2 Schematic representations of van der Waals energy contributions . . 21
3.1 Hexagonal lattice of graphene and AFM scan . . . . . . . . . . . . . 28
3.2 Graphite stacking patterns . . . . . . . . . . . . . . . . . . . . . . . 29
3.3 AFM scan of graphite showing Pz orbital corrugation . . . . . . . . 30
3.4 Hexagonal lattice of graphene and AFM scan . . . . . . . . . . . . . 32
viii
3.5 Raman active interlayer modes . . . . . . . . . . . . . . . . . . . . 35
4.1 Interlayer energy plots from existing empirical models . . . . . . . . 42
5.1 Schematic of relevant s and p orbital orientations . . . . . . . . . . 56
5.2 S and P orbital decomposition . . . . . . . . . . . . . . . . . . . . . 57
6.1 Hydrogen dimer schematic . . . . . . . . . . . . . . . . . . . . . . . 70
6.2 Semi-infinite slabs schematic . . . . . . . . . . . . . . . . . . . . . . 78
6.3 Point-surface Hamaker integration . . . . . . . . . . . . . . . . . . . 81
6.4 Surface-Surface Hamaker integration . . . . . . . . . . . . . . . . . 83
7.1 Tang’s damping function . . . . . . . . . . . . . . . . . . . . . . . . 89
8.1 Cutoff function . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92
8.2 Tight-binding interlayer energy . . . . . . . . . . . . . . . . . . . . 94
8.3 Modified tight-binding interlayer energy . . . . . . . . . . . . . . . 96
8.4 Fitted tight-binding plus dispersion - I . . . . . . . . . . . . . . . . 101
8.5 Fitted tight-binding plus dispersion - II . . . . . . . . . . . . . . . . 102
8.6 Graphite energy landscape . . . . . . . . . . . . . . . . . . . . . . . 102
8.7 Hydrostatic pressure effects on the E2g(1) mode . . . . . . . . . . . 105
8.8 Molecular dynamics results of graphene . . . . . . . . . . . . . . . . 106
9.1 Lock-in orientations on graphite . . . . . . . . . . . . . . . . . . . . 109
9.2 Interaction energy scans of tubes on graphite . . . . . . . . . . . . . 110
ix
9.3 Tube-tube orientation . . . . . . . . . . . . . . . . . . . . . . . . . 111
9.4 Tube-tube θ, dz landscape . . . . . . . . . . . . . . . . . . . . . . . 111
9.5 Tube-tube interaction energy scans . . . . . . . . . . . . . . . . . . 113
9.6 Tube-tube cohesive energy . . . . . . . . . . . . . . . . . . . . . . . 114
9.7 Carbon nanotube/tube/graphene universal curve . . . . . . . . . . 115
9.8 Hexagonal lattice of graphene and AFM scan . . . . . . . . . . . . . 118
9.9 Energy scan on CNT bundle . . . . . . . . . . . . . . . . . . . . . . 118
9.10 Nested (5,5)‖(10,10) CNT pair . . . . . . . . . . . . . . . . . . . . . 119
9.11 Nested (5,5)‖(10,10) energy landscape . . . . . . . . . . . . . . . . 121
9.12 Nested (5,5)‖(10,10) energy translation and rotation . . . . . . . . . 121
9.13 C60 external to a (10,10) tube energy scan . . . . . . . . . . . . . . 122
9.14 C60 external to a (10,10) tube energy scan . . . . . . . . . . . . . . 123
9.15 Peapod micrograph . . . . . . . . . . . . . . . . . . . . . . . . . . . 123
9.16 MD results of two (5,5) tubes . . . . . . . . . . . . . . . . . . . . . 124
A.1 Molecular dynamics flowchart . . . . . . . . . . . . . . . . . . . . . 129
A.2 Schematic of periodic boundary conditions . . . . . . . . . . . . . . 133
D.1 Two separated charge clouds A and B . . . . . . . . . . . . . . . . . 153
x
List of Tables
3.1 Summary of experimental graphite exfoliation energies . . . . . . . 34
3.2 Summary of DFT calculations of graphite . . . . . . . . . . . . . . 38
4.1 Results of LJ empirical models with experimental data comparison . 43
6.1 Summary of C6 dispersion coefficients for graphite . . . . . . . . . . 84
8.1 Energy convergence: K-point selection . . . . . . . . . . . . . . . . 93
8.2 Optimized parameters for the tight-binding plus dispersion model . 100
9.1 Tube-graphene and tube-tube interactions summary . . . . . . . . . 112
9.2 Comparison of CNT bundle results . . . . . . . . . . . . . . . . . . 117
B.1 Operators in quantum mechanics . . . . . . . . . . . . . . . . . . . 136
D.1 Tabulation of selected terms in the multipole expansion . . . . . . . 155
xi
1Introduction
1.1 Fullerenes
The element carbon is the first element in group IV of the Periodic Table. It has
the ability to chemically bond with itself and other elements readily via orbital
hybridization. The whole of organic chemistry is dedicated to the study of the
millions of carbon-based molecules. There is a diverse variety of carbon solids
including a few crystalline allotropes and many amorphus and semi-crystalline
solids (such as lonsdaleite, and chaoite), but the most recognized forms are the
crystalline forms of diamond and graphite.
In the diamond crystal, each carbon atom is tetrahedrally bonded to its four
1
nearest neighbors via sp3 hybridization. The most stable allotrope of carbon is
graphite, which is nearly iso-energetic with diamond (∆E ∼ 0.014 eV/atom [172]).
Graphite has an interesting layered structure composed of planar sheets, called
graphene. In graphene, carbon atoms are trigonally bonded to their three nearest
neighbors by strong sp2 bonds, forming a hexagonal network (see left side of Figure
1.1). To build graphite, graphene sheets stack on top of one another. There is no
chemical bonding between layers; rather these layers are weakly bound by van der
Waals (vdW) forces [133]. This distinction between chemical and physical (vdW)
bonding will be addressed later in § 2.3.
While these two allotropes of carbon have been known for quite some time, new
forms, termed fullerenes, have been identified. Fullerenes are hollow, all carbon
structures in the form of balls, tubes and other closed structures with walls that
are one-atomic layer thick. A notably stable soccer ball-like fullerene is C60, com-
monly referred to as a “bucky ball”. It is named after Richard Buckminster Fuller,
an American author and architect whose geodesic dome had a very similar con-
struction. Tubular fullerenes are referred to as carbon nanotubes (CNTs). These
elongated molecules with nanometer-size diameters have hemispherical capped
ends, and length-to-diameter ratios typically of about 1000, which makes them
truly one-dimensional structures. The wall of these tubes share the same hexago-
nal structure as that of individual graphene sheets, except that they are figuratively
rolled up as is shown in Figure 1.1. “Figuratively” is emphasized because it de-
scribes their structure, not their manufacture. This figurative connection is useful
also because many of the physical properties of nanotubes can be derived from
graphite’s properties. In addition, adjacent tubes interact via vdW forces rather
than chemical bond just as the adjacent sheets of graphene in the graphite crystal
Ignoring now the issue of the capped ends, tubes are classified based on their
curvature and chirality. In an unwrapped representation of the nanotube, the
chirality describes how the hexagonal structure of the graphene lattice is orientated
2
Figure 1.1: Graphite to CNT from [21]
with respect to the axis of the tube. Figure 1.2 again shows a graphene lattice 1.
The point in the top left is the origin. Picking this point up and curling the sheet
onto one of the labeled atoms that fall between the two vectors creates a unique
CNT. Due to the symmetry of the graphene lattice only those atoms between the
two vectors need to be considered for a unique tube. The vector pointing from
the origin to the atom picked to describe the CNT is known as the chiral vector
( ~Ch). This vector is conveniently defined in terms of the graphene lattice vectors
labeled ~a and ~b, shown in Figure 1.2, as
Figure 1.2: The chirality is described by the nanotube wrapping angle of thegraphene sheet
1The lattice vectors and unit cell shown Figure 1.2 are differen’t but equivalent to thosedescribed later in § 3.1
3
~Ch = n~a+m~b ≡ (n,m), (1.1)
where n and m are integers. The angle this vector forms with respect to vector
~a is the chiral angle (χ). Tubes with χ = 0◦ are known as zigzag tubes and
are described by the chiral vector (n, 0). The moniker of zigzag comes from the
sawtooth like structure that is found around the circumference of the tube as can
be seen in Figure 1.3. The other extreme is χ = 30◦, or armchair tubes. These
tubes are defined by the case when n = m, otherwise noted by (n, n). All other
tubes that fall between these limits are referred to as chiral tubes (n,m).
Figure 1.3: Armchair, zigzag, and chiral tubes
Nanotubes can exist as single layered structures known as single-walled carbon
nanotubes (SWCNTs) or be composed of several coaxial SWCNTs nested inside
one another and referred to as multi-walled carbon nanotubes (MWCNTs). The
smallest observed SWCNT diameter is ∼4 A (1 A = 1 × 10−10 m), but most
often form in diameters around 10 A [134, 160]. For a spatial reference, human
hair has a median diameter of ∼ 1 × 106A. MWCNTs diameters can get quite
4
large. Starting with the smallest observed SWCNT (4 A) at the core successive
SWCNT encapsulate one another with an average wall-to-wall spacing of 3.4-3.9
A, increasing with decreasing tube diameter [84]. MWCNTs with diameters larger
than 300 A in diameter, comprising up to 100 SWCNTs, are not uncommon [160].
As mentioned, the discovery of fullerenes is quite recent. Smalley et. al, in 1985
were the first to experimentally verify the existence of the C60 fullerene [99]. The
discovery of a MWCNT, via high-resolution transmission electron microscopy (HR-
TEM), goes to to Iijima in 1991 [74]. (This distinction is being debated as dis-
cussed in a recent editorial by Monthioux et al. [121] who points out that in 1952
a Russian journal published a TEM image of a MWCNT [141], and later in 1976
Endo et al. also published a TEM image of a MWCNT [129]. Neither of these ar-
ticles made a clear distinction of the nature of their photographic results as Iijima
did.) The first evidence of a SWCNT was published two years later in two papers
[12, 76].
1.2 Production
While trace amounts of fullerenes are found in nature, i.e. in the soot of a candle
flame, hight quality CNTs are produced artificially. Production methods include
electric arc discharge between two graphite electrodes [37, 75], laser vaporization
of graphite [57], chemical vapor deposition (CVD) of hydrocarbon over metallic
catalysts seeds [28, 155], and other novel methods. All these methods have differ-
ent yields, some preferentially produce SWCNTs (regarded as higher quality, as
opposed to MWCNTs). Differing lengths and diameters are present and the prod-
uct usually needs a post-production purification step. Early production of CNTs
resulted in tubes that where quite short in length, < 10 µm. Advanced methods
have been developed to produce longer CNTs, with reports of several centimeters
for some SWCNTs [179].
5
1.3 Research interest in carbon nanotubes
After Iijima’s 1991 paper in Nature there was an explosion of interest in carbon
nanotube research that continues today. Figure 1.4 shows the nearly exponential
growth of published papers on nanotubes, with the year 2005 averaging 7 peer-
reviewed articles published a day.
19931994199519961997199819992000200120022003200420050
500
1000
1500
2000
2500
Year of publication
Num
ber
of p
aper
s
Figure 1.4: Graph showing the rapid increase in the number of published pa-pers on nanotubes. Data points are the number of papers returned from asearch of “nanotube”, from the extensive ISI Web of Knowledge database athttp://portal.isiknowledge.com
Why is there such an intense interest in the field of carbon nanotubes? Compared
to other nano-scale objects CNTS are well defined elongated structures that are
basically one-dimensional systems. There were early theoretical predictions of
unique properties which were proposed before they were measured. These predic-
tions included exotic mechanical and electronic properties that have since spurred
much new theoretical and experimental work on these unique objects.
6
1.4 Properties
An early and fascinating prediction of CNT electronic properties was that depend-
ing on tube chirality and diameter, CNTs would be metallic, semiconducting, or
insulating [60, 149]. This theoretical prediction was experimentally verified a few
years later [158]. Metallic MWCNTs have been shown to have amazing elec-
trical transmission properties, experimentally carrying 109 A/cm2 compared to
∼ 105 A/cm2 for some metals [171]. An excellent in-depth review of other unique
CNT electronic phenomena including field emission, optical, and contact proper-
ties is summarized in [3].
The mechanical properties of CNTs have attracted as much or more interest than
their electronic properties. The carbon-carbon sp2 bond found in graphite, and
hence in CNTs, is one of the strongest in nature. The in-plane elastic constant
for graphite is very stiff C11 = C22 ≈ 1.06 TPa [2, 169]. Axially, CNTs share this
same stiffness as evident by their high Young’s modulus. Computational results
include a tight binding study that reports a Young’s modulus (E) of 1.22 TPa,
and a DFT study that reports E = 1.1 TPa (tight binding and DFT methods will
be discussed further in § 2.4) [69].
It is possible to perform tensile loading experiments on CNTs between two atomic
force microscopy (AFM) tips, monitored in real time with electron microscopy
[15, 173, 174]. Measurements on SWCNTs have yielded Young’s moduli around
∼ 1 TPa, ultimate tensile strength (UTS) of ∼ 66 GPa and strain at failure (ǫf )
of 2 − 19%2. The average experimental Young’s moduli match quite well with
theoretical studies mentioned above. The CNT specimens that were tested were
quite short and were possibly defect-free. A detailed theoretical study of failure
of ideal CNTs shows that brittle or ductile failure (depending mostly on chirality)
occurs at strain levels above 17%. [34]. The introduction of defects into CNTs
can greatly reduce their their ultimate tensile strength [124].
Only when one compares these quoted CNT material properties to other materials
7
does one realize their unique mechanical qualities. One can compare them to steel’s
material properties (e.g. high strength tempered 4340 steel alloy, E = 12 TPa,
UTS = 1.7 GPa, ǫf = 12% [22]). It is wiser to compare CNTs material proper-
ties to other fiber-like materials like carbon fiber rather than three-dimensional
solids such as steel. Carbon fibers are polycrystalline graphitic fibers used heavily
in polymer-matrix composites for aerospace and other high-performance appli-
cations. Typical values for the Young’s modulus, tensile strength and strain at
failure are E = 0.3-0.7 TPa, UTS = 1.5-4.8 GPa, ǫf = 0.1-2.0 % [19, 22]. For
the high end of these quoted values carbon nanotubes are ∼ 1.4, 13.6, 9.5 times
greater respectively, and considering that carbon fibers are ∼ 1.5 times denser
than carbon nanotubes (ρcnt = 1.3 g/cm3 [19]), the specific strength and stiffness
properties of CNTs are even more impressive.
Currently the mechanical properties of individual carbon nanotubes do not fully
transfer to macroscopic bulk applications. It is possible to take the short length
sections of CNTs and spin them into a nanotube yarn as shown in Figure 1.5
A, and this yarn can be twisted or braided into rope like structures as shown in
Figure 1.5 B and C [176]. The mechanical properties of these braided fibers have
an ultimate tensile strength an order of magnitude less than the individual CNTs.
While stiff and strong axially, CNTs are quite compliant radially. They are flex-
ible enough to be curled into tight circles (as seen in bucky paper discussed be-
low), kinking if bent too much, but reversibly and elastically popping back when
straightened, seemingly without any defects [43, 169]. Compared to carbon fibers,
which are extremely brittle in bending, CNTs show remarkable bending resilience.
2The discussion of material properties such as the ultimate tensile strength or the Young’smodulus, necessitates a definition of cross sectional area. For carbon nanotubes the cross sec-tional area is not well defined. Since SWCNTs are an atomic layer thick and basically circularin cross section the area can be defined as A = 2πRδR where R is the radius of the tube and δRis the wall thickness. Should this value be the diameter of the carbon nuclei, or some functionof the electronic cloud spatial extent? Traditionally the value of δR = 3.4 A has been employed.
8
Figure 1.5: Carbon nanotubes twisted and braided into ropes, from [176]
1.5 CNT-based applications
The outstanding mechanical properties of CNTs in nanoscale devices or composite
materials remains a powerful motivation for the research. Carbon nanotubes have
been successfully used as atomic force microscopy (AFM) tips [26, 58]. In a sim-
ilar apparatus to a AFM setup, CNT tweezers composed of two nanotubes have
been used to selectively pinch and manipulate a target nanoparticle by applying a
voltage bias across the two tubes [85]. These tweezers when closed stick together
until a common polarity voltage is applied to them. Other electromagnetic based
CNT devices have been fabricated, including relays and switches that usually take
the form of a CNT cantilevered over a well or a fixed-fixed CNT over a well. Ac-
tuation or closing happens upon the application of a voltage bias and subsequent
deformation of the tube to a closed state [86, 150]. A form of non-volatile random
access memory that has a similar structure to these switches has been described
in literature [144] and is being produced by Nantero for sale in 2007 [78].
A very interesting mechanical aspect encountered in MWCNTs is the possibil-
ity of very low friction between the walls, which can provide for a very smooth
relative linear or rotational motion between a nested pair. This idea motivated
9
the possibility of CNT bearings [29, 94, 176]. Both rotational and linear bearings
have been modeled and tested. Relative linear translation of an inner tube being
partially pulled out of its shell have been performed with an AFM tip attached to
the inner tube while the outer tube is held rigid. Repeated in and out motion of
the inner tube did not deform or induce defects in the structure as viewed under
an SEM. During this same experiment it was noted that the inner tube quickly
retracted inside its parent SWCNT when the tube broke free of the AFM tip.
This rapid retraction induced by vdW forces led to the idea of a MWCNT system
that could sustain mechanical oscillation of an inner tube in the GHz range. This
proposed system consists of a short capped carbon nanotube nested in a lengthier
counterpart [104, 177, 178].
1.6 Role of van der Waal forces
The unique mechanical properties of carbon nanotubes indirectly lead to an un-
usual role of the forces between them. Like adjacent sheets of graphite, nanotubes
only occasionally form chemical bonds between one another and instead interact
via vdW forces. These “supramolecular” interactions are normally weak and, for
most molecular species, are easily overcome by thermal agitation (see the case of
proteins). However, they turn out to be significant in case of nanotubes, building
up very strong attractive forces over the extensive aligned contacts.
Because of their hollow structure and large aspect ratio, forces or interaction be-
tween them can cause deformations of bending, torsion, flattening polygonization.
A purified form of raw CNTs known as bucky paper is shown in Figure 1.6 (a).
The spaghetti-like agglomeration with highly curved structures is particularly in-
triguing in view of the strong covalent carbon-carbon bonding, which confers to
CNTs high mechanical stiffness. This characteristic feature is attributed not to
thermodynamic fluctuations, but rather to vdW forces between tubes, which pre-
vent these loops from unfolding. Another outstanding example of this balance
10
of the strain energy from bending and the van der Waals attraction is shown in
Figure 1.5 (b) [113]. Here we see closed loops of SWCNT bundles turned onto
themselves and stuck in a stable loop. It was reported that there is a critical
radius of 0.03 µm for SWCNT loops at room temperature. We will address this
phenomenon later with some sample calculations.
(a) (b)
Figure 1.6: (a) Bucky paper composed of numerous CNT bundles, from [77] (b)Stable curled CNT bundles
The supramolecular vdW forces in CNT systems also cause SWCNTs to self-
organize in ropes or bundles [80, 160]. The tubes in a bundle organize into a
triangular lattice (see Figure 1.7) with an approximate wall to wall distance of ∼ 3
A [160]. Interestingly these bundles seem to grow together with all tubes being
essentially the same length, and it has been noted that all the tubes constituting
a bundle have essentially the same diameter and only one or two chiralities [28].
Another interesting manifestation of vdW interactions is the experimental finding
that large diameter SWCNTs can be found collapsed, losing their circular cross
section and having opposite inner faces sticking together [125]. For two SWCNTs
larger than 20 A in diameter there is a flattening of the once circular cross section
along the edge of contact between one another as predicted by an empirical model
[145], or on a flat surface such as graphite [70]. For nanotube bundles with tube
diameters larger than 20 A the circular cross section can even become hexagonal
11
Figure 1.7: Typical CNT bundle showing cross sectional packing pattern [80]
in character [109, 159].
1.7 Motivation
Nanometer scale devices, while currently not outside the realm of manipulation or
measurement, pose unique hurdles to the fuller understanding or their function-
ality due simply to their diminutive size. Computational models of nano-devices
can be utilized to perform digital experiments that offer further insight or moti-
vation for experimental work. The atomistic simulation of carbon nanotubes is
quite common at many levels of theory, but almost all of these models, even some
of the most advanced, lack the vdW attraction. We have seen the importance of
vdW forces in many of the examples discussed above. In order to model and gain
insight for example into the possibility of efficiently gluing nanotubes by weak
supramolecular forces in a self-assembly process or the performance of MWCNT
bearings and oscillators it is important to have a detailed understanding of these
forces and an accurate model to describe them.
As nanomechanical systems typically involve a substantial number of atoms, it is
computationally convenient to adopt a classical description of the covalent bond-
ing. However, there are several reasons to favor a quantum mechanical treatment:
12
Firstly the interactions of supramolecular bodies are intrinsically coupled with
their own mechanical properties. For example, the degree of deformation can en-
hance (as in the case of partial polygonization of nanotubes in the array bundles)
supramolecular attraction, or decrease it under other circumstances. Thus, the
main motivation is the need for an accurate description of the interatomic interac-
tions. It is known that classical potentials (typically fitted to the bulk form) often
do not yield reasonable results. For instance, the second generation bond–order
potential of Brenner [17] yields elastic moduli that deviate significantly from the
tight-binding and density functional theory data [69, 81].
Secondly, since the electronic subsystem is treated explicitly, the characteristic
corrugation between nanotubes can be realistically accounted for. The widely
used classical treatment of Girifalco et al. [54, 53] does not properly capture this
effect although it offers an overall good energetic description of binding beyond
the covalent range.
Finally, an additional motivation is the renewed interest in TB modeling at larger
scale using the recently proposed objective molecular dynamics scheme [35, 73].
This method heavily reduces the computational effort through a drastic reduction
in the number of atoms to be accounted for and thus makes tractable tight binding
plus vdW nanomechanical modeling.
The purpose of this work is to extend an accurate tight binding modeling of the
covalent bonding with the long-range vdW attraction. This pursuit starts with a
discussion of the nature and variety of bonding in Chapter 2, including a discussion
of the levels of theoretical and mathematical models available. This background
leads into Chapter 3 where the structure of graphite and CNTs is pursued be-
yond their brief introduction above. Chapter 3 also details those properties that
are related to the interlayer vdW interactions in graphite from experimental and
computational techniques. This data set is used to evaluate the performance of
existing graphite models that are reviewed in Chapter 4. The proposal for a tight-
13
binding plus dispersion model is presented in the last chapters along with a study
of intertube interactions in MWCNTs and CNT bundles, and is finished with an
implementation of tight-binding plus dispersion molecular dynamics.
14
“All things are made of
atoms, little particles that
move around in perpetual mo-
tion, attracting each other
when they are a little distance
apart, but repelling upon be-
ing squeezed into one an-
other”
Richard Feynman
2Nature of bonding and mathematical
models
In the study and simulation of atomistic scale systems it is an obvious prerequisite
to understand the physical models that describe the interaction between the atomic
constituents. This chapter serves as a general non-mathematical introduction to
the concept and variety of atomistic bonds relevant for graphite and CNTs, with
a particular focus on the variety of physical bonding. These concepts will be
expanded in later chapters.
15
2.1 Bonding
A bond in the everyday macroscopic sense is something that fasten things to-
gether. It is quite the same on the atomistic scale and can be described as an
energetically favorable state. This state is described by the stable and finite sep-
aration of at least two atoms under the mutual influence of one another. In order
to understand some fundamental aspects of a bond, consider the simple example
of two interacting rare gas atoms. The solid line plotted in Figure 2.1 shows the
generic form of the interaction energy of this two-atom system as a function of
internuclear separation. The interaction energy is defined as the energy in excess
or recess of this two-atom system with respect to the energy of the two atoms iso-
lated from each other ( i.e. the energy of isolated atoms is defined as zero). There
0.4 0.5 0.6 0.7 0.8 0.9 1−2
−1.5
−1
−0.5
0
0.5
1
1.5
2
Internuclear Separation
Ene
rgy
Repulsion
Attraction
r
Ecoh
Figure 2.1: Characteristic energy vs. internuclear separation for a simple rare gasdimer
is a distinct minima in this plot. The energy at this point is lower than that
of the atoms at infinite separation and corresponds to a energetically favorable
state, which is a bond by our previous definition. The magnitude of the energy
at this minima (Ecoh) is the cohesive energy, and can be thought of as the energy
required to break this bond. The distance r at which this minima occurs is simply
the equilibrium bond length.
The nature of a bond intrinsically contains two complementary components: an
16
attractive, and a repulsive, component. This statement can be supported by a few
obvious points. First, it is evident that solids exist so there must be an attractive
component to bring atoms together. At the same time we are aware of the finite
density of solids, implying a repulsive component to keep matter from collapsing
upon itself. The two labeled dashed lines in Figure 2.1 show these two components
of the total interaction energy. When added together they simply sum to the solid
total interaction energy curve.
This view of bonding, while not rigorous, paves the way for thinking about more
difficult concepts. It leads logically to the question of the nature and variety of
these energetic interactions. Are the bonds between two argon atoms the same
as the bond between two carbon atoms? Over what length and energy scales do
they form?
2.2 Atomistic Forces
The starting point in a discussion of relevant forces at the atomic scale is a list
of the fundamental forces as given by the standard model: strong nuclear, weak
nuclear, gravitation, and electromagnetic forces. To what extent do these forces
play a role in bonding between atoms? The strong nuclear force is responsible for
the cohesion of the nucleus and has an extremely short range, as does the weak
nuclear interaction. Both of these forces act on the scale of 10−3 A, which is a few
orders of magnitude less than even one of the smallest interatomic bond length of
∼ 0.8 A for H2 [115]. These forces play no role in the bonding between atoms.
Gravitation on the other hand acts over longer ranges than the strong and weak
nuclear force. Understood in Newton’s classical sense, the gravitational interaction
energy between bodies separated by a distance r is
Ug(r) =Gm1m2
r. (2.1)
17
The gravitational energy is quite small on the atomistic scale. Take for example
two argon atoms (6.63× 10−26 Kg) separated by 5 A, which is approximately the
nearest neighbor distance in solid argon [4]. At this distance the gravitational
potential energy between the two atoms is −5.8×10−52 J, whereas experimentally
the potential energy at this same distance is −5.9 × 10−22 J [89]. The gravita-
tional potential energy is thirty orders of magnitude less than the experimental
measured energy. By inductive reasoning we are left with the electromagnetic
force as the dominant force in atomic bonding [89]. The manner in which the
numerous charged particles that constitute atoms interact electromagnetically is
diverse and will be addressed in the following section.
2.3 Electromagnetic Cohesion
Atomic cohesion due to the interplay of electronic charge can generally, but not
exactly, be classified into chemical and physical bonding. The general length scales
for these bonds are 1.5-3.0 A, and 3.0-5.0 A respectively.
The chemical, or covalent bond, is quantum mechanical in nature and is character-
ized by a directional (i.e. anisotropic) short range bond that is immensely strong.
In this chemical bond there is an overlap of the electronic clouds and sharing of
electrons. Details on the quantum mechanics and computational modeling of co-
valent bonds are described in later in Chapter 5. The rest of this present chapter
is dedicated to further explanation of the physical bond.
The physical bond, while still electromagnetic in nature, is a weaker, generally
isotropic bond that equilibrates at a greater distance than covalent bonds and
generally does not contain any charge overlap. Before discussing more about
the physical bond, the repulsive contribution to bonding for both chemical and
physical bonding is addressed.
18
2.3.1 Repulsive force
For neutral species the origin of the repulsive force that occurs at small distances
is based on the Pauli exclusion principle which states that no two fermions can
occupy precisely the same quantum state [4]. Phenomenologically explained there
is a huge energy penalty when two particles are very close to one another. As two
atoms are brought together their electron clouds tend to diminish along the inter-
nuclear axis and the electronic screening of the nucleus that was supported before
degrades and a repulsive Coulomb force between the exposed nuclei pushes away.
This is a very short-range interaction and energetically decays approximately as
1/r12. Of course another form of repulsion is simple Coulomb repulsion between
like charged particles that interact over long distances according the inverse square
law Fc ∝ q1q2/r2.
2.3.2 Attraction in physical bonding
The attractive component of a physical bond can be classified into the attraction
due to Coulomb interaction of ions and that between neutral species. The physi-
cal attraction between neutral species is more nuanced than the simple Coulomb
repulsion/attraction. Current understanding of this physical attraction has its
historical roots in the study of gasses.
2.3.3 Van der Waals forces and non-ideal gasses
The ideal gas law, used to describe the properties of gasses, fails when the gas is
not sufficiently dilute. Gas particles in a dense gas are more concentrated, and
hence closer together. In denser gasses the gaseous constituents spend more time
in the range of physical bonds, and deviations from the ideal gas law occur due to
this interaction. In this regime the effects of physical bonding must be considered
to properly model the gas. In 1873 Johannes van der Waals proposed a new
equation of state that captures the ideal gas deviations attributed to the physical
19
attraction, and finite size of molecules. Its form is
(P +
a
V
)(V − b) = kBT. (2.2)
In this equation P is the pressure, V is the molar volume, kB and T are Boltz-
mann’s constant and temperature respectively. These four quantities are found
in the ideal gas law. The van der Waals equation has two parameters in addition
to these: a and b. Where a is a measure of the physical attraction between the
molecular constituents, and b is a measure of the molecular size. The attractive
physical force described by a has been given the general moniker “van der Waals
force”. The physical vdW force between neutral atoms/molecules is classifiable
into three distinct groups with equally distinct mechanistic origins.
2.3.4 Classification of van der Waals forces
The three effects that are collectively referred to as van der Waals forces can
be classified as orientational (Vo), inductive (Vi), and dispersive (Vd) interactions
[116]. Their sum is referred to as the total van der Waals energy as shown in equa-
tion (2.3). In their simplest incarnation are expressed in terms of the internuclear
separation r of two atoms/molecules
VvdW (r) = Vo(r) + Vi(r) + Vd(r), (2.3)
and the total vdW energy for a system of N particles is expressed as a pairwise
summation over all constituents
UvdW =1
2
N∑
i,j
VvdW (rij), (2.4)
where rij is the distance between atoms i and j and the factor 1/2 accounts for
20
double counting.
The orientational term (Vo) arises if both molecules have a permanent dipole
µ. Dipoles can favorably rotate into a head to tail state, and come together to
lower their interaction energy. Figure 2.2 (A) schematically shows two dipoles
favorably orientated. In the gas phase these molecules are rotating, and at high
temperature they are spinning so rapidly that the dipole effectively disappears
and the interaction energy will tend toward zero. In 1912 W.H. Keesom showed
that at finite temperature, since the probability of a given orientation of dipoles
can be determined by a Boltzmann factor, one can integrate over all orientations
and come to [116]
Vo(r) = − 1
r6
2µ21µ
22
(4πεo)2(3kBT ). (2.5)
µ1 and µ2 are the magnitudes of the dipoles on atoms 1 and 2 that are separated
by a distance r, εo is the permittivity of free space, and kB and T are Boltzmann’s
constant and temperature respectively. We see that the dependance is 1/r6 and in
the limit of high temperature the orientational dipole-dipole energy tends toward
zero.
+− +− +− +−+−+−
A B C
Figure 2.2: Schematic representations of van der Waals energy contributions.Elongated molecules are permanent dipoles, spherical molecules are non-polaratoms/molecules with an induced dipole. (A) - dipole/dipole orientationalinteraction, (B) - dipole/induced-dipole inductive interaction, (C) - induced-dipole/induced-dipole dispersive interaction
Induction is an effect between a dipolar molecule and a non-polar molecule, as
shown in Figure 2.2 (B). The electric field of the dipolar molecule induces a dipole
21
in the non-polar molecule resulting in an effective dipole-dipole effect, as discussed
above. The ability of a molecule or atom to become induced into a dipole is know
as its polarizability. The static polarizability is simply the proportionality between
the induced dipole and the electric field strength E . The interaction is given by
[27, 116]
Vi(r) =1
r6
1
(4πεo)2(−µ2
1α2 − µ22α1). (2.6)
Finally, the motivation for a third vdW term in addition to the orientation and
inductive effects was to explain the presence of non-ideal gas behavior in non-polar
gasses. The dispersion effect, the last of the three to be elucidated, turned out to
be a completely quantum mechanical phenomenon that is more or less correctly
explained by being a collective interaction of instantaneous dipoles that are mutu-
ally induced. Further details on the quantum mechanics of this phenomenon will
be presented in Chapter 6. We will see there that the general relationship and
dominant term for this dispersive interaction energy takes the form
Vd(r) ≈C
r6, (2.7)
where C is a dispersion constant which will be discussed much more later.
Among these three terms in the van der Waals energy, the dispersion term is
the dominate contribution in most cases [116]. For example HCL(g), which has a
permanent dipole moment, has a intermolecular dispersion contribution which is 6
times greater than the orientational term and 20 times greater than the induction
term [116]. It is for this reason that while there are three collective vdW terms,
they are often referred to as vdW or dispersion forces in a synonymous sense.
22
2.4 Mathematical models
We have introduced chemical and physical bonds and now address the methods
of modeling them as they pertain to a set of atoms. For the general purpose
of atomistic models Schrodinger’s wave formulation of non-relativistic quantum
mechanics is a sufficient level of theory to capture almost all physical properties.
In 1928 Paul Dirac said of this equation
“The underlying physical laws necessary for the mathematical the-
ory of a large part of physics and the whole of chemistry are thus
completely known, and the difficulty is only that the exact application
of these laws leads to equations much too complicated to be soluble.”
The high fidelity of Schrodinger’s formulation comes with a computational price,
and deviations from this theory are either necessary for efficiency, or the neces-
sary information can be attained with a reduced (albeit less exact) formulation
depending on the property being calculated. Computational atomistic models can
be broken into three main methods in increasing accuracy and decreasing compu-
tational speed: empirical, semi-empirical and first principles.
2.4.1 Empirical
Empirical models consist of assumed or educated guesses at the functional form
of the energy dependance expressed in terms of relative ionic coordination. These
functions contain free parameters that are fitted to experimental or first principles
data on the representative material. The worst case scenario for these type of
energy functions is that they reproduce the fitted data points only. The quality of
an empirical potential can be judged on its predictive powers and transferability.
For example if a potential is fitted with the equilibrium spacing and cohesive
energy of solid argon, and then goes on to predict other measurable quantities of
argon such as bulk modulus or phonon frequencies, it is said to be predictive. On
23
the other hand, the potential is said to be transferable if it is good at describing
properties in different crystal structures or phases. For example, if a potential
is fitted to the face centered cubic (FCC) structure and the potential accurately
describes the bulk modulus of the diamond structure, the potential has a degree
of transferability.
Empirical potentials can be classified by their arguments. A two-body potential
depends only on the relative orientation of two atomic constituents. Higher order
many-body potentials depend on the relative orientation of more than two atoms at
a time. A well known empirical two-body potential is the Lennard-Jones potential
otherwise known as the 6-12 potential. Two equivalent forms of this potential are
φ(r) = 4ǫ
[(σr
)12
−(σr
)6]
=C12
r12− C6
r6, (2.8)
where ǫ and σ are constants, as are C6 and C12 in the second form. These constants
have different values for different atoms.
Empirical potentials are computationally tractable. The functions can be readily
calculated and, for the same cpu time, a much larger system can be considered than
with higher order formulations of energy calculation. The number one drawback
is the lack of predictive power.
2.4.2 Semi-empirical
Semi-empirical methods are based on the first principles formulation of quantum
mechanics. In some models terms are deemed negligible and ignored, in others
certain aspects of the calculations are parameterized. The tight-binding (TB)
method is a popular and much used semi-empirical method. The details of this
method are covered in Chapter 5. This and other semi-empirical methods have the
advantage of describing quantum mechanical phenomena that cannot traditionally
be attained from empirical potentials, and they tend to be computationally nimble
24
compared to first principles methods.
2.4.3 First principles
First principles, or ab initio calculations represent the pinnacle of electronic struc-
ture calculations. Starting with the fundamental constants and Schrodinger’s
equation as a postulate these methods proceed to describe the nature of atomistic
systems to a degree that is almost irrefutable. Computational resources and meth-
ods have come a long way since the time of Paul Dirac’s quote, and some of these
complicated equations referred to have in fact become readily soluble. But the
shear complexity of the many electron problem governed by Schrodinger’s equa-
tion remains burdensome for large atoms or several 10s of main group elements.
While these methods are rigorous in describing physical effects they are computa-
tionally expensive. The methods applied in solving Schrodinger’s equation break
into two main types: Hartree-Fock (HF) based methods and density functional
theory (DFT) methods. While both make approximations to make calculations
possible, they represent the best available methods for atomistic modeling. As
we will see latter the most crucial approximations in these methods entail the
electron-electron interactions. These interactions are the basis of van der Waals
attraction and the more common of these first principles methods fail to capture
this effect as further discussed in § 3.6.
25
“Let no man ignorant of ge-
ometry enter here”
Above the door to Plato’s
Academy
3Graphite structure and interlayer
properties
Our goal as stated is the development of an accurate model of graphite with
an emphasis on the interlayer properties related to dispersion forces. With this
goal in mind this chapter reviews details of the graphite structure, followed by
experimental measurements focusing on properties that pertain to the dispersion
forces between graphene layers, and insights provided by ab-inito calculations.
26
3.1 Graphene
As mentioned earlier the graphene plane consists of carbon atoms trigonally bonded
to neighbors in a sp2 hybrid network of covalent bonds. The in-plane hybridiza-
tion results in σ bonds. Normal to the sp2 plane are half filled un-hybridized pz
orbitals, that form weak π bonds between neighboring sites in-plane. The con-
tribution of these π bonds to the in-plane binding energy is very small and are
usually referred to as non-bonding [83]. The σ bonds in graphene is one of the
strongest bonds known [148]. Figure 3.1 (a) shows a more detailed structure of
the graphene hexagonal lattice shown in the introduction. The solid light lines
between atoms in this figure represent the chemical bonds. For both graphite and
carbon nanotubes the carbon-carbon bond length is ac−c = 1.42 A [7, 51]. The
dark arrows denote the primitive lattice vectors and together with the other two
heavy lines form the unit cell. The primitive lattice vectors, ~a1 and ~a2, defined in
terms of the cartesian coordinate system are given by
~a1 =a
2
1√
3
0
and ~a2 =a
2
1
−√
3
0
, (3.1)
where a is the magnitude of the primitive lattice vector, a = |~a1| = |~a2| =√
3 ac−c = 2.46 A. Each primitive unit cell contains two atoms labeled 1 and
2 shown in Figure 3.1 (a). The position of any of the atoms of type 1 or 2 can
be referenced with respect to the tip of the primitive lattice vectors by their basis
vectors. The basis vectors for atoms 1 and 2 are, ~B1 = (1/3)~a1 + (2/3)~a2 and
~B2 = (2/3)~a1 + (1/3)~a2, and the positions of an arbitrary atom of type 1 or 2 is
~R1 = n1~a1 + n2~a2 + ~B1, and ~R2 = n1~a1 + n2~a2 + ~B2, where n1, n2 ∈ Z.
The structure of graphene sheets is known through direct measurement via nu-
merous experimental methods. One of the more illustrative measurements is an
atomic force microscope (AFM) scan of a graphite surface, as shown in figure 3.1
27
2
1
ac−c
~a2
~a1
y
x
(a) (b)
Figure 3.1: (a) 2 dimensional graphene layer. Heavy dark line indicates primitiveunit cell composed of two atoms 1 and 2. (b) AFM scan of graphite surface withoverlay of a orientation of a single hexagonal ring, from [67]
(b). The AFM measures force between the AFM tip and the sample. High force
feedback indicates a localization of electrons, and hence the force contours can
approximately be interpreted as electron densities. With this interpretation the
figure shows a strong localization of electrons along bonds and nearly no electrons
in the open areas of the hexagons. This is in direct accordance with the nature of
covalent bonding and the sharing of electrons along the bond direction.
3.2 Graphite stacking patterns
The graphite crystal is characterized by many sheets of graphene stacked upon
one another. There are of course many ways in which these planes can be aligned.
The simplest form of stacking is known as AAA or simple hexagonal, and is shown
schematically in the left hand side of Figure 3.2. In this stacking pattern all carbon
atoms have neighbors directly above and below themselves.
A second stacking possibility is shown in the right hand side of Figure 3.2 and is
known as the ABA stacking pattern. In this structure half of the carbon atoms
in a graphene sheet have neighbors directly above and below themselves, and
28
A
A
A
B
A
A
Zeq
Simple Hexagonal Bernal
Figure 3.2: Graphite stacking patterns. AAA on the left and ABA on the right.Dashed lines denote atoms that are aligned with the z-axis.
the other half are situated such that directly above and below them is the open
center of a hexagon. If one starts with an AAA stacking pattern and translates
every other layer by twice the bond length along one of the carbon-carbon bond
directions the result is an ABA stacking pattern. The ABA stacking pattern is
sometimes referred to as the Bernal form, the namesake of its founder [11]. A
third stacking pattern, which is not illustrated here, is the ABC or rhombohedral
stacking. In ABC stacking half of the carbon atoms are directly below a open area
in the above hexagon and directly above a carbon, and vice versa for the other
half of the carbon atoms. Of course there are graphitic structures in which there
is no apparent stacking pattern, where the various layers are randomly rotated or
offset. These disorder stacking patterns are referred to in literature as turbostatic
[8, 47].
Graphite found in nature is usually graphitic carbon. Graphitic carbon is charac-
terized by small domains of graphite that are lumped together in random orien-
tations. Some samples of large natural graphite crystals have been collected and
tested to find that approximately 90 percent of the graphene layers are stacked
in the ABA pattern, and mixed in with it is the less common rhombohedral or
29
ABC stacking [23]. A very pure form of graphite termed highly orientated py-
rolytic graphite (HOPG) is synthetically manufactured. These graphite crystals
are composed of nearly 100 % ABA stacked planes [83].
For HOPG the interlayer spacing is well set at Zeq = 3.354 A for room temperature
[7, 8, 51], and shrinks slightly at low temperature (3.336A at 4 K) [7]. For other
less refined natural graphite and disordered turbostatic graphite the interlayer
spacing increases slightly to Zeq = 3.43 A [24, 47].
3.2.1 Corrugation of graphene planes
The stacking patterns discussed in the previous section highlight that the ABA
stacking seems to be the most energetically favorable. The AAA stacking on the
other hand has not been experimentally identified and is the least energetically
favorable stacking (as shown by DFT-LDA calculations [24]). The reason for
this can be attributed mainly to the nature of the pz orbitals sticking out of
the graphene planes. In the AAA stacking all of these pz orbitals would be at
a maximum amount of overlap possible. Any other stacking pattern than AAA
would have less pz overlap and would be more energetically favorable. Another
AFM scan of a graphite surface, this time shown from an oblique angle, is shown
in Figure 3.3. The discrete nature of the pz orbitals is evident and will be referred
to as a corrugated surface.
Figure 3.3: AFM scan of graphite showing Pz orbital corrugation, from [67]
30
3.2.2 Graphite unit cell and Brillouin zone
The graphite unit cell is similar to that of graphene but with a third primitive
lattice vector given by
~a3 = 2Zeq
0
0
1
. (3.2)
The ABA graphite unit cell has four atoms. The first two share the same defini-
tion as those of graphene and the other two have basis vectors ~B3 = (1/3)~a1 +
(2/3)~a2 + (1/2)~a3 and ~B4 = ~a1 + (1/2)~a3. The reciprocal lattice vectors of the
direct hexagonal lattice are
~b1 =2π
a
1
1√3
0
~b2 =2π
a
1
−1√3
0
~b3 =π
Zeq
0
0
1
, (3.3)
where the primitive and reciprocal lattice vectors must satisfy ~bi ·~aj = δij [4]. The
first Brillouin zone is given by the Wigner-Seitz unit cell of the reciprocal lattice
and is shown in Figure 3.4 (b) with some high symmetry points labeled.
A graphene sheet by itself is a zero gap semiconductor as can be seen by the
touching of the π bands at K [165]. When the graphene sheets are brought together
the interaction between layers is such that these bands split and graphite becomes
a semi-metal [165].
3.3 Graphene as extended molecules
Due to the dichotomy in the magnitude of intra and interlayer bonding (i.e. very
strong and very weak) it makes sense to talk about the two separately. Linus
Pauling said of graphite, “each of the layers is a giant molecule, and the superim-
31
(a) (b)
Figure 3.4: (a) 2 dimensional graphene layer. Heavy dark line indicates primitiveunit cell composed of two atoms 1 and 2. (b) AFM scan of graphite surface withoverlay of a orientation of a single hexagonal ring, from [67]
posed layer molecules are held together only by weak van der Waals forces” [133].
If we restrict ourselves to looking only at the interlayer properties we can gain
some insight into the dispersion energy between layers. By treating the layers as
extended molecules the interaction energy between them can be plotted versus the
interlayer spacing. This energy curves looks very similar in form to the diatomic
energy curve shown early in Figure 2.1. In these type of plots, which we will see
more of, the energy is referred to as the interlayer energy per atom and is de-
fined as E(Z) = [E(Z)−E(∞)]/Natom, where the tilde denotes interlayer energy.
The location of the minima of this curve corresponds to the equilibrium interlayer
spacing, Zeq = 3.35 A. The well depth in this interlayer energy picture is the exfo-
liation energy (Exf = E(Zeq)) and is the energy to dissociate the graphene sheets.
We refer to this as the exfoliation energy because we reserve the term cohesive
energy to refer to the energy to disassociate all carbon atoms in graphite rather
than just the layers.
32
3.3.1 Experimental exfoliation energy
The exfoliation energy is a difficult quantity to experimentally measure and there
seem to be but three experimental methods to measure the exfoliation energy of
graphite. A short description of the methods is presented in chronological order
in the following sections.
3.3.1.1 Girifalco method
The early and often quoted value by Girifalco was done by a heat of wetting
experiment that was apart of a doctoral thesis. The general idea of this experiment
comes from a relation that the cohesive energy is a function of the surface area
and the heat evolved when wetting (adding water to graphite is exothermic). The
average value over 8 experiments is reported as 260 ergs/cm2. Using the number
density of carbon atoms in a graphene plane (ρ = 4/√
3a2 = 2.46 atoms/A2) the
exfoliation energy per atom via the heat of wetting method is 42.5 meV/atom.
3.3.1.2 Benedict method
The second measurement was an experiment done on collapsed multi-walled nan-
otubes MWCNT. Benedict et al. noticed collapsed MWCNT of large diameter
during TEM measurements. This collapse was noted to occur along the length of
the tube and the caps at the end of the tubes were “bulbed” out. They proposed
that the cohesion seen in the middle was balanced by the strain energy in the
bulbous ends. Using the mean curvature modulus of graphite with an elastic con-
tinuum theory for the strain energy description balanced with a LJ potential they
calculated a value for the exfoliation energy of graphite of 35 ± 15 meV/atom.
3.3.1.3 Zacharia method
The third experimental result was a recent set of experiments of thermal desorption
(TD) rates of polyaromatic hydrocarbons (PAH) physisorbed on HOPG surfaces
33
under ultra high vacuum [175]. The chemical structure of PAH’s is very similar to
that graphite, sharing similar orbital hybridization and bond lengths. Large PAH’s
are simply small flecks of graphite with hydrogen bonds on the edges. This group
utilized an Arrhenius rate equation to study the activation energies were found
for successively larger PAH’s based on their experimental data. In their analysis
the effects of the hydrogen on the PAH’s was subtracted and a generalization of
52 meV/atom was found to be the binding exfoliation energy.
3.3.1.4 Exfoliation energy summary
Table 3.1 summarizes these three experimental values. Within the bounds of their
reported errors the exfoliation energy ranges between 20 and 57 meV/atom. The
lower bound defined by the Benedict model is suspect based on their use of a
continuum model for the strain energy. It seems that the other two methods
might hold a little more weight, placing the exfoliation energy around the mid to
high 40’s of meV/atom.
Exf Method Reference (et al.)[meV/atom]
43 ± 5 Heat of wetting Girifalco [54]35 ± 15 Tube collapse Benedict [9]52 ± 5 Thermal desorption Zacharia [175]
Table 3.1: Summary of experimental graphite exfoliation energies in meV/atomlisted in chronological order
3.4 Z-axis compressibility
The z-axis compressibility (kz) is related to the c33 elastic constant via [161]
c33 = k−1z . (3.4)
This property is described by the curvature of interlayer energy curve around the
34
equilibrium separation. The formal definition of this property and its calculation
are shown later in § 8.3.3. Experimental results on HOPG via ultrasonic testing
have found the compressibility to be 2.74(±0.0075)×10−12cm2dyne−1 at standard
temperature and pressure [16, 128]. The compressibility is a strong function of
pressure and temperature [50]. For example at a pressure of 20 kbar and standard
temperature it decreases to 1.85 × 10−12 cm2 dyne−1. At standard pressure and
low temperature (4 K) it decreases to 2.46 × 10−12 cm2 dyne−1.
3.5 Vibrational properties
Two vibrational modes in graphite highlight the dichotomy of intra and interlayer
binding and the latter gives us another more insight into the nature of interlayer
properties. A representation of the atomic motion for two active Raman modes
in question are shown in Figure 3.5. In plane the strong covalent bonds lead to
a high frequency E2g(2) mode that has a frequency of 46.3 THz, shown in Figure
3.5 (A) [162].
(B)(A)
Figure 3.5: Raman active modes: E2g(1), and E2g(2). The relative motion shownhere is adopted from [127]
The more interesting Raman active mode to us, is the rigid interlayer shearing
mode E2g(1). This mode is dominated by the weaker dispersion interactions and
35
overlap of Pz orbitals. Figure 3.5 (B) illustrates this motion. This mode is obvi-
ously of a much lower frequency than the latter due to the weaker physical bonding
involved and has been repeatedly measured at 42 cm−1 = 1.26 THz 1[2, 62, 119].
The E2g(1) shear mode is governed by the curvature of the energy landscape de-
scribed by the relative translation of every other graphene sheet relative to the
others.
3.6 Ab-initio studies of graphite
The calculation of graphite properties via a first principles treatment is something
to consider in addition to the experimental results given above. In § 2.4.3 two first
principles methods were mentioned Hartree-Fock self consistent field (HF-SCF)
methods and density functional theory. Both of these methods have been applied
to the graphite structure. The details of these methods will not be completely
explained here, rather a quick overview of main ideas and important points relating
to dispersion energy will be made
3.6.1 Hartree-Fock treatment of graphite
The defining idea of Hartree-Fock methods lies treating the electron-electron in-
teraction in a mean field manner. A particular electron sees the average field of
the other electrons but is unaware of their instantaneous positions. In real systems
electrons are instantaneously correlated with each other, leading to a lowering of
total energy. The difference between the HF-SCF energy and the “true” energy
Eo, is known as the correlation energy.
Ecor = EHF −Eo. (3.5)
Hartree-Fock methods by definition completely ignore the electronic correlation
1converting wavenumbers to frequency: vc = f , where c is the speed of light. 42 cm−1 ·3 × 1010 cm s−1 = 1.26 × 1012 Hz
36
term and thus do not model dispersion forces [5]. Corrections to HF theory to
account for electron correlations include Moller-Plesset many-body perturbation
theory (MP2/MP4), coupled cluster (CC), and configuration interaction methods
(CI) [64]. These HF corrections are very expensive and scale unfavorably. The
application of corrected HF-SCF theory to graphite is restricted due to issues of
imposing periodic boundary conditions in HF methods [?]. The closest simulation
to graphite using corrected HF methods is the calculation of the binding energy
of large PAH molecules due to vdW interactions [126].
3.6.2 Density Functional Theory treatment of graphite
Density functional theory is a reformulation of the electronic Schrodinger equation
in terms of the electronic density, ρ(~r), instead of the wave function. The theoret-
ical foundations of rigorously defined DFT rest on Hohenberg and Kohn’s (HK)
1964 paper [71], which are based on the earlier work of the Thomas-Fermi model
[90]. In this landmark paper two important theorems were stated. The first is
the proof of the existence of a functional mapping between the many-body wave
function and the electron density. Put in a differen’t way, the electron density
and the position of the nuclei completely determine the ground state energy. This
paper only proved the existence, but not the form of the energy functional, known
as the exchange correlation (XC) function. The second theorem in this paper is
the statement that the electronic distribution which minimizes the total energy is
the actual electronic distribution. This theorem leads naturally to a variational
treatment of the total energy with an extend basis set to describe the systems
at hand. A year later Kohn and Sham made some slight modifications to this
work in regard to the approximate functional dependance of the kinetic energy
with respect to a HF-SCF treatment [92]. This method is the basis for most
DFT calculations today and proceeds in a similar manner to HF-SCF methods.
This method has the benefit of correctly modeling electron correlation effects (and
hence vdW forces) if the functional form of the exchange correlation function is
37
known.
Unfortunately the exact universal form of the XC function is not known. Much
research on its form, has been done since the mid-sixties, and many forms have
been proposed. Some forms are extremely accurate for certain states of condensed
matter including the local density approximation (LDA), and varieties of the gra-
dient corrected methods (GGA). Both of these XC functions, and others used
fail to properly model long range interaction in rare gasses [135]. The reason for
this failure is in the fact that most XC functions are dependant on only the lo-
cal density of electrons and are not a function of electron densities at appreciable
distances . Graphite is a system that has a electron distribution that is sparse com-
pared to most condensed matter but much less so than gasses. A DFT treatment
of graphite shows some strengths but also some failings. Table 3.2 summarizes
results from a number of DFT calculations of graphite.
a Zeq kz Exf Eco
Method Ref. (A) (A) (×10−12 cm2
dyne) (meV/atom) (eV/atom)
LDA [31] . . . 2.8 0.97 110 . . .LDA [24] 2.45 3.3 . . . 20 . . .LDA [151] 2.451 3.36 4.11 25 8.80LDA [161] 2.453 3.44 7.69 30 8.60GGA [63] 2.461 4.5 0.01 3 7.87LDA-nl [146] 2.47 3.76 7.69 24 . . .Exp (4 K) 2.46a 3.336a 2.46b . . . . . .Exp (300 K) 2.46a 3.354a 2.74c Table 3.1 . . .
Table 3.2: Summary of DFT calculations of graphite including the lattice con-stant, equilibrium spacing, z-axis compressibility, exfoliation energy and in-planecohesive energy. aRef. [7], bRef. [50], cRef. [13]
All methods seem to treat the in plane properties quite well. For example the lat-
tice constant a is in very good accordance to experiment and the various in-plane
cohesive energies (Eco) agree quite well even between XC functions but unfortu-
nately doesn’t have an experimental measurement to compare to. These methods
seem to have trouble with interlayer properties. The GGA XC shows almost zero
38
interlayer binding energy and does not account for even some of the dispersion
energy between layers. Its lack of interlayer binding has been noted in literature
[52]. This failing of the GGA XC function probably coincides with the paucity
of published information on its application to graphite. The LDA methods show
some dispersive binding energy that is in the neighborhood of experimental values.
This dispersive energy rapidly goes to zero beyond Zeq and is generally exponen-
tial rather than 1/r6 [151]. The LDA methods also generally show the equilibrium
separation around the experimental value, though there are some outliers. All
examples shown do not accurately calculate the z-axis compressibility.
More recently there have been attempts to rectify the failing of XC functions for
long range interactions [91, 147]. An application of the method in [147] has been
applied to graphite and is shown in the table listed under LDA+nl, where the
nl stands for the “non-local” correction to the LDA [146]. While this represents
a more physical treatment by including non-local electron density information in
the XC function it still preforms poorly in the description of interlayer properties.
One interesting point that may be fortuitus, is the energy difference between
between AAA and ABA stackings at Zeq (referred to as ∆EABA−AAA) as calculated
by different DFT methods. Charlier reported 17 meV/atom under LDA [24], and
Kolmogorov reported 15 meV/atom for both the LDA and GGA approximations
[96]. This point will be referenced a few times in later sections.
3.7 Summary
In this chapter we have seen the some aspects of interlayer graphite properties that
are known to a high degree from experiments including the equilibrium separation,
z-axis compressibility, and the E2g(1) mode. The exfoliation energy measured
in three experiments gives a clustering around 45 meV/atom but a much wider
margin experimental disagreement than the three previous properties. We have
also seen the general disagreement of first principles calculations with respect to
39
calculating the interlayer properties. The information presented in this chapter
will be utilized later in the fitting of our dispersion correction, but first we use it
in the analysis of existing graphite models.
40
“What should they know of
the present who only the
present know?”
Blair Worden
4Existing models for dispersion
interactions in graphite
This chapter reviews previous work in the modeling of graphite interlayer proper-
ties and ends with the motivation for what will be done to address shortcomings
of the various models.
4.1 Empirical
The empirical models in this section only address the calculation of interlayer
properties. There are also intralayer empirical models for carbon ([17]) that are
sometimes used in conjunction with these for a fuller description of graphite.
41
The most quoted empirical model is the Lennard-Jones model proposed by Giri-
falco et al. [54, 53]. Recall that there are two constants in the LJ potential, given
in equation (2.8). In Girifalco’s model the attractive C6 constant was calculated
with the Kirkwood method (explained in § 6.2.6.2) from experimental data given
in [6], and is reported as 15.2 eV A6. The repulsive constant, C12, was calculated
by requiring that the lattice constant be equal to Zeq = 3.35 A, and is reported
as 24.1 × 103 eV A12
.
Another empirical model is that of Ulbricht et al. [163]. It is similar in nature
to Girifalco’s model, but fitted with experimental data from thermal desorption
experiments described earlier in § 3.3.1.3. The values for their LJ constants are
C6 = 15.4 eV A6
and C12 = 22.5 × 103 eV A12
.
We have implemented these two empirical models to study the interlayer properties
they produce as a function of the interlayer separation, and are shown in Figure
4.1, for both the ABA and AAA stacking. The details of this implementation
(unit cell, periodic boundary conditions etc.) will be discussed later.
3 3.1 3.2 3.3 3.4 3.5 3.6 3.7−50
−48
−46
−44
−42
−40
−38
−36
−34
−32
−30
Interlayer separation [A°]
Inte
rlaye
r en
ergy
[meV
/ato
m]
giri − AAAgiri − ABAulbr − AAAulbr − ABA
Figure 4.1: Interlayer energy scans for the empirical models of Girifalco (giri) andUlbricht (ulbr) for ABA and AAA stacking of graphite around Zeq
42
Table 4.1 summarizes the results from this evaluation, which includes the energy
difference in the stackings EABA−AAA, the exfoliation energy(Exf ), the E2g(1)
phonon frequency, and the z-axis compressibility (kz) of graphite in the ABA
stacking (details of calculating the latter two quantities are addressed in § 8.3.2
and § 8.3.3 respectively)
Model Exf kz E2g(1) EABA−AAA
[meV/atom] [cm2dyne−1] 10−12 [THz] [meV/atom]
Girifalco 43.41 3.18 0.36 0.9Ulbricht 46.6 3.17 0.34 1.1Experiment 30-50 2.74 1.26 17-23
Table 4.1: Results of LJ empirical models with experimental data comparison
Both of these models reproduce the correct interlayer equilibrium distance (they
were fitted to do so), and predict exfoliation energies (Exf) that are in the range
of experimental values summarized in Table 3.1. For both models there is about a
≈ 16 % error with respect to the experimental z-axis compressibility. The percent
error in the calculated E2g(1) mode frequency is much larger, ≈ 72% for both
models. The large discrepancy in this mode comes from modeling the repulsive
interaction isotropically. As was discussed in § 3.2.1 the Pz orbitals that stick out
of the graphene plane play a major role in the repulsive overlap between planes
and are very anisotropic in nature, as was illustrated in the AFM scan in Figure
3.3. This deficiency in modeling corrugation can also be seen in the relatively
small energy difference between the AAA and ABA stacking as compared to the
approximate DFT-LDA numbers. If this energy difference between stacking tends
to zero the E2g(1) mode will become non-existent in the model.
There is a third empirical model which tackles this shortcoming. The “registry
dependant” empirical model of Kolmogorov [96], rectifies this problem by intro-
1There is a slight discrepancy (≈ 2.1% difference) between the exfoliation energy quoted in[54] (42.5 meV/atom) and what I calculated here. They utilized an analytical lattice summationtechnique whereas I am using a small unit cell with periodic boundary conditions
43
ducing an anisotropic repulsion term that mimics the nature of the Pz orbital.
The attractive dispersive contribution is the same as previous models (C6/r6).
The model contains 7 free parameters and is fitted to Zeq, Exf = 48 meV/atom,
kz, and the energy difference between ABA/AAA stacking from a DFT-LDA anal-
ysis. The value of C6 in the model is given as 10.24 eV A6. As to whether this
coefficient was fitted along with the 6 other parameters in the repulsive term or
calculated separately is not made clear. The E2g(1) mode was not in the database
for fitting and its value as calculated by the fitted model is not reported. It
should be in the neighborhood of the experimental value because the model is
fitted to reproduce the energy difference between ABA and AAA stacking. We
have discussed the limited knowledge of the exfoliation energy, and the unknowns
associated with the ab-initio based interalyer energy calculations (both of which
this model used in fitting), so this model has to be viewed as a step in the right
direction in modeling corrugation, but may be limited in its predictive powers.
4.2 Semi-empirical and first principles models
The next level of sophistication in modeling graphite interlayer interactions is a
quantum mechanical based, semi-empirical or first principles model. As discussed
in § 3.6 these methods generally fail to capture dispersion energetics. The method
traditionally employed to correct this shortcoming is a dispersion addition to the
energy, with the total energy defined as [68]
Etot = Efp/se + Edispersion, (4.1)
where Efp/se is the energy computed from either a first principles (fp) or semi-
empirical (se) method. In these models it is assumed that the short range behavior
in the region of atomic overlap is modeled accurately, whereas the long range
dispersion is either completely missing (as in HF and TB methods) or only partially
accounted for (DFT). The form of Edispersion is generally a pairwise C6/r6 term
44
that is effectively switched off at close range by a damping function ( damping
functions will be addressed in Chapter 7).
Adding a dispersion energy correction on top of a quantum based method has
been widely used. Many Hartree-Fock plus dispersion models for rare gasses and
aromatic hydrocarbons have been reported. Some early papers from the mid 70’s
to early 80’s include [1, 68, 157], and continue to the present day [79]. There is a
similar counterpart and precedence for this method in DFT plus dispersion models,
for rare gasses [168], and in aromatic hydrocarbons [117, 130, 180]. A study of a
DFT plus dispersion model for graphite has been constructed by Hasegawa and
Nishidate [63]. This model has the benefit of an accurate first principles treatment
of the intralayer bonding and with the added damped dispersion term models
graphite to a degree which is satisfactory. This model has obvious limitations in
the systems size due to the expensive DFT procedure, and there is the possibility
of double counting some of the dispersion energy due to the DFT’s ability to pick
some of it up.
On the semi-empirical side there are a few articles in literature that describe
tight-binding plus dispersion models. Biro et al. describe an intermolecular Huckel
model with a Lennard-Jones energy addition to model dispersion [156]. This model
has two tight-binding parameterizations, one for intra-molecular interactions and
one for inter-molecular interactions. This model has been applied to graphite
and CNTs. This model is different than the other tight-binding plus dispersion
models that follow in that the dispersion term is not damped and the repulsive
energy term is effectively double counted by including the repulsive C12/r12 term
in addition to the repulsion present in the tight-binding formalism.
Elstner et al. added a damped dispersion correction to a non-orthogonal tight
binding model [38]. This model was created to study the stacking of nucleic
acid base pairs, and necessarily included H,N,C,O parametrization, of both the
tight-binding parameters and the dispersion interactions between each. Another
45
example of a tight binding plus dispersion model is that of Palser [131]. In a similar
fashion to Elstner, Palser added a damped dispersion energy to a non-orthogonal
tight-binding model. This model was focused specifically on modeling graphitic
structures. Palser’s model has three free parameters that were fit to reproduce
Zeq, Girifalco’s experimental interlayer energy, and the DFT based ∆EABA/AAA
stacking difference of 17 meV/atom from [24]. The dispersion coefficient in this
fitting came to 11 meV A6. The z-axis compressibility that was not fitted to, is 9.4
% different than the experimental value. Palser did not report on a calculation of
E2g(1). Kwon and Tomanek have a similar model fitted to the same ABA graphite
properties as Palser [100].
4.3 Motivation for what is to be done
These tight-binding plus dispersion methods unlike the empirical models discussed
implicitly model the corrugation of the Pz orbitals overlapping at close distances.
More importantly from a standpoint of modeling nanomechanical systems is the
quantum mechanical treatment of the inplane bonding that is not addressed with
the empirical models. This fact allows one to apply molecular dynamics to study
the evolution of interacting graphitic structures under the influence of repulsive
overlap forces and the attractive dispersion forces, all the while having access to
the electronic structure which is important in modeling NEMs. The computational
efficiency of tight-binding plus dispersion as compared to the DFT plus dispersion
of Hasegawa, allows a much greater freedom in the simulations of larger systems
over longer time scales. For these reasons the tight-binding plus dispersion method
is deemed a valid avenue for modeling the subjects proposed in the introduction,
and will be developed here.
The following three chapters lay the theoretical ground work for a physically mo-
tivated and accurate model of graphite based on this tight-binding plus dispersion
model. The dispersion force and covalent bonding that have been mentioned so
46
frequently in these introductory chapters will be considered in a more rigorous
manner. In chapter 5 the tight-binding methodology and parametrization will be
fully presented. In chapter 6, the definition of C6, and the methods for calculating
it will be established. The motivation for this chapter is to provide confidence in
correctly modeling the long range interactions and provide a sanity check when
final results are presented. The idea of damping functions will be addressed in
Chapter 7, and in Chapter 8 all the discussed aspects of the model are brought
together and the parameter optimization is explained.
47
5Modeling electronic and repulsive
interaction with a tight binding formalism
As discussed in § 2.4.2 tight-binding is a semi-empirical method based on quantum
mechanics. This chapter describes the foundation of tight-binding, its parametriza-
tion and use in total energy calculations of systems of atoms. The reader may find
it useful to refer to Appendix B if he/she becomes lost. Before we get started it
should be noted that we are treating are quantum mechanical systems in the
one-electron picture considering no relativistic spin effects, or self-consistency.
Also as in most quantum mechanical calculations we are employing the the Born-
Oppenhemier approximation by treating only the electrons quantum mechanically.
48
5.1 Linear combination of atomic orbitals
In quantum chemistry the electronic states of a collection of interacting atoms that
make up a molecule are often described in terms of a collective wavefunction. This
collective wavefunction is described by a Linear Combination of Atomic Orbitals
(LCAO). The molecular wavefunction Ψ written in this LCAO form is
Ψλ =
n∑
i
cλi φi. (5.1)
The λ refers to the particular state (λ = 1 is the ground state), c is the expansion
coefficient to be determined, and φ is the atomic orbitals for the n orbitals. These
atomic orbitals are known as the basis functions and usually are not the exact
atomic orbitals. Rather they usually only have atomic like character known from
the solution of the hydrogen atom e.g. Slater type orbitals (STO’s).
5.2 Schrodinger equation in LCAO approximation
The governing equation for the wavefunction in a molecule or solid is given by the
Schrodinger equation
HΨλ = ελΨλ. (5.2)
H is the quantum mechanical Hamiltonian for the molecule, and ελ is the energy
of the state λ. Within a LCAO framework the solving of this equation is equiv-
alent to finding the set of expansion coefficients (ci) for each state λ defined in
equation (5.1) that minimizes the energy that particular state. To do this the
variational method (Rayleigh-Ritz method) is applied in the minimization of the
energy functional given by
49
ελ =〈Ψλ|H|Ψλ〉〈Ψλ|Ψλ〉 . (5.3)
Bracket notation as discussed in section B.2 is employed in this definition. Insert-
ing the LCAO expansion for Ψλ into equation (5.3) and applying the variational
principle with
∂ε
∂ci= 0, (5.4)
leads to n equations in the form of the generalized eigenvalue problem given as
Hc = εSc. (5.5)
H is the Hamiltonian matrix with elements given by Hij = 〈φi|H|φj〉 and S is the
overlap matrix with elements defined by Sij = 〈φi|φj〉, and c is a vector containing
the expansion coefficients. If the basis is chosen such that all the basis functions
|φi〉 are orthonormal, the overlap matrix will be equal to the identity matrix and
equation (5.5) will reduce to a regular eigenvalue problem
Hc = ε c. (5.6)
A detailed derivation of the application of the variational principle to a LCAO
basis leading to the generalized eigenvalue problem is presented in Appendix B.9
for the interested reader.
5.3 The basis and atomic centered orbitals
When solving the Schrodinger equation via the variational method with a LCAO
the size of the basis must obviously be finite due to limited computational re-
50
sources. One ideally wants to include as few atomic orbitals as necessary to in-
crease computational speed, but enough to capture the essence of the system. The
atomic orbitals included define the basis of the system, for covalent solids typically
the S, Px, Py, and Pz are sufficient to describe the ground state properties.
When solving a problem with atoms dispersed in space, each basis is centered at
the nucleus. The value of the basis wave function at an arbitrary point ~r in space
depends on the vector distance between this point and the nucleus at which it is
based. The vector connecting these two points is simply ~r − ~R, where ~R is the
position vector of the nucleus taken from the same global origin as ~r. The notation
used is
φα(~r − ~R). (5.7)
Where α labels the orbital type, α ∈ (S, Px, Py, Pz) in our 4 orbital basis. We can
rewrite our one-electron LCAO wave function now in the form
Ψλ(~r) =∑
i,α
cλiα φα(~r − ~Ri). (5.8)
λ is an index describing the eigenstate, the sum on i goes over the n atoms in the
system, and the sum on α over the basis.
We can also rewrite the matrix elements for the Hamiltonian and overlap matrices
that make our generalized eigenvalue problem from equation (5.5). Instead of Hij
we now adopt the notation Hiαjβ. Meaning the matrix element is an integral
between orbitals of type α and β on atoms i and j. Using our new notation these
are
Hiαjβ = 〈φiα|H|φjβ〉 =
∫φ∗
iα(~r − ~Ri) H φjβ(~r − ~Rj) dτ, (5.9)
51
Siαjβ = 〈φiα|φjβ〉 =
∫φ∗
iα(~r − ~Ri) φjβ(~r − ~Rj) dτ. (5.10)
5.4 Periodic solids
When modeling an infinite periodic solid such as a crystal it is not necessary to
explicitly model all atoms, rather one makes use of Bloch’s theorem. The premise
is that in a periodic solid the value of the wavefunction evaluated at an arbitrary
translation of a lattice vector must follow [4]:
Ψλ(~r + ~R) = eik~RΨλ(~r). (5.11)
Here the lattice translation vector is ~R = w1~a1 +w2~a2 +w3~a3 (the w’s are simply
integers), and k is the wavevector in the first Brillouin zone.
A proposed wavefunction for a periodic solid must satisfy the relation in (5.11).
One form that satisfies this is known as the Bloch sum and is given by [154]
ϕk
iα(~r − ~Ri) = N−1
2
∑
~Ri
eik~Riφiα(~r − ~Ri), (5.12)
where ~Ri represents the set of all translation vectors for atom i. i.e. all integer
permutations for the w’s given above. ϕk
iα will be now referred to as a crystal-
like wavefunction. Using this as our basis for periodic solids we can write our
Hamiltonian and overlap matrices in terms of wavevector k as:
Hk
iαjβ = 〈ϕk
iα(~r − ~Ri)| H |ϕk
jβ(~r − ~Rj)〉 , (5.13)
Hk
iαjβ = N−1∑
~Ri,~Rj
eik(~Rj−~Ri)
∫φ∗
iα(r − Ri)Hφjβ(r −Rj)dτ. (5.14)
52
One of the sums cancels with N−1 and the matrix becomes
Hk
iαjβ =∑
~Rj
eik~RjiHiαjβ. (5.15)
An analogous progression of the last three equations is applied to the overlap
matrix and it takes a similar form:
Sk
iαjβ =∑
~Rj
eik~RjiSiαjβ. (5.16)
At this point we defined the form of the integrals for the Hamiltonian and overlap
matrix elements given in real space by equations (5.9), and (5.10), and in k-space
by equations (5.15) and (5.16). Now we just need to evaluate them. These integrals
turn out to be very difficult to evaluate, but by applying some approximations to
the full form of the matrix elements the problem can be reduced to a manageable
form. The next few section discuss the main approximations of tight-binding and
introduce the parameters that make this method semi-empirical.
5.5 The Hamiltonian and the two-center approximation
The first approximation is known as the two-center approximation and has to do
with the evaluation of the integrals in the Hamiltonian and overlap matrices. The
Hamiltonian operator in these matrix elements is the one-electron Hamiltonian
discussed in Appendix B and given in equation (B.21). The potential energy
includes a contribution from each ion core in the system:
V (r) =N∑
i
Vi(r − Ri). (5.17)
The potential Vi is located on atom i. These potentials are assumed to be spheri-
cally symmetric. Plugging this potential into the Hamiltonian and then into the
53
real space Hamiltonian matrix element given in equation (5.9) we have
Hiαjβ =
∫φ∗
iα(r − Ri)
[1
2m∇2 + V1(r − R1) + . . . + VN(r −RN )
]φjβ(r − Rj)dτ.
(5.18)
In order to better understand the individual terms in this expression we can write
the above as
Hiαjβ = 〈φiα| T |φjβ〉 + 〈φiα| Vi |φjβ〉 + 〈φiα| Vj |φjβ〉 +∑
k 6=i,j
〈φiα| Vk |φjβ〉 . (5.19)
The first term is the kinetic energy and the rest are three classes of potential
energy terms. From this form we can classify the contribution to an individual
matrix element, Hiαjβ by making the following definitions
1. When all three locations, i,j and k are on the same atom we have an on-site
integral
2. When the potential k is on the same site as one of the wave functions i,
and the other wave function is at another site (i 6= j) we have a two-center
integral
3. When all three are on different cites (i 6= j 6= N) we have three-center
integrals
The three-center integrals are small compared to the two-center integrals. The
negation of three-center integrals (i.e. the “two center approximation”) greatly
simplifies the evaluation of these terms. Later when we discuss the parametrization
it will be seen how some of the error incurred in ignoring three-center integrals
are absorbed in the the fitting of the parameters.
54
The reduced Hamiltonian matrix elements within the two-center approximation
are
Hiαjβ = 〈φiα| H2c |φjβ〉 , (5.20)
where H2c is the two center Hamiltonian operator, described by
H2c =1
2m∇2 + Vi(r −Ri) + Vj(r − Rj). (5.21)
Within the two-center approximation the matrix elements essentially become a di-
rectionally dependant pair potential that are a function of internuclear separation,
|Ri − Rj |, and the angular momenta of the orbitals α and β.
5.6 Slater-Koster parametrization
The tight binding or “simplified” linear combination of atomic orbitals method was
first described by Slater and Koster in 1954 [154]. They describe their method as a
manner of interpolation of finer resolved first principle calculations. This original
paper was concerned with calculating energy band properties of bulk crystalline
materials. The basis of the method is to decompose the integrals found in the
Hamiltonian and overlap matrices by means of orbital symmetry and replace the
direct evaluation of these simplified integrals by a parametrization based on the
two-center approximation.
5.6.1 Orbital decomposition
The orientation between arbitrarily aligned s-p, or p-p orbitals can be decom-
posed into σ and π bonds, as is discussed in the next section. Our basis orbitals
have atomic like character and this character is described by spherical harmonics
known form the full analytical solution of the hydrogen atom. The integrals over
55
space of the products of these spherical harmonic functions (i.e. elements in the
Hamiltonian and overlap matrices) are always zero due to symmetry for certain
angular momenta and orientation. Just considering our sp basis, the orientations
that have none-zero space integrals are only 4 and are a shown in Figure 5.1.
Figure 5.1: Relative orientation of s and p orbitals that have a non-zero 3 spaceintegration between them. Shaded region is “+” and white region is “-”
5.6.2 s-p decomposition
Consider the example of the Hamiltonian matrix element between an s orbital on
one atom and a p orbital on another atom (i 6= j). The matrix element to be
evaluated is
〈ψis|H2c|ψjp〉 = 〈S|H2c|Pα〉 . (5.22)
The right hand side introduces a simplified notation that will be used in this
section. It simply means an s type orbital on atom 1 and a p type orbital with
angular momenta α on atom 2.
In order to simplify the evaluation of this matrix element we start by decomposing
the p orbital into components that are parallel (σ) and perpendicular (π) to the
internuclear unit vector ~d that is shown in figure 5.2.
56
Figure 5.2: S and Pz orbital decomposition. Vector ~r connects the two atoms, ~dis a unit vector of ~r.
To facilitate this calculation we define the vector ~a, which is a unit vector along
the cartesian axes corresponding to the type of P orbital considered. For a Pz
orbital ~a is simply 0i + 0j + 1k. The vector ~n, is a unit vector normal to ~d, and
in the plane described by ~d and ~a. Both of these vectors are shown in figure 5.2.
It is now easy to describe the arbitrary Pα orbital as a sum of σ and π parts as
|Pα〉 = ~a · ~d |Pσ〉 + ~a · ~n |Pπ〉 . (5.23)
Now we can substitute this into equation (5.22) as,
〈S|H2c|Pα〉 = 〈S| H2c |[~a · ~d |Pσ〉 + ~a · ~n |Pπ〉
]〉 , (5.24)
which expands to
〈S|H2c|Pα〉 = (~a · ~d) 〈S|H2c|Pσ〉 + (~a · ~n) 〈S|H2c|Pπ〉 . (5.25)
We gave the arguments in the previous section that the second term on the right
hand side of equation (5.25) is zero. With this consideration equation (5.25)
reduces to
57
〈S|H2c|Pα〉 = (~a · ~d) 〈S|H2c|Pσ〉 . (5.26)
Since ~a is defined as a unit vector along one of the cartesian coordinates, (~a · ~d) is
simply the directional cosine in the direction of α. Where the directional cosines
are defined by
dx =~r · x|~r| , dy =
~r · y|~r| , dz =
~r · z|~r| , (5.27)
with (x, y, z) being the global axis unit vectors. Using these definitions of direc-
tional cosines equation (5.26) can be rewritten as
〈S|H2c|Pα〉 = dα 〈S|H2c|Pσ〉 = dα tspσ(r). (5.28)
Here we have introduced the shorthand notation of:
tspσ(r) = 〈S|H2c|Pσ〉 . (5.29)
This tspσ(r) is the parameterized SK function that replaces the integral between
an S and P orbital in a σ configuration as a distance dependant function. In §
5.8 we will talk about the form and fitting of these functions. The methodology
of decomposing this general SPα integral into σ and π components with the help
of directional cosines is applied to the other integrals of interest in the sp basis.
The resulting integrals can all be expressed via four SK functions (t(r)’s) and the
directional cosines. The required t(r)’s for our basis are defined as
The reduced form of the other integrals (for ss and pp orbitals) are shown in
equation (5.31) in the following section.
58
tssσ(r) = 〈S|H2c|S〉tspσ(r) = 〈S|H2c|Pσ〉tppσ(r) = 〈Pσ|H2c|Pσ〉tppπ(r) = 〈Pπ|H2c|Pπ〉
5.6.3 Construction of tight binding matrices
For each atom pair i, j all matrix element terms between the orbitals on i and j are
calculated using the SK parameterized functions. Our S, Px, Py, Pz basis leads to
4× 4 sub-matrices for each atom pair, labeled Hsubij . The matrix below shows the
what integral each element of the sub-matrix represents ( i.e. sisj = 〈si| H2c |sj〉 )
Hsubij =
sisj sipjx sipjy sipjz
pixsj pixpjx pixpjy pixpjz
piysj piypjx piypjy piypjz
pizsj pizpjx pizpjy pizpjz
(5.30)
There are two types of these sub-matrices, on-site and hopping sub-matrices. The
latter occurs when the integral is defined between orbital on different sites ( i.e.
i 6= j). This 4 × 4 hopping sub-matrix is defined as
Hsubij =
tssσ dxtspσ dytspσ dztspσ
−dxtspσ d2xtppσ + (1 − d2
x)tppπ dxdy(tppσ − tppπ) dxdz(tppσ − tppπ)
−dytspσ dydx(tppσ − tppπ) d2ytppσ + (1 − d2
y)tppπ dydz(tppσ − tppπ)
−dztspσ dzdx(tppσ − tppπ) dzdy(tppσ − tppπ) d2ztppσ + (1 − d2
z)tppπ
(5.31)
For on-site integrals, when i = j, the contribution to the sub-matrix is simply the
energies of the individual s and p orbitals along the diagonal.
59
Hsubij =
Es 0 0 0
0 Ep 0 0
0 0 Ep 0
0 0 0 Ep
(5.32)
These sub-matrices are then inserted into their respective positions of the global
Hamiltonain matrix Hiα,jβ. To illustrate this assume the 4 × 4 sub-matrices are
indexed by x, and y. Their position in the global matrix is defined as
Hiα,jβ
(4(i− 1) + x, 4(j − 1) + y
).
Until this point all discussion of the tight-binding matrices have been in terms
of the Hamiltonian matrix. The procedure for the overlap matrix S follows the
same route. The integrals between orbitals discussed in the previous section are
decomposed the same and the the results presented in the construction of the sub-
matrices is the same except the overlap has different SK functions defined by t(r)’s.
The on-site sub-matrix is the identity matrix instead of the orbital energies. The
global overlap matrix is constructed from its sub-matrices just as was described
above.
For periodic solids each element is multiplied by the summation pre-factor∑~Rj
eik~Rji
given in equations (5.15) and (5.16).
5.7 Tight Binding Total Energy
Up to now we have seen the tight binding formalism and now address calculating
the total energy of a system of atoms. This energy contains two parts: band
structure energy (EBS), and a short range repulsive interaction (Erep) [137]. This
division of the energy is understood in light of earlier approximations. Under
the Born-Oppenheimer approximation all energies calculated via Schrodinger’s
60
equation are electronic energies (EBS). To account for the total energy the core-
core ionic repulsion of the nuclei must be accounted for. The total tight-binding
energy is expressed as
ETB = EBS + Erep. (5.33)
The repulsive contribution is between all atomic cores and is most often taken as
a two body potential Θ(rij) summed over all interactions
Erep =1
2
∑
i,j
Θ(rij). (5.34)
rij is the distance between atoms i and j. The form of the potential is not explicitly
known but is selected and fitted to reproduce points in the data base.
The second contribution to the total energy is the band structure energy. The
Hamiltonian and overlap matrices matrices are constructed for a configuration of
atoms and for a given wave vector k. These matrices are used in the generalized
eigenvalue problem given in equation (5.5) restated here in a form showing the
wave vector dependance
Hkc(k) = ε(k) Skc(k).
The solution to this generalized eigenvalue problem results in a spectrum of eigen-
values ε(k) corresponding to the energy of the different states, and the eigenvectors
c(k) constitute the expansion coefficients. The band structure energy is defined
as
EBS =∑
λ∈occ
∫
FBZ
fλ ελ(k) d3k. (5.35)
61
The sum is over the occupied states, and the occupation number of a state λ
is given by fλ ∈ {0, 1, 2}. The integral is over the first Brillouin zone (FBZ).
Computationally this integration is approximated by a weighted sum over discrete
k-points in the Brillouin zone expressed by
EBS =∑
λ∈occ
Nk∑
i=1
wi fλ ελ(ki). (5.36)
The wi is the weight assigned to the specific k-point, and must satisfy the condition
Nk∑
i=1
wi = 1. (5.37)
It is possible to uniformly sample the Brillouin zone at discrete points and assign
a uniform weight of w = 1/Nk in evaluating 5.36.
5.8 Fitting TB parameters
We have seen the ground work of tight binding this far but now must consider how
the various parameters are fit. Slater and Koster originally used first-principles cal-
culations of energy eigenvalues at k-points corresponding to high symmetry points.
By using a least squares method a set of parameters can be found such that the
mean squared error of the eigenvalues at the high symmetry points is minimized.
Of course fitting only to the band structure for one allotrope is the easiest. This
procedure for carbon can be carried out by simultaneously fitting/reproducing the
band structure for diamond and graphite. Material properties such as the bulk
modulus and other elastic constants can be used in the database for the fitting of
the parameters. The development of tight-binding methodology given above car-
ried along the overlap matrix. Many early parameterizations were designed such
that all basis functions were orthogonal and thus the overlap matrix is simply
the identity matrix. These orthogonal models have half the parameters as non-
62
orthogonal models to fit (i.e. no t(r)’s for the overlap). The extra parametrization
and flexibility of non-orthogonal models has been noted to improve the transfer-
ability of tight binding models.
5.8.1 Density functional based tight binding
The tight-binding parameters used in our code are those of Porezag et. al. [137].
They developed a SK parametrization for carbon based on DFT, that has proven
versatile and predictive. They started by writing the Kohn-Sham orbital ψks
as a linear combination of atom centered pseudoatomic wavefunctions φν , that
are sums of Slater-type orbitals and spherical harmonics. These pseudoatomic
wavefunctions contain parameters that are found through self-consistently solving
the Kohn-Sham equation given by
[T + V psat(r)
]φν = Epsat
ν φν , (5.38)
V psat(r) = Vnucleus(r) + VHartree[n(r)] + V LDAxc [n(r)] +
(r
ro
)N
. (5.39)
The term(
rro
)N
is introduced in order to compress the wavefunction such that the
band structure fitting is improved. This has consequences that will be addressed
in § 8.2.1.
The φν obtained by solving equation (5.38) are used as the basis functions in
a LCAO treatment of the systems and are used next in calculating the matrix
elements H and S. For the Hamiltonian matrix elements the two-center approx-
imation is utilized and the potential is a Kohn-Sham potential of a neutral pseu-
doatom. These elements are evaluated explicitly at discrete distances using the
method described in [40]. The tabulated values for the matrix elements as a func-
tion of internuclear separation are fit to 10th order Chebyshev polynomials. Its
63
computationally advantageous to evaluate the polynomials at discrete steps and
save the results to a database to be used in interpolation, rather than evaluate
them on the fly.
The repulsive two body potential is found through the energy difference between
the total system energy given by the self-consistent DFT calculation and the band
structure energy calculated through the fitted parameters in the previous step
Urep(R) = ELDAtot (R) − EBS(R). (5.40)
These points are similarly tabulated and fitted, in this case to a sum of polyno-
mials.
5.9 Use of Porezag parametrization and current code
Porezag’s parametrization method has been used extensively as can be seen in a
quick literature search. Parameters for Si,B,N,Ga,As and many other have been
developed in a similar manner. These parameters have been used in studying the
electronic and mechanical properties of perfect and distorted carbon nanotubes
[69, 142], diffusion of carbon in GaAs [103] electronic structure and vibrational
properties of a number of small clusters [152]. The code I am using was developed
originally by Graves [55] and has been modified by several people for different
purposes, and as mentioned is currently utilizing Porezag’s non-orthogonal carbon
parameters. This particular tight binding code has been used in several studies of
semiconductor response to ultrafast laser pulses [32, 33, 49].
64
6Quantum mechanical origin of dispersion
forces
This chapter contains theories of dispersion forces that are elucidated and applied
to the model graphitic system. The 1/r6 dependance in the dispersion energy will
be brought to light, along with calculation of C6 coefficient for graphite utilizing
different methods.
Theories of dispersion forces can be generally lumped into two camps: microscopic
and macroscopic theories. The microscopic theory is a bottom-up approach based
on the atom-atom dispersion interaction of London that has been mentioned sev-
eral times thus far. In these microscopic theories the total interaction is thought of
65
as a sum of contributing part of the microscopic constituents. Under the assump-
tion of pairwise addivity (discussed in the next section) the microscopic dispersive
interactions add up to describe the total interaction between macroscopic bodies
(i.e. molecules/solids).
In an opposite top-down fashion macroscopic theories start with the dispersive
energy between macroscopic bodies. Again under the assumption of additivity,
the effective microscopic contributions can be backed out of these theories. The
theories of Lifshitz and Hamaker are introduced in the eventual extraction of C6
for graphite via these methods.
6.1 Main assumptions in modeling dispersion interactions
Before discussing the physical basis of the dispersion theories in detail there are
some important points to keep in mind. There are some major assumptions that
are commonly made in dealing with dispersion forces that are rarely mentioned
in the first principles plus dispersion papers cited in § 4.2. These assumptions are
addivity, isotropicity and non-retardation [110, 111].
The addivity assumption is the idea that the total dispersion energy can be ex-
pressed as a pairwise summation of interaction between the atomic constituents.
The isotropicity assumption says that the dispersion energy is isotropic in nature.
This turns out to be only true for some special cases and as a general rule does not
hold. Retardation is an effect that arises at far separation. Since dispersion inter-
actions are electromagnetic in nature the instantaneous dipole that is realized on
one molecule takes a finite amount of time to register with the other molecule due
to the speed of light. At farther separations these systems can no longer correlate
with each other for a net lowering of energy. The 1/r6 energy distance dependance
decays to a 1/r7 dependance and finally disappears at further distances [110].
In later sections these assumptions will be further addressed in conjunction with
the development of the theories.
66
6.2 Microscopic dispersion theory (London)
The idea of “spontaneously induced dipoles”, termed London dispersion forces,
was introduced in § 2.3.4 and referenced many times since. Now the quantum
mechanical origin of the 1/r6 distance dependance and dispersion coefficient C6
will be laid bare through the application of perturbation theory to a system of
two hydrogen atoms as London first did in 1930 [107].
Perturbation theory in short is the calculating of corrections to the energy and
wavefunction of a known system under the influence of a disturbance (see Ap-
pendix E for more formalism). We will show the change in energy of interacting
charge clouds as they are brought together from infinite separation. First the sys-
tem is described, followed by the form of the perturbation and finally the solution
as given by perturbation theory.
6.2.1 System description
Consider an atom A who’s quantum numbers and, thus state, is described by λa,
and another atom B, with state λb. This system of two atoms when infinitely
separated can be treated individually with the Schrodinger equation.
Haψλa= Eλa
ψλa, (6.1)
Hbψλb= Eλb
ψλb, (6.2)
where the Hamiltonians contain terms associated with the isolated atoms. This
system is said to be unperturbed and its wave function is simply the product of
the individual wavefunctions, and its hamiltonian and energy just the sum of the
two, i.e.
67
ψ(0) = ψλaλb= ψλa
ψλb, (6.3)
H(0) = Hab = Ha +Hb, (6.4)
E(0) = Eλaλb= Eλa
+ Eλb. (6.5)
The superscripts “0” mean the zeroth order and stand for the unperturbed solu-
tion. The system of two non-interacting atoms now can be written as
H(0)ψ(0) = E(0)ψ(0). (6.6)
6.2.2 The perturbed system
When the two systems are separated by a finite distance rather than infinite,
the wavefunctions ψλaand ψλb
are nearly the same but are slightly distorted due
to the presence of one another. If the distortion is slight enough one can apply
perturbation theory to approximate the change in the system.
Perturbation theory starts by assuming the full knowledge of the zeroth-order
unperturbed solution to the Schrodinger equation given in equation (6.6). With
this knowledge its possible to describe the solution of the same system that is
slightly “perturbed”. This system is described by a new Hamiltonian that has the
same base Hamiltonian H(0) plus a small perturbation V
H = H(0) + V. (6.7)
The perturbation V can have many orders of contribution as can be see in Ap-
pendix E. Here we do not show higher order perturbations because they do not
68
add anything to the description of our system and it helps keep the notation
cleaner.
6.2.3 Energy corrections in perturbation theory
The correction to the energy and wavefunction in perturbation theory are added
to the zeroth order energy and wavefunction, and the actual wavefunction and
energy are expressed as
ψ = ψ(0) + ψ(1) + ψ(2) + . . . h.o.t′s, (6.8)
E = E(0) + E(1) + E(2) + . . . h.o.t′s. (6.9)
The superscripts on the energy and wavefunction denote the order of the correc-
tion. E(1) and E(2) are referred to as the first and second order corrections to the
energy respectively, and as mentioned earlier E(0) corresponds to the unperturbed
energy.
The derivation for the first and second order corrections to the energy and wave-
functions are shown in Appendix E. The first order correction to the energy, in
state k is
E(1)k = Vkk. (6.10)
Vkk is the short hand notation for 〈ψ(0)k |V |ψ(0)
k 〉. As mentioned in appendix E
the correction is not restricted to the ground state (k = 0). The second order
correction is given as
E(2)k =
′∑
n
|Vkn|2
E(0)k − E
(0)n
. (6.11)
69
The prime in the sum means over all states except n = k.
Using the two atom wave function given in (6.3), in equations (6.10), and (6.11),
the first and second order corrections to the energy are
E(1)λaλb
= 〈ψ(0)λaλb
| V |ψ(0)λaλb
〉 , (6.12)
E(2)λaλb
=′∑
n1
′∑
n2
∣∣∣〈ψ(0)λaλb
|V |ψ(0)n1n2
〉∣∣∣2
(E(0)λa
−E(0)n1
) + (E(0)λb
− E(0)n2
). (6.13)
6.2.4 Application to Hydrogen
If the atoms A and B are hydrogen and the quantum numbers correspond to the
ground state we know, from the full analytical solution of the wave functions in
the ground state that the charge distributions are spherically symmetric.
+ +
−
−
~R
A B
~r1
~r2
x
y
Figure 6.1: Two hydrogen atoms A and B. The nuclei are separated by ~R withelectrons referenced from the nuclei
Figure 6.1 shows the system of hydrogen atoms, separated by the internuclear
vector ~R. Atom A’s electron is denoted by ~r1 and the other electron is denoted as
~r2 for atom B. The perturbation in our case is the columbic interactions of charges
in atom A and those in atom B expressed by
70
V =
nA−nB︷︸︸︷e2
r+
eA−eB︷ ︸︸ ︷e2
|~R+ ~r2 − ~r1|−
nA−eB︷ ︸︸ ︷e2
|~R+ ~r2|−
eA−nB︷ ︸︸ ︷e2
|~R− ~r1|. (6.14)
r = |~R| in the first term, and the overbraces denote the interaction (e.g. nA −
eB is the term associated with the Coulomb interaction between nucleus A and
electron in B). Assuming the nuclei are sufficiently far apart the interaction can
be expanded in a Taylor series. For the general case of an arbitrary number of
electrons this expansion is referred to as a multipole expansion and is described in
Appendix D ( see Equation D.9 and table D.1). In this derivation the internuclear
vector ~R is taken as one of the cartesian directions ( e.g. x direction in figure 6.1).
For the case of non-ionic interaction, the first five terms of the expansion in table
D.1 are zero. The electrostatic energy between these neutral charge distribution,
begins with the dipole-dipole interaction energy and continues up into the higher
order terms. The perturbations first non-zero term is given by
V = ~R−3∑
ij
qiqj(xixj + yiyj − 2zizj), (6.15)
and termed the dipole-dipole operator. The sum goes over the i charges in atom
A and the j charges in atom B. The cartesian components of ~ri are xi, yi and
zi with origins centered at the nuclei, and qi is the magnitude of the charge. For
hydrogen, using atomic units (i.e. e = 1) the dipole-dipole interaction can be
written as
V =1
r3(x1x2 + y1y2 − 2z1z2). (6.16)
With this perturbation we can calculate the corrections to the energy of the iso-
lated hydrogens due to their mutual interaction. The change in energy between
the isolated states is simply
71
∆E = E(1)0 + E
(2)0 + . . . h.o.t′s. (6.17)
The first order correction to the energy given in equation (6.12) is zero (E(1)0 = 0).
This is because the ground state wavefunctions are spherically symmetric and the
integral described by them will always be zero.
For the second order correction to the energy we plug our perturbation from
(6.16) into the definition for E(2)0 given in equation (6.13). With the ground state
described by λa = λb = 0 the second order correction to the energy is
E(2)0 =
1
r6
′∑
n1
′∑
n2
∣∣∣〈ψ(0)0 ψ
(0)0 |x1x2 + y1y2 − 2z1z2 |ψ(0)
n1ψ
(0)n2〉∣∣∣2
(E(0)0 −E
(0)n1
) + (E(0)0 − E
(0)n2
). (6.18)
The 1/r3 term from the perturbation given in equation (6.16) is squared in the
numerator of the second order correction and is brought front as 1/r6.
( mention the magnitude of E(3)0 , and magnitude of the C8 term)
There is a net change in energy proportional to 1/r6, where the proportionality is
simply the double summation in equation (6.18). We will call this term C6 and
treat it as a constant for the time being.
∆E =C6
r6. (6.19)
The sign of the energy corrections ∆E, determines the attractive or repulsive
nature of the perturbation (If the system energy is lowered its attractive and vice
versa). In the ground state the denominator of E(2)0 is always negative because
E(0)0 < E
(0)n for all n in the sum, and the numerator is always positive so C6 will
always be negative. For excited states this condition is not necessarily true and
there can be a net repulsion.
72
6.2.5 Higher order corrections
We have truncated the perturbation at the second order and only considered the
dipole-dipole perturbation. It is wise to consider the magnitude of both the higher
order perturbations and multipole terms.
This change in energy will be referred to as the dispersion energy. If higher order
multipole terms were included at in the perturbation in (6.16) we would see that
the dispersion energy takes the form
Edisp =C6
r6+C8
r8+C10
r10+ . . . h.o.t′s =
∞∑
n=3
C2n
r2n. (6.20)
The right hand side shows the general form of the infinite expansion.
6.2.6 Applying London’s dispersion theory in Calculating C6 for graphite
The next few sections describe the approximate evaluation of C6 for graphite by
simplifying the double summation in equation 6.18.
6.2.6.1 London’s form and polarizability
London’s went beyond the derivation above for an approximate evaluation of C6
given by the sum of equation (6.18). This derivation of his includes the static
polarizability of the atom in question. The static polarizability describes the
ability of an atom to gain a dipole moment (µ) under the influence of a static
electric field (E )
µ = αE . (6.21)
The static polarizability, α, has units of [C2m2J−1]. Polarizability is usually re-
ported in the form of the polarizability volume given by
73
α′ =α
4πǫo, (6.22)
with units of volume [A3]. While the polarizability is properly a tensorial quantity,
here it is assumed that the measured value used in London’s approximation is the
average static polarizability over all orientations.
London’s approximation is based upon applying the closure approximation ( see
Appendix E.3) in evaluating the sum in (6.18), the form is [108]
CLond6ij = −3
2
(∆Ei ∆Ej
∆Ei + ∆Ej
)α
′
iα′
j . (6.23)
The ∆E is the average energy difference between all excited states and the ground
state as explained in E.3. This term has been proposed to be approximately
either the ionization potential (I), and the resonance or first transition energy (ω1)
[97]. Another interpretation will be discussed in the following section. Using the
interpretation of the average energy difference (∆Ei) as the ionization potential,
the London form of C6 between atoms i and j is
CLond6ij = −3
2
(IiIjIi + Ij
)α
′
iα′
j. (6.24)
While this is a rough approximation due to the assumptions made, this formula
provides a good starting point to evaluate C6 with experimental data.
In order to evaluate this expression for carbon in graphite one needs the static
polarizability, and the ionization potential in the graphitic state. The last point,
“in the graphitic” state is important. One can find these data points for carbon
in its isolated form, to its many forms of hybridization, and when it is hybridized
to other species.
The ionization potential for carbon in graphite measured experimentally is 11.22
74
eV [136]. The static polarizability, on the other hand is not as easy to measure.
The static polarizability for an atom is not just a function of its atomic number,
but it is also a function of its state of bonding or hybridization [82, 118]. Using
this logic one can see that since there isn’t such a thing as isolated carbon with
the properties of a carbon atom in a graphitic state, this is not a directly measur-
able quantity. Fortunately there are empirical combinations rules for describing
molecular polarizability in terms of atomic polarizability and hybridization that
work very well [82, 118]. The most extensively tested of these is Miller’s [118]. His
combination rules correctly reproduce experimental polarizabilities of over 400
compounds with in a average error of 2.2%. For carbon in the graphitic state
only bonded to other carbons, Miller finds that the static atomic polarizability is
α′ = 1.896 A3. Using this polarizability value with the ionization value quoted
above the C6 for graphite using equation (6.24) is 30.25 eV A6
6.2.6.2 Slater-Kirkwood approximation
A few years after London’s derivation and approximation Slater and Kirkwood
proposed another approximation in calculating the sum in London’s derivation
[153]. Starting with London’s derivation after the closure approximation (equation
(6.23)), they interpreted the ∆Ei as
∆Ei = (Ni/α′
i)1/2. (6.25)
Instead of the ionization potential as London used. N is the Slater-Kirkwood
effective number of electrons, originally taken as the number of valence electrons.
With this the Slater-Kirkwood approximation for C6 is
Csk6ij =
3
2
α′
iα′
j
(α′
i/Ni)1/2 + (α′
j/Nj)1/2. (6.26)
Halgren has applied a similar method as Miller for the combination rules of dis-
75
persion coefficients, but in terms of the Slater-Kirkwood formalism. Halgren’s
optimum value for the Slater-Kirkwood number of electrons for carbon in a sp2
hybridization is 2.49 [59]. Using this and the same value for the static polarizabil-
ity in the previous section the Slater-Kirkwood value for C6 is 30.89 eV A6.
6.2.6.3 Kirkwood approximation
As mentioned in § 4.1, the C6 used in the Girifalco LJ graphite potential was
calculated by the Kirkwood approximation. This approximation like the Slater-
Kirkwood approximation is again based on an interpretation of London’s deriva-
tion in (6.23). This version is in terms of the static polarizability, and the dia-
magnetic susceptibility χ [88]
Ckw6ij = 6mec
2α
′
iα′
j
α′
i/χi + α′
j/χj
. (6.27)
me is the mass of the electron, and c is the speed of light. Using this form, Kraus
[98], calculated C6 = 15.2 eV A6
for graphite from data provided in [6].
6.3 Macroscopic dispersion theory (Lifshitz + Hamaker)
The other way to tackle the problem of describing dispersion forces is to start
with a description of macroscopic bodies interacting through dispersion forces.
These theories are rigorously expressed with quantum electrodynamics (QED)
[102]. They have been reformulated in semi-classical terms and in this context
the dispersion energy “can be defined as the change in the zero-point energy of
the electromagnetic field modes (obtained by solving Maxwell’s equations) when
the latter are perturbed by the molecules through coupling of the field with the
polarization currents induced on the molecules” [110]. The general results and
implications of this macroscopic theory of dispersion forces will be briefly summa-
rized and stated in the next few sections.
76
6.3.1 Lifshitz dispersion theory
The dispersion interaction between two semi infinite slabs was described by Lif-
shitz, who developed a general theory of dispersion interactions between macro-
scopic bodies [106, 36]. Before the general form Lifshitz proposed is explained, a
necessary quantity, ǫ(iξ), which is utilized by this theory must be discussed.
6.3.1.1 Dielectric response and ǫ(iξ)
The permittivity of a material ǫ, relates how a time dependant externally applied
electric field (E), will effect the electric displacement vector (D) in a solid through
D = ǫE. (6.28)
The permittivity as a function of field frequency ω, is known as the dielectric
permeability, or dielectric function ǫ(ω). This is a complex quantity with real and
imaginary component defined as ǫ(ω) = ǫ′
(ω) + i ǫ′′
(ω). The dielectric function is
properly a tensorial quantity, but for isotropic materials, or a first approximation,
ǫxx = ǫyy = ǫzz. The dielectric function can be found through the measurement
of optical properties of materials specifically, refractive index and the extinction
coefficient(n(ω) and k(ω) respectively
). This relation is given by [89]
ǫ(ω) =(n(ω) − i k(ω)
)2. (6.29)
The Lifshitz theory of dispersion forces presented in the next section requires
the function ǫ′(iξ), which is the real part of the dielectric function evaluated at
purely complex frequencies ω = iξ, and will be referred to as the Lifshitz dielectric
function from here on. The dielectric permeability for an imaginary frequency is
related to the imaginary part of the dielectric permeability via the Kramer-Kronig
relation [72]
77
ǫ′(iξn) = 1 +2
π
∞∫
0
ω ǫ′′
(ω)
ω2 + ξ2n
dω. (6.30)
Thus if the imaginary part of the dielectric function evaluated at real frequencies
is known the Lifshitz dielectric function can be attained. One path to ǫ′(iξn) can
start with the experimental data n(ω) and k(ω), progress to ǫ′′
(ω) via equation
(6.29), and finally to ǫ′(iξn) via the Kramer-Kronig relation in equation (6.30).
Another route is the direct calculation of ǫ′′
(ω) from a first-principles or semi-
empirical technique and then the application of the Kramer-Kronig relation.
6.3.1.2 Lifshitz dispersion theory
Lifshitz original derivation was for two slabs of similar material separated by vac-
uum [106], but was later extended and generalized to two differen’t materials
separated by a third medium. The interaction energy per unit area between the
two plates is
L
1 23
Figure 6.2: Two semi-infinite slabs 1 and 2 separated by a distance L and amedium 3.
U123(L) =kbT
2π
∞∑
n=0
∞∫
0
k ln(1 − ∆12∆32 e−2kL) dk. (6.31)
78
k is the wavevector of the electromagnetic field. The ∆’s in this equation are
simply
∆kj =ǫk(iξn) − ǫj(iξn)
ǫk(iξn) + ǫj(iξn), (6.32)
and the Lifshitz dielectric function is evaluated at discrete points given by
ξn = n
(2πkbT
~
). (6.33)
Mahanty and Ninham have developed this theory beyond Lifshitz’s two slab prob-
lem to other geometries such as sphere-sphere, cylinder-cylinder etc. along with
theories for anisotropic dielectric functions [110].
6.3.2 Lifshitz theory applied to the evaluation of Hamaker constants
for graphite
By equating the definition of the surface-surface interaction energy given by the
Lifshitz method to that of the Hamaker surface-surface interaction energy given
by
Uss(L) =−A
12πL2, (6.34)
one can solve for the Hamaker constant A [10, 48, 72]. The derivation of this
energy relationship and how the Hamaker constant A is related to C6 will be
discussed in the next section. Here we note two case in which the evaluation
of the Lifshitz dispersion relation has been applied to graphite and a Hamaker
constant has been identified, in addition to this a single experimental evaluation
of the Hamaker constant has been found in literature.
Dagastine et al. calculated the Hamaker constant for graphite from experimental
79
spectroscopic data [30]. The isotropic assumption of the dielectric function men-
tioned in § 6.3.1.1, does not hold for graphite. The in plane dielectric function
(ǫxx = ǫyy) is quite distinct to that normal to the plane (ǫzz). Spectroscopic data
from both in plane and normal to the plane measurements was used in the con-
struction of ǫ′′
(ω) with the relation in (6.29), and used equation (6.30) to convert
this to ǫ(iξn). Using a modified version of the Lifshitz equation in (6.31) that
accounts for non-isotropic dielectric functions they calculated a Hamaker constant
of 2.53 × 10−19 J.
Li et al. have preformed a similar analysis on graphite but attained the dielectric
response via a tight binding model under linear response theory [105]. Unlike
Dagastine they assumed an isotropic dielectric response and there resulting long
wavelength Hamaker constant is reported as 2.2 × 10−19 J.
Parfitt and Picton reported a experimental Hamaker constant for graphite from
some experiments on colloidal suspensions of graphite flakes. Due to numerous
uncertainties in the analysis a rather broad range of 2.1 - 5.9 ×10−19 J is reported
[132].
6.3.3 Hamaker constant derivation
In conjunction with the last section this section further develops the ideas of
Hamaker in describing the interaction between macroscopic bodies and calcula-
tions of effective C6 dispersion constants are preformed utilizing the Hamaker
constants reported in above. Hamaker’s ideas came out of the context of colloidal
science. The main idea of the Hamaker energy relation between macroscopic bod-
ies is an assumption of addivity of dispersion forces and a treatment of the atoms
in a solid as a continuum [61]. With these assumptions the total interaction en-
ergy can then be expressed as volume integrals over the interacting bodies. These
calculations can be applied to any arbitrary geometric bodies (e.g. sphere-sphere,
cylinder-sphere etc). The derivation for the two slab problem is given below in
80
the format described in [110]. And starts with a description of a point-surface
interaction that is utilized in the surface-surface derivation.
6.3.3.1 Point-Surface Interaction
Consider a single atom offset from a surface by a distance L, and that it interacts
with the atoms in this solid through a dispersion force of the form
Upp(r) = −C6
r6. (6.35)
The underscore pp stands for point-point interaction. Assuming additivity the
total interaction energy can be attained by summing up all interaction with the
point and the atoms in the solid for a given offset L. Another way is to assign a
number density of atoms in the solid and attain the potential energy through a
volume integral.
Y
X
dx
r
x = L
x = 0
y
dy
Figure 6.3: Point-surface interaction diagram. The wavy line represents a visualtruncation of the infinite solid. The infinitesimal ring is shown from the side andthe dotted line represents its position in the solid into and out of the page
81
Figure 6.3 is a diagram of the system in question. The point particle is located at
x = 0, a distance L from the surface. The differential element is a infinitesimal
ring that is equidistant from the point. The volume of this ring multiplied by the
number density (ρ) is the number of atoms at this distance r,
N(r) = ρ(2πy)dxdy. (6.36)
r is simply r = (x2 + y2)1/2. By evaluating the dispersion energy with equa-
tion (6.35) with this distance r, and multiplying it by the number of atoms at
this distance given by N(r), gives the dispersion energy contribution from this in-
finitesimal ring. Integrating over all rings results in the total dispersion interaction
energy between a point and a half-slab in the form
Ups(L) =
∞∫
x=L
∞∫
y=0
ρ(2πy)−C6
(x2 + y2)6/2dy dx. (6.37)
The ps stands for point-half slab interaction. For the geometry shown in Figure
6.3 this reduces to
Ups(L) =−πC6ρ
6L3. (6.38)
6.3.3.2 Surface-Surface Interaction
Using the definition for the point-surface interaction it is possible to calculate the
surface-surface interaction defined per unit area (as in the Lifshitz derivation).
Figure 6.4 shows two half slabs, the element of surface area has a depth dx and
contains Nse = ρ dx atoms per unit area. The interaction energy of these Nse
atoms with the other surface is simply
Use(x) = NseUps(x), (6.39)
82
L
dxx
Y
X
Figure 6.4: Surface-Surface Hamaker integration
and the surface-surface total interaction is just these surface elements integrated
from x = L→ ∞
Uss(L) =
∞∫
L
Use(x) dx =
∞∫
L
−πC6ρ
6x3ρ dx. (6.40)
This simplifies to
Uss(L) =−πC6ρ
2
12L2=
−A12πL2
. (6.41)
The right hand side is the same form that was shown earlier in equation (6.34).
As can be seen from this equation the Hamaker constant A is defined as
A = π2ρ2C6. (6.42)
The graphite dispersion constant C6 can be backed out of the Hamaker constant
with knowledge of the number density of carbon atoms in graphite which is ρ =
83
1/(A0 ·Zeq)= 0.114 atoms/A3. Using this with equation (6.42) the three reported
Hamaker constants of Dagastine, Li, and Parfitt reported in § 6.3.2 correspond to
dispersion coefficients of 12.32, 10.71, and 10.22 − 28.72 eV A6
respectively.
6.4 Summary of calculated C6 coefficients
Later when we report our fitted value of C6 we must refer back to this the results
of this chapter to see if the value found is reasonable with what was presented
here in its direct calculation. The results of the various methods, including the
experimental Hamaker value, are tabulated in table 6.1. This table shows a decent
spread with the Lifshitz based methods clustered at the lower end and the other
methods showing a larger value. It is interesting to note that the one experimen-
tal value reported (calculated from the Hamaker decomposition method) almost
perfectly spans the range of calculated values.
C6 [eV A6] Method Method Reference Data Reference
30.25 London [108] [82]30.89 Slater-Kirkwood [153] [59, 118]16.34 Slater-Kirkwood [153] [167]15.2 Kirkwood [88] [6]12.32 Lifshitz/Hamaker [106, 61] [30]10.71 Lifshitz/Hamaker [106, 61] [105]10.22 − 28.72 Hamaker (experiment) [61] [132]
Table 6.1: Summary of C6 dispersion coefficients for graphite
84
7Damping functions
In the previous chapter London’s perturbative description of the dispersion attrac-
tion between hydrogen atoms was discussed amongst other theories. This analysis
was based on two approximations that breakdown when the two hydrogen atoms
come closer together. The first is related to the main assumption in perturbation
theory, that says the perturbation must be small enough such that perturbed sys-
tem is close to the unperturbed state. When the hydrogen atoms, or any other
system of atoms, are brought close enough to one another the resulting wavefunc-
tions are very different than those of the isolated system, and thus perturbation
theory fails. The second breakdown in London’s derivation at close proximity
is from the definition of the perturbation operator via the multipole expansion.
85
This was derived through a Taylor series expansion under the assumption that the
internuclear separation was much larger than the average radius of the electron
cloud. At small internuclear separation this expression for the perturbation losses
its meaning.
In order to properly treat this region were London’s assumptions break down
the idea of a damping function is introduced. The deviation of the dispersion
energy from the asymptotic limit description, given in equation (6.20), at close
internuclear separation can be described if the true dispersion energy is known.
The damping function is defined simply as the ratio of the true dispersion energy
and the London dispersion energy
f(r) =Edisp
o
EdispL
. (7.1)
Edispo is the true dispersion energy and Edisp
L is the London dispersion energy. If the
damping function is known then the true dispersion energy is simply the damping
function multiplied by the London dispersion energy,
Edispo = f(r)Edisp
L . (7.2)
There are some general features of this damping function which can be pointed
out. One is that it acts as a switching function and has a range between 0 and
1. As r approaches zero f(r) must necessarily go to zero at least as fast as the
asymptotic dispersion energy. This can be understood by saying the dispersion
energy must be finite at all distances r, without the damping function it would
approach −∞ as r tends to zero. In the limit of large r, the asymptotic limit
of the London derivation is valid and f(r) must be unity. The nature of how
this damping function switches from 0 to 1 is the key point to consider. As in
the development of the 1/r6 dispersion energy dependance the hydrogen dimer
86
provides some clues to the nature of this damping function.
7.1 Hydrogen damping function
The only damping function which is known to a very high degree is that of hydro-
gen. Koide, Meath and Allnatt (KMA) have preformed pseudo-state evaluation
of the dispersion energy [93]. These calculations are cited as the most accurate
analysis of the dispersion energy between hydrogen atoms, and are the benchmark
for proposing a general form of damping functions [157, 166].
7.2 Damping functions in literature
All of the first principles plus dispersion models discussed in § 4.2 employ a damp-
ing function. There are a handful of damping functions that are used often. A
early form was proposed by Ahlrichs et al. [1]
fdamp(r) = exp[−(c rm/r − 1)2]. (7.3)
This function has two parameters, c is dependant on the interacting species (c
= 1.28 for hydrogen dimers using the KMA data discussed above) and rm is the
position of the potential minimum.
Another common form is that of Mooij et al. which has continuous second deriva-
tives
fdamp(r) =(1 − exp[−c(r/rm)3]
)2. (7.4)
Again c and rm are fitted for the particular interaction. Fermi functions, which
also have two parameters to fit are also used as damping functions in some models.
Yet another damping function is that of Tang and Toennis [157]. They developed
87
a semi-universal damping function that has been shown to correctly describe the
functional form of the damping function for a number of interactions. This damp-
ing function has a single parameter, α, which describes the range of the overlap
of the interacting species and has the form
f 2ndamp(r) = 1 −
(2n∑
k=0
(αr)k
k!
)e−αr. (7.5)
Each term in the dispersion series given in equation (6.20) has an individual damp-
ing function. n = 3 in equation (7.5) corresponds to f6 which is the damping
function used for the dipole-dipole dispersion term C6/r6. The total damped dis-
persion series in Tang’s form is then
Edispdamped =
∞∑
n=3
f 2ndamp(r)
C2n
r2n. (7.6)
This damping function has a few positive qualities: its apparent universality, and
single parameter formulation. Tang and Toennies have shown that their damping
function does a very good job in reproducing the form of the damping function
as calculated by Koide et al. described above. Figure 7.1 shows Koide’s data set
for hydrogen as dots and the fitted damping function as a solid line for the first 8
terms in the dispersion series. Similar good fits are shown for He-He, Ar-Ar, Na-
K and Li-Hg based on the best available data for these interaction. Their paper
shows the shortcoming of the other discussed damping functions in describing these
interactions as compared to their own. This damping function will be adopted for
use in our formulation of the damped dispersion based on these arguments.
88
Figure 7.1: Plot of Tang’s damping function for b = 1.67 a.u. for n=3-10 as solidline, and Koide’s hydrogen dispersion energies as dots [93]
89
8Tight-binding plus dispersion
parametrization
This chapter details the marriage of the tight-binding model with a damped dis-
persion energy and our parametrization of the free parameters to model both the
inter and intra-planar properties graphite. The notation for the interlayer energy
per atom (E) defined in § 3.3 is used in this and following sections. A rhom-
bohedral supercell of four unit cells containing a total of sixteen atoms is used.
Three dimensional periodic boundary conditions are enforced to simulate bulk
graphite (periodic boundary conditions are discussed in Appendix A.2). We start
by defining the total energy in the model.
90
8.1 Total energy definition
The total energy for the system is defined in the same manner as was discussed
in § 4.2 and is repeated here:
Etot = ETB + Edisp. (8.1)
Where ETB is the tight-binding energy defined in equation (5.36) and also is
repeated here
ETB =∑
λ∈occ
Nk∑
i=1
wi fλ ελ(ki) +1
2
∑
i,j
Θ(rij). (8.2)
Edisp is the dispersion energy defined as
Edisp =1
2
∑
i,j
fdamp(rij) fcut(rij)C6
r6ij
. (8.3)
In this dispersion energy expression, fdamp(rij) is the damping function. We dis-
cussed in § 7.1 that we are using the Tang and Toennis damping function due to
its simplicity and seeming “semi-universality.” [157] Its form for the damping of
the C6 term (n=3 in equation (7.5)) is
fdamp(r) = 1 −(
6∑
k=0
(αr)k
k!
)e−αr. (8.4)
The term fcut(r) in equation 8.3 is a cutoff function. Cutoff functions are employed
to make the neighbor search routine more efficient, and improve the stability of
molecular dynamics simulations. Cutoff functions effectively define a radii beyond
which contributions are considered negligible and thus ignored. We have utilized
a Fermi function for the cutoff function within the defined cutoff distance Rcut,
and 0 otherwise.
91
fcut(r) =
1
exp( r−rc
rw) + 1
if r ≤ Rcut
0 if r > Rcut
(8.5)
The Fermi function is equal to unity and switches to zero with continuous deriva-
tives, and in our case the dispersion energy also has continuous derivatives. For
molecular dynamics having continuous derivatives of the energy at the cutoff
negates the possibility of spikes of high force when atoms enter each other’s do-
mains defined by the cutoff distance. This Fermi function is defined by two pa-
rameters; rc determines the distance at which the function is equal to 1/2, and
rw controls the smoothness of the cutoff. We simply choose a value of rw = 0.1
and find a corresponding rc such that the function is essentially zero at the cutoff,
i.e. f(Rcut) = 1 × 10−5. Figure 8.1 shows the cutoff functions for various values
of Rcut. The seeming arbitrariness in the definition of the cutoff function will be
addressed and rectified later in § 8.4.
0 5 10 15 200
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
r [A°]
f cut(
r )
Rcut
= 5
Rcut
= 10
Rcut
= 15
Figure 8.1: Fermi function for a number or Rcut’s with rw = 0.1
92
8.2 Evaluation of tight-binding energy
The tight-binding energy expression given in (8.2) requires the summation of en-
ergy over the Brillouin zone. The Brillouin zone was sampled with the special k-
points (ki) and weights (wi) of Monkhourst-Pack [120] as tabulated by [101]. The
density of k-points was increased to 370 to satisfy an energy convergence criteria
of ±0.1 meV. A summary of this convergence is shown in Table 8.1 for graphite
at its equilibrium interlayer spacing. This tight convergence is need because the
magnitude of the interlayer energy is on the order of 50 meV. § 8.3.2 discusses an
exception to the use of the Monkhourst-Pack points for certain calculations.
# k-points ETB
1 ( Γ) -46641.14328 -47341.229133 -47344.471370 -47344.801610 -47344.882
Table 8.1: Number of Monkhourst-Pack k-points and calculated energy[meV/atom]
Figure 8.2 shows a plot of the tight-binding interlayer energy per atom as a function
of interlayer spacing. As expected there is no minima due to the lack of dispersion
energy in the formalism of tight-binding and the resulting energy is of a purely
repulsive nature.
It should be noted that in the figure the interlayer energy effectively goes to
zero before Zeq. Though the tight-binding cut-off distance is around 6 A, there
is no interaction energy at this distance. This situation needs to be rectified
on the basis that in order for there to be a minima at Zeq, and thus a stable
graphite structure, there must be a non-zero tight-binding interlayer energy at Zeq.
This can be understood by a simple argument. Both energy terms monotonically
approach zero at large distances, but from different sides of zero. In order for a
minima to occur the first derivative of the total interlayer energy must be zero, and
consequently the sum of the first derivatives of the tight binding and dispersion
93
2.5 3 3.5−20
0
20
40
60
80
100
120
140
160
180
200
interlayer separation [A°]
inte
rlaye
r en
ergy
[meV
/ato
m]
At Zeq
, ETB < 1 × 10−4 meV/atom
Figure 8.2: Purely repulsive tight-binding interlayer energy
interlayer energy must sum to zero. The derivative of the attractive dispersion
term will always be positive, implying that there must be a negative-valued first
derivative of the tight-binding interlayer energy in order for the minima to occur.
Due to its decreasing monotonic nature the tight-binding energy must necessarily
be non-zero at the interlayer spacing at which the minima occurs, in order for its
derivative to be non-zero.
8.2.1 Orbital expansion
One may wonder why the DFT fitted tight-binding parametrization fails to pro-
duce a repulsive overlap energy at the equilibrium interlayer spacing while it nat-
urally must, in accordance to the previous argument. This result can be directly
traced to the artificial orbital compression described in § 5.8.1, done at the time in
the name of celerity. To undo this limitation on the proper modeling of graphite
interlayer properties one can simply inflate the compressed orbitals. This orbital
inflation is done in a simple manner. To inflate a particular pair of interacting
orbitals one can simply contract the distance in which they are separated when
94
evaluating the hopping integrals in the hopping sub-matrix. This is done by intro-
ducing a scaling parameter γ which is multiplied by the real internuclear distance
rij to get a reduced distance rredij = γrij which is used in the integral evaluation. In
order to preserve the in-plane properties of graphite (which are already properly
described with the unmodified TB code) we must introduce a numbering system
for the bodies interacting via dispersion forces. A layer of graphite, or later on,
a single tube is numbered and the distance scaling is only done between different
numbered systems so that in the case of infinite separation of the systems the
in-plane energetics are the same as with out the orbital stretching addition.
We have previously stressed the importance of the Pz orbital in the repulsive over-
lap interaction so it would make sense to only inflate this orbital when evaluating
hopping integrals between Pz orbitals on differing sheets. This works fine until we
bring our graphite dispersion model to use on the curved surfaces of carbon nan-
otubes. In this case there is no uniform z direction in which to inflate Pz orbitals
on differing nanotubes. Another formulation is to uniformly inflate all the orbitals
in the graphite formulation and fitting, thus bypassing this directional problem
when describing tube-tube interactions.
The introduction of this scaling parameter produces the desired repulsive energetic
effect, as is illustrated in Figure 8.3. With a decreasing γ we see the increased
repulsion and our necessary non-zero tight-binding interlayer energy at Zeq.
8.3 Parameter fitting procedure
At this point there are 3 free parameters, the scaling parameter γ , the dispersion
constant C6, and the damping function parameter α. This situation is similar to
the one with which Palser dealt with (§ 4.2). Two of the data points Palser used
in fitting have a relatively high uncertainty: the experimental interlayer cohesive
energy, and the DFT stacking difference between AAA and ABA. It seems that
a more logical choice in fitting these parameters would be to fit to the more well
95
2.5 3 3.5
0
50
100
150
200
250
300
interlayer separation [A°]
inte
rlaye
r en
ergy
[meV
/ato
m]
γ = 1.00γ = 0.95γ = 0.90γ = 0.85
Zeq
Figure 8.3: Tight-binding interlayer energy for different γ’s. Arrow shows trendof decreasing γ, and the associated orbital inflation
known E2g(1) shear mode frequency and the z-axis compressibility, in addition to
the equilibrium interlayer separation Zeq. These three data points constitute a
much firmer database for our fitting than Palser’s.
The fitting routine consist of the simultaneous functional minimization of the three
experimental points (Zeq, kz, Eex) minus those calculated in the tight-binding plus
dispersion code. This procedure is preformed via the hybrd.f routine provided
in the MINPACK library from Argonne National Laboratory [122]. This routine
is an implementation of Powell’s hybrid method, based on the conjugate gradient
search routine [138].
In order to proceed in computing the free parameters via the aforementioned
method it is necessary to construct functions to calculate the equilibrium spacing,
phonon frequency and the compressibility given a set of parameters γ, C6, α. The
next three sections address the computational details of attaining these values.
96
8.3.1 Equilibrium interlayer spacing
The equilibrium interlayer spacing is attained by requiring
∂ETB
∂Z
∣∣∣∣∣Zeq
+∂Edisp
∂Z
∣∣∣∣∣Zeq
= 0, (8.6)
which is simply a statement that the repulsive tight-binding interlayer force must
balance the attractive dispersion force at the equilibrium interlayer spacing. These
derivatives are evaluated through a finite central difference derivative evaluated
at a dilated interlayer spacing Zeq + δZ and a compressed spacing Zeq − δZ. For
the tight-binding term this is
∂ETB
∂Z
∣∣∣∣∣Zeq
=ETB(Zeq + δZ) − ETB(Zeq − δZ)
2 δZ. (8.7)
The same procedure is executed for the dispersion contribution in (8.6).
8.3.2 Phonon frequency calculation
The E2g(1) phonon frequency is evaluated via the frozen phonon method, model-
ing the interlayer sliding as approximately harmonic about the equilibrium ABA
stacking [114]. Under this assumption, the natural frequency of the sliding layers
can be described by
2πfE2g(1) =
√Kexp
c−c/mc. (8.8)
Here fE2g(1) is the frequency which corresponds to the E2g(1) mode (1.26 THz),
Kexpc−c is the experimental effective spring constant per atom along the carbon-
carbon bond length, and mc is the mass of a single carbon atom (12 amu =
1.9926 × 10−26 Kg). Solving for the effective spring constant in the units of our
code 1
97
Kexpc−c = 7.795 × 10−2 (eV/A
2)/atom. (8.9)
This effective one atom spring constant can be calculated for graphite in our tight-
binding code using the relation of a harmonic oscillator.
Etot(dc−c) =1
2Kc−c d
2c−c. (8.10)
Etot(dc−c) in this case represents the energy per atom in ABA graphite under
uniform translation (dcc) of every other layer in one of the carbon-carbon bond di-
rections. This motion corresponds to the relative translation of layers in the E2g(1)
mode described in § 3.5. The energy is evaluated at Zeq and dc−c is the magnitude
of the interlayer sliding (dc−c = 0 corresponds to a perfect ABA stacking).
Etot(dc−c) is evaluated at three points (dc−c = 0,±δ dc−c) and fitted to a quadratic
form via a least squares method 2.
Etot(dc−c) = a d2c−c. (8.11)
The value a is the leading coefficients of the quadratic polynomial. Equating the
right hand sides of this equation and equation (8.10) and canceling leads to
Kc−c = 2a, (8.12)
and the objective function to be minimized becomes
2a− Kexpc−c = 0. (8.13)
1This conversion is as follows: Kexp
c−c = (2π 1.26 × 1012 s−1)2 · 1.9926 × 10−26 Kg/atom =
1.249 (Kg/s2)/atom = 7.795 × 10−2 (eV/A2
)/atom. The conversion 1 eV/A2
= 16.022 Kg/s2
is employed.
98
Where the value of Kexpc−c is that reported above in (8.9).
8.3.3 Compressibility calculation
The z-axis compressibility was discussed in § 3.4, and is defined by [131]
kTB+dispz =
Ao
Zeq
d
2Etot(Z)
dZ2
∣∣∣∣∣Zeq
−1
. (8.14)
The second derivative of the interlayer energy is with respect to the interlayer
separation evaluated at the equilibrium ABA spacing. Ao is the area per atom in
the graphene sheet (Ao =√
3 a2/4 = 2.62 A2/atom).
This second derivative was calculated with the second order central difference
approximation [65]
d2Etot(Z)
dZ2
∣∣∣∣∣Zeq
=Etot(Zeq + δZ) − 2Etot(Zeq) + Etot(Zeq − δZ)
δZ2. (8.15)
Again δZ is an infinitesimal increment. The objective function is simply
kTB+dispz − kexp
z = 0. (8.16)
Where the experimental compressibility mentioned earlier in § 3.4 is kexpz = 2.74×
10−12 cm2 dyne−1
2A subtle point was discovered in evaluating these phonon frequencies by this method. Thereseemed to be some discrepancies in the symmetry of Etot(dc−c) about the ABA stacking. Specif-ically it was not found to be harmonic as expected. A closer look revealed that the Monkhorst-Pack special k-points are generated under the consideration of certain group theoretical con-siderations that are broken as the perfect hexagonal unit cell is disturbed. This problem wasremedied by population of the Brillouin zone with a sufficient number of k-points (15 × 15 × 3)
to attain the convergence mentioned previously. With this treatment Etot(dc−c) about the ABAstacking showed the expected harmonic behavior, and this brute force population is used in allcalculations of the E2g(1) mode.
99
8.4 Optimization results
Functions were created to evaluate the objective equations given in (8.6), (8.13)
and (8.16), detailed in the previous three subsections. These three objective func-
tions are utilized by the previously discussed hybrd.f routine in the parameter
optimization. There is a fourth semi-free parameter, the cutoff distance Rcut for
the dispersion energy. This parameter was found iteratively and effectively can-
celed the need for a cutoff function, as will be shown. The logic behind picking this
distance is based on a few ideas. Since we are assuming pairwise addivity in this
model the long range interactions are collectively contributing to the total energy,
but the contribution to the total energy drops off quickly as 1/r6. My logic was
to apply the parameter optimization procedure with progressively larger cutoff
distances until the parameters converged to steady values. This was done, and at
a cutoff distance of 12.25 A the three free parameters are changing less by than
1%. Table 8.2 shows a summary of the optimized parameters. Note that for the
optimized parameter values, the dispersion energy contribution at Rcut = 12.25 A
without the Fermi function cutoff, is 7.3 × 10−6 meV, which is quite small. Even
though this energy contribution without the Fermi function is small, the Fermi
function is retained to remain consistent and completely zero out any force spikes
while preforming MD.
C6 α γ(eV A6) (A−1)
24.8 1.45 0.905
Table 8.2: Optimized parameters for the tight-binding plus dispersion model witha cutoff of 12.25 A
The parameters shown in Table 8.2 result in an essentially zero error fit to the
database values (all less than 0.2% difference). The fitted TB+disp interlayer
energy is plotted and compared to the Girifalco empirical model in Figure 8.4.
The energy curves look similar except for the scale and curvature around Zeq.
Figure 8.5 shows more definitively the difference between the two models. In
100
this figure the interlayer energy as a function of layer separation is shown for
both the ABA and AAA sacking of the TB+disp model and Girifalco’s empirical
model. Note the small difference between these curves for the empirical model as
compared to the TB+disp model. Finally Figure 8.6 shows the energy landscape
of graphite, at Zeq, under a shear transition from an ABA stacking (the minima’s)
to AAA stacking (the maximum points). This landscape shows the corrugated
nature of graphite that was discussed earlier and expressed well in Figure 3.3.
The comprable landscape for the LJ models is much flatter as noted in § 4.1.
3 3.5 4 4.5 5−55
−50
−45
−40
−35
−30
−25
−20
−15
−10
Interlayer separation [A°]
Inte
rlaye
r en
ergy
[meV
/ato
m]
TBD − ABAgiri − ABA
Figure 8.4: Fitted tight binding plus dispersion interlayer energy for both ABAand AAA stacking plotted along with Girifalco’s empirical model
8.5 Discussion
It is necessary to discuss these results to make sure that the values of the pa-
rameters found are in accordance to what is known. In this fitting procedure the
database consisted of three experimental data points that are all found at the
equilibrium spacing. At this point we have both repulsion and attraction between
the layers. Shortly beyond the equilibrium distance there is practically no repul-
sion and only dispersive attraction. The database had no data points to constrain
the magnitude of C6, which dictates the strength of interactions beyond the equi-
101
3 3.1 3.2 3.3 3.4 3.5 3.6 3.7−55
−50
−45
−40
−35
−30
−25
−20
−15
Interlayer separation [A°]
Inte
rlaye
r en
ergy
[meV
/ato
m]
giri − AAAgiri − ABATBD − AAATBD − ABA
Figure 8.5: Tight-binding interlayer energy for AAA and ABA stacking, comparedto Girifalco’s LJ model
−2
−1
0
1
2
−2 −1.5 −1 −0.5 0 0.5 1 1.5 2
−50
−45
−40
−35
−30
−25
inte
ract
ion
ener
gy [m
eV/a
tom
]
Y [A°]
X [A°]
ABA
AAA
(a)
x
y
(b)
Figure 8.6: (a) Graphite energy landscape (b) ABA graphite showing correspond-ing X,Y orientation
102
librium spacing. Naturally it is good to question the magnitude of C6 with what
was discussed in Chapter 6 regarding the calculation of C6 for carbon in graphene.
Our value of 24.8 eV A6
falls between the minimum calculated value of 10.71
eV A6
via the Lifshitz-Hamaker method and maximum value of 30.89 eV A6
via
the Slater-Kirkwood method. It is unfortunate that the methods for calculating
C6 show such a range. As it is we can say our value is in the realm of possibility
defined by these methods and is larger than the Girifalco and Ulbricht Lennard-
Jones models. The parameter α in the damping function is commensurate with
what was mentioned earlier in the section on damping functions. As mentioned
there, α is a measure of the range of the overlap, with larger atoms having smaller
values of α. Our value follows this trend with Tang’s reported α for the hydrogen
interaction (smaller than sp2 carbon) of 3.14 A−1 compared to our value for sp2
carbon of 1.45 A−1. The last parameter to consider is the stretch parameter γ.
Our fitting calls for an approximate distance contraction of 10% (γ = 0.905). This
distance contraction is not absurdly large, but it is hard to conclusively say what
it should be without re-parameterizing the tight-binding parameters without the
artificial orbital contraction. With all three of the fitted parameters within agree-
able boundaries it is interesting to note the models predicted exfoliation energy.
The calculated value of Exf is 49.7 meV/atom. This value is right in the range
of the experimental values (∼45-55 meV/atom) discussed in § 3.3.1. Another pre-
diction that can be considered is the energy difference between ABA and AAA
stacking. Our TB+disp model reports a value of 22.3 meV/atom. This value is in
decent accordance with the DFT value of 15 and 17 meV/atom reported earlier
in § 3.6.2. The fact that it is larger than the DFT values is not worrisome since
the current local density approximation based DFT functions capture some, but
not all, of the dispersion energy.
Another test on our model to which there is experimental data to compare is the
shift of the E2g(1) mode under external pressure. Applying a hydrostatic pressure
to graphite causes the layers to come together and there is an associated increase
103
in this phonon frequency. We have calculated the frequency for discrete values
of interlayer spacing and plotted them in Figure 8.7 as the circled line. This is
a semi-static frozen phonon calculation done in the same manner as described
in equation (8.13). Experimental data on this phonon frequency and interlayer
spacing for the 0 to 14 GPa range has been gathered and fitted by Hanfland et al.
[62]. The dashed line in Figure 8.7 is the representation of this experimental work
and is expressed by
ω(z) = ωo
[δoδ′βo
β ′
[(z
zeq
)−β′
− 1
]+ 1
]δ′
. (8.17)
ωo and zeq are the STP values of the E2g(1) phonon frequency, and equilibrium
spacing (1.26 THz, and 3.35 A respectively). The other four values are fitted to
the experimental measurement of graphite under pressure from [62] (β ′ = 10.89,
βo = 35.7 GPa, δo = 0.110 GPa−1, δ′ = 0.43). The top abscissa in Figure 8.7,
shows the associated pressure (with zero referencing STP) to attain the interlayer
spacing shown on the bottom abscissa. Our curve, while not falling on top of the
experimental data, shows the expected trend. A difference is expected since our
TB method doesn’t calculate the electronic charge density in a self consistent way,
an issue that becomes important as the layers are squeezed together.
We have implemented our model into a molecular dynamics environment and ap-
plied it to some simple systems. The tight-binding code had an existing molecular
dynamics subroutine. The force contribution from the vdW term was added to
the existing code. Details on the MD method and the vdW force term are given
in Appendix A. Our test case was two graphene sheets with periodic boundary
conditions in only the in-plane directions. We want to check if the predicted equi-
librium spacing of the graphene layers is mirrored in the MD simulation. The 0
K static interlayer energy scans for both the AAA and ABA system are shown
as the dashed and solid black lines in Figure 8.8. Their equilibrium interlayer
spacings are approximately 3.4 and 3.6 A respectively. This simulation was pre-
104
Figure 8.7: Hydrostatic pressure effects on the E2g(1) shear mode. STP is refer-enced as zero pressure. Dashed line is experimental data from [62], circled line isour TBD prediction
formed at 600 K for 1500 femtoseconds with a time step of 0.1 fs. We ran this MD
without the vdW contribution, and as expected, saw the layers repel each other.
Turning on the vdW force we ran the simulation again starting with the layers
at 3.7 A and in a slightly offset ABA stacking. The red line in Figure 8.8 plots
the average distance between the layers as a function of time, which is shown on
the right-hand ordinate. As expected the layers oscillate around the 0 K predicted
equilibrium. Measuring the average time between peaks we estimate the frequency
of this oscillation is 2.27 THz.
With this successful fitting of our TB+disp model for graphite and testing in a
molecular dynamics simulation we are now able to examine the interaction between
carbon nanotubes.
105
Figure 8.8: Static 0 K energy scans of two graphene layers in AAA and ABAorientations in black and the results of a molecular dynamics simulation showingthe oscillation around the equilibrium interlayer spacing in red
106
9TBD description of carbon nanotube
interactions
This chapter outlines the application of our optimized tight-binding plus disper-
sion model applied to carbon nanotubes interactions. In the introduction of this
work we discussed the motivation for modeling interactions of collections of nan-
otubes for various nanomechanical based applications. Here we start with the
modeling of CNTs on a graphene substrate, followed by calculations of axially
parallel CNTs taken two at a time, 7 tube bundles, multiwalled tubes, and finally
the case of a C60 outside and inside of a CNT. We discuss some generalized results
of these static energy calculations and fit a universal binding curve for graphitic
107
interactions. Finally we show the results of a dynamic simulation of two (5,5)
interaction nanotubes.
The motivation for focusing on the static energy calculations for sets of tubes is
to investigate their energetically favorable orientations and compare our results to
existing data in literature. Treating these tubes statically (i.e. by not relaxing) is
done for two reasons. First it has been suggested by empirical models and direct
measurement that adjacent tubes start flattening along their mutual faces when
their diameters are greater than 20 A [70, 145], here we will only consider tubes
below this diameter. Secondly, for purposes of comparison, most data available in
literature for tube-graphene, tube-tube cohesive energies adopts this assumption
that they do not significantly distort the tube structure. We will see later though
that there is a noticeable flattening due to van der Waals attraction even for the
smallest (5,5) tubes.
9.1 Tube-graphene interactions
There is an interest in using a graphite substrate to organize and separate CNTs
based on their chirality. Due to the symmetry and similarity of the graphene
plane and the CNT structure there is an energetically favorable orientation (lock-
in position) for tubes on a graphite surface that repeats every 60◦ as illustrated
in Figure 9.1 [20, 41, 95]. These lock-in positions are simply orientations that are
equivalent to ABA stacking of graphite. In a lock-in position a tube is bound
approximately 10% more per unit length which corresponds to tens of eV’s for
a tube only a few hundred nm in length [95]. Experiments of CNTs on HOPG
surfaces manipulated with AFM’s have shown that when in a lock-in position
they roll very easily akin to a rack and pinion gear system. If one end of a tube is
stopped by an impediment and the other end is pushed the tube pivots and shows
a sliding, stick and slip behavior rather than a rolling behavior [41, 42].
We consider the static single point energy scans of tubes on graphene in a lock-in
108
θ = 60◦
Graphite
CNT
Figure 9.1: Bird’s eye view of the various lock-in orientations on graphite for aCNT
position as a function of tube graphene separation. The tubes considered are (n,n)
type tubes of n = 5, 10, 12, and 20. The diameter of these tubes are 3.39, 6.78,
8.14, 13.56 A respectively. The tubes sections are created with the code provided
in [148], and periodic boundary conditions along the axis are implemented to
model infinitely long tubes. We use 3 unit cells for each tube resulting in tubes
with 60, 120, 144, and 240 atoms apiece. The graphene substrate is created to
satisfy proper boundary conditions and orientated in a lock-in position relative
to the tube. A k-point mesh of 7x2x2 shows a convergence of ±0.1 meV for the
smallest system and is used for all tube calculations here. Figure 9.2 shows the
tube graphene interaction energy per unit length of the tube for the various tubes
orientated in one of their lock-in positions vs the wall to wall separation (dww).
The general trend of increasing equilibrium spacing and higher binding energies
with larger tubes can be seen here. These results are summarized in the first four
rows in Table 9.1.
9.2 Tube-tube interactions
Our second consideration is axially parallel tube-tube interactions. Figure 9.3
shows how the tubes are aligned and the notation for rotation (θ) and offset (dz).
109
2.8 3 3.2 3.4 3.6−600
−550
−500
−450
−400
−350
−300
−250
−200
−150
−100
dww
[A°]
inte
ract
ion
ener
gy [m
eV/A
° ]
(5,5)(10,10)(12,12)(20,20)
Figure 9.2: Interaction energy scans of tubes on graphite orientated in a lock-inposition
We are interested in finding the minima on the energy surface defined by θ, dz,
and dww and the absolute value of the cohesive energy at this position as given by
our tight-binding plus dispersion model.
Two tubes are simulated at a time and their initial configuration of θ, and dz are
arbitrary according to the details of the code that generates their coordinates. The
energy landscape defined by θ and dz at a fixed dww is analogous to the graphite
interlayer energy landscape in x and y at fixed z shown earlier in Figure 8.6.
Figure 9.4 shows this θ, dz landscape for two (10,10) tubes separated by 3.1 A.
One can see the analogous ABA and AAA-like stacking (for tubes we will refer to
this as ab and aa orientations). The absolute corrugation of this landscape is ∼
23 meV/A.
For computational efficiency, a minima in the dz direction is found first with the
empirical model of Girifalco. To do this an energy scan of θ and dww is performed
at discrete steps of dz. The dz which provides the deepest minima in the θ, dww
landscape is used in re-scanning the θ, dww landscape with our tight-binding plus
110
dww
dz
θ
Figure 9.3: (12,12) and (5,5) tubes showing adopted orientational notation. θ isa relative twist angle, dww is the wall to wall tube separation, and dz is a verticaloffset distance
0 5 10 15 20 25 30 350
2
4
−225
−220
−215
−210
−205
−200
Ene
rgy
per
unit
leng
th [m
eV/A
° ]
θ [degrees]
dz [A°]
Figure 9.4: Tube-tube θ, dz landscape for two (10,10) tubes at a wall to walldistance of dww = 3.1 A
111
dispersion model.
For all the θ, dww landscapes calculated the dww scans corresponding to the most
favorable and disfavorable θ’s are taken and fit via least squares to a modified
Lennard-Jones curve (see equation (9.2)). Using these fits, a bisection method is
used to calculate the equilibrium spacing (dabww and daa
ww) through the analytical
derivative of the fit. The cohesive energies are reported in energy per unit length
(Eab, Eaa) and are evaluated with the fitted curves at the minima’s. Table 9.1
reports the results of all the tube combinations of the 4 different tubes studied.
The energy scans (for Eab and dabww) as a function of wall separation around the
equilibrium spacing for each tube-tube combination are shown in Figure 9.5.
Eab Eaa Eaa −Eab dabww daa
ww
Tubes meV/A meV/A meV/A A A(5,5)‖graphene -284.44 . . . . . . 3.00 . . .(10,10)‖graphene -375.12 . . . . . . 3.04 . . .(12,12)‖graphene -393.69 . . . . . . 3.05 . . .(20,20)‖graphene -495.31 . . . . . . 3.10 . . .(5,5)‖(5,5) -168.16 -157.03 11.12 3.02 3.10(5,5)‖(10,10) -187.83 -177.17 10.65 3.07 3.14(5,5)‖(12,12) -195.49 -182.49 13.00 3.07 3.16(5,5)‖(20,20) -208.95 -197.28 11.67 3.10 3.18(10,10)‖(10,10) -220.92 -205.24 15.68 3.09 3.20(10,10)‖(12,12) -230.61 -214.13 16.48 3.10 3.21(10,10)‖(20,20) -256.30 -240.98 15.32 3.11 3.20(12,12)‖(12,12) -241.09 -224.07 17.02 3.10 3.22(12,12)‖(20,20) -270.99 -254.34 16.65 3.11 3.21(20,20)‖(20,20) -312.34 -292.96 19.38 3.12 3.22
Table 9.1: Tube-graphene and tube-tube interactions summary
The tube-tube minimum equilibrium separation follows a trend of increasing dabww
with tube radius. When plotted vs the sum of the curvatures defined as τ =
1/r1 +1/r2, with r1 and r2 being the radii of the two interacting tubes, it is nearly
linear with a fitted slope of ∼ −0.22 A2. The minimum cohesive energy per unit
length (Eab) plotted against the sum of the curvatures follows a nice trend too.
Figure 9.6 shows the cohesive energies plotted against τ . The noticeable trend of
the minimum cohesive energy is fitted to a curve of the form
112
2.95 3 3.05 3.1 3.15 3.2 3.25 3.3 3.35−340
−320
−300
−280
−260
−240
−220
−200
−180
−160
wall − wall separation [A°]
inte
ract
ion
ener
gy [m
eV/A
° ]
5−55−105−1210−1010−1212−125−2010−2012−2020−20
Figure 9.5: Tube-tube interaction energy scans corresponding to most favorable θand dz configuration for the various (n,n) tube combinations. Labeling notationis (a-b) = (a,a)‖(b,b).
ϑ(τ) =a
τ b+ c. (9.1)
The optimized parameters are: a = −54.8468 meV/A2, b = 0.7357, and c =
−88.4136 meV/A, with an associated sum squared error (SSE) of 4.8 meV/A.
The form of this curve was chosen firstly to be simple and secondly to satisfy the
condition that as τ tends to zero (i.e. graphene-graphene) the cohesive energy per
unit length tends towards −∞.
Our tube-tube results compare well with similar TB+disp based models of Biro
and Kwon already discussed in § 4.2 [100, 156]. Both of these models have been
applied to the (10,10)‖(10,10) system. The optimized cohesive energy per unit
length for this system with the Biro model is 281.7 meV/atom with an equilibrium
spacing of 3.11 A, compared to our -220.92 meV/atom and equilibrium spacing of
3.09 A (this data is not available for the Kwon model). The corrugation of the
(10,10)‖(10,10) energy surface (Eaa −Eab) for the Biro and Kwon model are 27.1
and 16.6 meV/atom respectively compared to our 15.68 meV/atom.
113
0.1 0.2 0.3 0.4 0.5 0.6
−340
−320
−300
−280
−260
−240
−220
−200
−180
−160(5−5)
(5−10)(5−12)
(10−10)(10−12)
(12−12)
(5−20)
(10−20)
(12−20)
(20−20)
τ − sum of curvature [A° −1]
ϑ −
Coh
esiv
e en
ergy
per
uni
t len
gth
[meV
/A° ]
Figure 9.6: Eab from Table 9.1 plotted against the sum of the curvatures of theinteracting tubes
9.3 Universal binding curve
One can nicely summarize the last two sections on CNT-CNT and CNT-graphene
interactions into a universal binding energy relation (UBER) similar to the original
work of Rose et. al [143]. An UBER curve is a systematic rescaling of the energy vs.
distance curves in a manner which shows a single curve. This has been attempted
for graphitic systems by Girifalco [53]. Using our data set of tube-graphene and
tube-tube interaction energies we first normalized the well depths by dividing
the interaction energies by their cohesive energies (φ(d)/|φ(do)| where φ(d) is the
energy per atom at a wall to wall separation of d and do = dabww the equilibrium
separation). The distance scaling utilized in Girifalco’s analysis is (d−ρ)/(do −ρ)
where ρ is a system dependent scaling parameter. In Girifalco’s representation ρ
was an adjustable parameter that was tuned such that all interaction energy curves
fell on top of one another. In his formulation the value of ρ for two (10,10) tubes
is different than that for two graphene sheets, etc. This factor partially negates
the true universality of an UBER curve. We tried a few distance scaling methods
with the most promising being a definition of ρ as the sum of the curvatures (τ)
of the interacting bodies times a constant σ with units of A2 (ρ = στ). Figure 9.7
114
shows the result of this scaling formulation with σ = 1 A2
for all the tube-tube
and graphene-tube interactions curves already presented and plotted here as the
solid dots. This UBER curve was fitted to a modified LJ potential of the form
Φ(d) =A6
d6+B12
d12+ C. (9.2)
The fitted parameters are A6 = −0.7198, B12 = 0.3565, and C = −0.6366 with an
associated SSE of 2.24 × 10−4, and is shown as the solid line in Figure 9.7. This
universal binding curve is a nice summary for tube-tube-graphene interactions.
Using this fitted curve one may back out estimates of cohesive energies for other
systems that were not considered explicitly here. It should be noted that the
effects of tube flattening for d > 20 A were not considered, so this will only be
valid for the d = 0-20 A regime.
0.9 0.95 1 1.05 1.1 1.15−1.02
−1
−0.98
−0.96
−0.94
−0.92
−0.9
−0.88
−0.86
−0.84
(d−ρ)/(do − ρ)
φ(d)
/ φ
(do)
Figure 9.7: Carbon nanotube/tube/graphene semi universal interaction energyper unit length. ρ is the sum of the curvatures of the interacting bodies
115
9.4 Nanotube loops
For a comparison we have computed the ab stacking energy with Girifalco’s model.
We obtained a (10,10)‖(10,10) intertube cohesive energy of 1.83 eV/nm. At this
point It is interesting to consider the implications of a full classical approach in
comparison with the TB + vdW one. Consider for instance the case of a (10,10)
CNT ring with the smallest bending radius of Rbend = 0.03 m mentioned in the
introduction [113]. The ring configuration is stable because the bending energy
penalty Ubend = 0.5k(1/Rbend)2 is compensated by the van der Waals attractions.
The bending stiffness k of a single-walled CNT of radius R is k = CR3, where C
is the in-plane CNT stiffness [170]. For a (10,10) CNT, the TB model gives C =
423 J/m2 [69]. Taking R = 0.7 nm then k = 455 × 10−18 J nm. Next, from the
balance of resulting bending energy (Ubend = 1.58 eV/nm) and our computed vdW
attraction (i.e., 2.21 eV/nm given in Table 2) we conjuncture that the observed
CNT rings of 0.03 m in radius can be formed from (10,10) CNT of 661 nm in
length, with an overlap portion of 472 nm. On the other hand, a substantially
different estimate is obtained based on the widely used combination of classical
covalent-bonding [18] (leading to C = 236 J/m2, k = 254× 10−18 J nm, and Ubend
= 0.88 eV/nm) and Girifalco’s vdW treatment [53]. It follows that the smallest
observed rings can be formed from (10,10) CNTs of at least 365 nm in length with
an overlap portion of 174 nm.
9.5 Nanotube bundles
We have also performed energy scans on two seven-tube CNT bundles. The two
different bundles modeled were comprised of (10,10) and (12,12) tubes. As in
the tube-tube interactions, PBCs were employed to negate end effects and model
infinitely long tubes. A section of the bundle is shown in Figure 9.8 (a). The
tubes are packed in a hexagonal shape as shown in Figure 9.8 (b). For the (12,12)
bundle the tubes are rotated and shifted such that they are in their optimal ab
116
configurations with respect to one another, as was found above. This is possible
for the (12,12) bundle because (n,n) tubes have rotational symmetry every 2π/n
radians. Since 2π/12 is a multiple of π/3 (i.e. half the interior angle of a hexagon)
there is a possibility of an optimal (12,12) ab bundle. We have made an educated
guess at the configuration of such a bundle by using our optimized results from
the tube-tube minima’s found above. Figure 9.8 (b) illustrates the ab like config-
uration in which we orientate the tubes in. This possible optimal configuration
is not the case for the (10,10) bundle, which will remain partially frustrated. For
this (10,10) bundle the tubes are orientated such that they are in their optimal
ab configuration only along the horizontal axis as shown by the bold ab in Figure
9.8 (b).
The energy scans are performed by dilating the tube bundle and uniformly increas-
ing dww with the energy points plotted in Figure 9.9. The cohesive energy and
equilibrium wall-to-wall separation are summarized in Table 9.2. Included in this
table are the results of Biro et al. whom preformed similar calculations with their
tight-binding plus dispersion model [156]. The cohesive energy per unit length of a
bundle has not to our knowledge been extracted experimentally. Measurements on
bundles comprised of hundreds of SWCNTs of average radius 6.9 ±0.1 A (slightly
larger than (10,10)’s) have shown a wall-to-wall separation of 3.15 A [160]. This
observation, while on a slightly larger and less perfect structure, is very close to
our ideal 7 tube bundle calculations.
(10,10) (12,12)Separation Cohesive energy Separation Cohesive energy
Model A eV/A A eV/ATB+disp 3.11 -2.479 3.12 -2.802Biro [156] 3.12 -3.201 3.13 -3.500
Table 9.2: Comparison between models of a 7 tube hexagonal bundle of (10,10)and (12,12) tubes reporting equilibrium separation and cohesive energy per unitlength
117
(a)
ab
ab
ab
a ba
b
a
b
a
bb
a
ab ba
a b
a b dww
(b)
Figure 9.8: (a) Seven tube bundle of (12,12) tubes (b) Bundle cross section showingwall to wall distance (dww)
3 3.05 3.1 3.15 3.2 3.25 3.3 3.35 3.4−3
−2.9
−2.8
−2.7
−2.6
−2.5
−2.4
−2.3
−2.2
dww
[A°]
Inte
ract
ion
ener
gy
[eV
/A° ]
(10,10)
(12,12)
Figure 9.9: Energy scan on CNT bundles around their equilibrium position
118
9.6 Multiwalled tube
We have also applied our potential to the characterization of MWCNTs. The
motivation for applying our model is to look at some of the finer points related
to MWCNT bearings and oscillators discussed in the introduction. We have done
this for a (5,5) tube nested inside of a (10,10) tube radii of 3.89 A and 6.78
A respectively, with a wall-to-wall separation of 3.39 A. Figure 9.10 schematically
shows the (5,5)‖(10,10) MWCNT system. This figure also shows the relative
angular orientation θ and relative axial displacement dz which are the independent
variables used in plotting the energy landscapes shown for this system in Figure
9.11.
dww = 3.39 A θ
y
dz
z
x
Figure 9.10: Nested (5,5)‖(10,10) CNT pair. The dz direction referred to in Figures9.11 and 9.12 is along the tube axis labeled as z
One sees a relatively large barrier under rotation of 9.7 meV/A, and a much lower
energy barrier under relative translation of 0.268 meV/A. These barriers are easier
to see in Figure 9.12, which shows two slices of the surface plot. The barrier for
rotation is about 36 times that for sliding. These results are similar to Palser who
found rotational and sliding barriers of 7.19 meV/A and 2.07 meV/A respectively
[131], and Charlier who found barriers of 12.68 meV/A and 5.61 meV/A with a
119
DFT LDA approach [25]. Another first principles DFT LDA study of the rota-
tional energy barrier of this system has been reported as 7.75 meV/A [96]. Our
model shows more corrugation and a smaller sliding barrier than these models. In
spite of the larger contact area, the nested tube corrugation is smaller than when
they are placed side-by-side (Table 9.1). Such a reduction in the corrugation due
to the difference in radii between layers is in qualitative agreement with earlier
results [96, 131].
We have preformed similar energy scans for a (10,0)‖(20,0) system which is of
comparable size to the (5,5)‖(10,10) system. As expected we see a similar energy
landscape but with the high and low energy barriers for θ and dz switched around.
Thus making (n,0) MWCNTs better candidates for rotational bearings and (n,n)
MWCNTs a superior translational bearing. MWCNT’s composed of chiral tubes
are expected to have screw like energy landscapes.
For our (5,5)‖(10,10) system we have also calculated a shear modulus of 0.62 GPa,
and a rigid shear mode frequency of 57 MHz which is the analogous shear mode
of E2g(1) = 1.26 THz discussed previously.
9.7 C60 and carbon nanotube interactions
We have not discussed the C60 molecule since the introduction but here we make
some calculations of its interaction energy within and outside of a carbon nan-
otube. We first do a static interaction energy scan of an infinitely long (10,10)
tube with a C60 molecule. The two are aligned to be in a ab like stacking configura-
tion as can be seen in the left hand side of Figure 9.13. The right hand side shows
the relative displacement of the two bodies. Figure 9.14 shows the interaction
energy as a function of wall-to-wall separation. The maximum cohesive energy
is -101 meV/atom at a separation of 2.99 A. For the same system with the C60
nested in the tube we calculate an interaction energy of -560 meV/atom. There
has been discussion of a small energy barrier at the entrance to an open CNT
120
05
1015
20
0
0.5
1
1.5
−1.068
−1.066
−1.064
−1.062
−1.06E
nerg
y pe
r un
it le
ngth
[eV
/A° ]
dz [A°]
θ [degrees]
Figure 9.11: Nested (5,5)‖(10,10) energy landscape with TB+disp model
Figure 9.12: Nested (5,5)‖(10,10) energy landscape with TB+disp model showingthe translational energy change as the circled line
121
[163], but it’s interesting to note that it is 5 times more favorable energetically for
a C60 molecule to be inside of a tube than on the outside. This is supported by
the direct observation of stable “peapod” structures. Peapods are a CNTs filled
with C60 molecules as shown in Figure 9.15 [56].
Figure 9.13: energy scan of a C60 molecule external to a (10,10) tube
9.8 Tube-tube MD simulation
We have nearly exhausted the possible static energy calculations to which we
can compare our results to that found in literature and so far have seen decent
agrement. Our next step is the molecular dynamic simulation of these systems.
For all the static calculations we preformed above we noted that tubes less than
20 A in diameter did not show flattening. We have reason to believe that this is
not true based on our model.
Our system is a pair of infinite aligned (5,5) tubes. We started the simulation with
them near their equilibrium separation of 3.02 A. We ran this system for 1 ps at
dt = 0.1 fs at 600 K. We expected to see a collective rigid body motion of the two
122
2.5 3 3.5 4−110
−100
−90
−80
−70
−60
−50
−40
−30
−20
dww
[A°]
inte
ract
ion
ener
gy [m
eV/a
tom
]
Figure 9.14: energy scan of a C60 molecule external to a (10,10) tube
Figure 9.15: Peapod micrograph. Scale bare = 2 nm
123
tubes slightly rotating and sliding into their equilibrium configuration. Instead of
this we saw a slight rotation and sliding but also a general flattening. Figure 9.16
(a) shows the before shot and (b) shows the two tubes after 1 ps. There is a slight
flattening along the tubes face.
(a) (b)
Figure 9.16: Two (5,5) tubes (a) shows configuration at t=0, (b) shows configu-ration after 1 ps.
124
10Conclusion and future work
We have undertaken the study of the model system of graphite in order to gain
insight to the nature of vdW attraction and repulsion between layers of sp2 bonded
carbon. With the data garnered from experimental insights and first principles
simulations we have developed a tight-binding plus dispersion model which ac-
curately describes both the intra and inter-layer properties of graphite. In this
pursuit we have considered the various physical models that describe dispersion
forces and used them to calculate the C6 dispersion constant for sp2 hybridized
carbon. Our model was extended to describe the interactions between carbon
nanotubes in a molecular dynamics environment. The model was compared to
literature values of equilibrium orientations and cohesive energies of nanotubes.
125
We have preformed some promising initial molecular dynamics studies on two
graphene layers and two small (5,5) tubes. The dynamics proved to be stable
and reproduced the equilibrium interlayer spacing as calculated from static en-
ergy scans. We have reached a point at which the study of the interworkings of
the many carbon nanotube based NEMs devices discussed in the introduction can
be studied further.
126
ATight-binding + dispersion molecular
dynamics overview
Molecular dynamics (MD) is a numerical technique to model the time evolution
of a many body system governed by Newton’s laws of motion
mid2 ~Ri
dt2= ~F i, (A.1)
mi is the mass of particle i, ~Ri is the position vector, and ~F i a vector describing
the forces on i. In atomistic systems the particles denoted by ~Ri are the nuclei.
The treatment of the nuclei in this manner is a consequence of the application of
127
the Born-Oppenheimer approximation which is explained in section Appendix B.7.
There are many flavors of molecular dynamics simulations. The most common MD
simulation consists of N particles in a volume V evolved in time while conserving
energy E, referred to as a NVE ensemble. This is the approach we use, though
there are many other ensembles that may be simulated with the MD technique.
The time integration algorithm used to follow the trajectory of the system is a
major part of a molecular dynamics simulation. Our code utilizes the velocity
Verlet method [164]. This method is symplectic, meaning energy conserving and
is based on the Taylor series expansion of the position and is expressed by two
updating steps of
R(t+ dt) = R(t) + V(t)dt+1
2dt2A(t), (A.2)
V(t+ dt) = V(t) + dt
(A(t) + A(t+ dt)
2
). (A.3)
Where R, V and A are the position, velocity and acceleration matrices respec-
tively (dimensions of 3 x number of atoms). The time step is denoted by dt. The
accelerations are attained through ~Ai = ~F i/mi. We will discuss the force calcula-
tions below, but now start with how to start a MD simulation and the subsequent
steps in running one.
A molecular dynamics simulation needs a few things to start, namely the initial
positions of the atoms to be simulated and initial velocities. The initial velocities
are assigned as a function of the desired temperature. This is done by assigning
random velocities with a Maxwell-Boltzmann distribution, and scaling the total
kinetic energy to reflect the desired temperature. The net momentum of the
system is also zeroed. After these setup steps are complete the program enters
the MD loop where forces are calculated and the system is evolved in time. These
128
two steps are looped for a desired time period and then the simulation is stopped.
These general steps are illustrated in a flow chart in Figure A.1.
Initial nuclear
Assign initial
Stop
yes
no: t = t + dt
Evolve coordinates with
Calculate the three forces
Electronic – Ionic – vdW
velocity Verlet algorithm
Is t > trun?
Post process:
thermodynamic averages
coordinates (Ro)
velocity fMaxwell(T )
Figure A.1: Molecular dynamics flowchart
A.1 Calculation of Forces
We have discussed the MD technique and have seen the need to calculate the
forces on the ions in order to evolve the system. Our system has three distinct
129
force contributions and we can start by defining the force on an ion i in the α
direction where α is one of the cartesian directions x, y, z.
F itot,α = −∂Etot
∂Riα
(A.4)
The forces defined by the tight-binding method are of two distinct types in ac-
cordance with the definition of the energy. Equation (5.33) showed us that the
total energy was the sum of the band structure energy and the repulsive energy,
and analogously the total force is simply the sum of the band structure force
and the repulsive force. We have added a van der Waals energy term which also
contributes to the total force. We can expand equation (A.4) to reflect this
F itot,α = −
(∂EBS
∂Riα
+∂Erep
∂Riα
+∂EvdW
∂Riα
)= F i
BS,α + F irep,α + F i
vdW,α (A.5)
The repulsive ionic and van der Waals forces can be calculated analytically and
are treated differently than the band structure forces; we will cover them first.
A.1.1 Ionic and van der Waal’s forces
Since the repulsive and vdW terms are two body potentials with a known form
the force between two ions is expressed simply as the negative derivatives of their
energy, we call these φrep and φvdW . Both of these forces are simply a function of
distance between two interacting atoms. The analytical negative derivative of our
vdW energy term (equation (??)) with our fitted values substituted and expanded
is shown in equation (8.3) on the last page of this Appendix. The ionic repulsive
force is the negative derivative of its its fitted chebyshev polynomial detailed in
[137].
After finding the force between two interacting bodies it is a simple manner to
decompose the force into the cartesian directions with use of the directional cosine
130
introduced in § 5.6.2.
F iα = dαφ(d) (A.6)
where d is the distance between the atom i and the neighbor considered and dα is
the directional cosine for direction α.
A.1.2 Tight-binding band structure forces
While the repulsive and vdW forces were quite easy to describe the band struc-
ture forces prove a little more difficult to conceptualize. The Hellmann-Feynman
theorem can be used to calculate the forces due to the electronic portion of the en-
ergy in a quantum mechanical manner [66, 44]. The Hellmann-Feynman theorem
posits that the change in energy of a quantum system with respect to a change of
a system dependant parameter P is given by [39].
dE
dP= 〈Ψ|∂H
∂P|Ψ〉 (A.7)
If the parameter P , is a nuclear coordinate and one utilizes the Born-Oppenheimer
approximation the force on a nucleus in the α direction is given by
F iα = −
∑
λ
∂ελ
∂Riα
= − ∂
∂Riα
〈Ψλ|H |Ψλ〉〈Ψλ|Ψλ〉 (A.8)
With our non-orthogonal tight-binding basis we must account for the dependance
of the overlap on the force and the term that represents the change in the expansion
coefficients with respect to the ionic coordinate, the latter term is referred to as
the Pulay force [140]. Appendix C shows the derivation of this force term and is
simply given here as
131
F iα = −
∑
λ
fλ
∑
p,q
cλ∗p cλq
[∂Hpq
∂Riα
− Eλ∂Spq
∂Riα
](A.9)
or equivalently described via the density (ρ) and energy (σ) matrices as
F iα = −Tr
[ρ∂Hpq
∂Riα
]+ Tr
[σ∂Spq
∂Riα
](A.10)
A.2 Periodic boundary conditions
It becomes useful computationally when simulating bulk materials to implement
periodic boundary conditions on a small cell of atoms. This concept can be vi-
sualized by considering a three dimensional cell defined by three lattice vectors
(~a1,~a2,~a3) that contains a set of atoms. When calculating forces or evaluating the
energy of the system an atoms local environment is not only those atoms in the
cell but also periodic images (rigid translations) of its own cell. For an atom at
vector position ~r it has images of itself at
~rimage = ~r + l~a1 +m~a2 + n~a3, (A.11)
where l,m, and n ∈ Z. Figure A.2 shows a two-dimensional example of PBCs for
a square box containing two entities.
132
Figure A.2: Schematic of periodic boundary conditions. Center solid box is theunit cell surrounded by 8 copies of itself in two dimensions. The circle sees notonly the triangle in its own box (long arrow) it also sees its images (short arrow).
133
φvdw(d) = −dEvdW
dd
−248(1−e−1.45 d−1.45 de−1.45 d−1.05 d2e−1.45 d−0.51 d3e−1.45 d−0.18 d4e−1.45 d−0.05d5e−1.45 d−0.013 d6e−1.45 d)e10 d−110.99
(e10.0 d−110.99+1)2
d6−
148.8 1−e−1.45 d−1.45 de−1.45 d−1.05 d2e−1.45 d−0.508 d3e−1.45 d−0.18 d4e−1.45 d−0.053 d5e−1.45 d−0.013 d6e−1.45 d
(e10 d−110.99+1)d7+
24.8 1.0×10−10 d3e−1.45 d−1.0×10−11 d5e−1.45 d+0.019 d6e−1.45 d
(e10.0 d−110.99+1)d6
134
“I don’t like it, and I’m sorry
I ever had anything to do with
it.”
Erwin Schrodinger, speaking
about quantum mechanics
BQuantum Mechanics Overview
The following is a very brief review of the relevant quantum mechanical postulates
that are needed as back ground to the work contained in this document.
The quantum mechanical wave function for a single particle, Ψ(x, y, z, t) or Ψ(~r, t),
describes the temporal and spatial evolution of a quantum mechanical particle.
The product of Ψ∗(~r, t) Ψ(~r, t) is the probability density function of a quantum-
mechanical particle (*denotes the complex conjugate). Ψ∗(~r, t) Ψ(~r, t)dτ is the
probability of finding a particle in the differential volume dτ = dxdydz. Therefore
∞∫
−∞
Ψ∗(~r, t) Ψ(~r, t)dτ = 1 (B.1)
135
if a wave function Ψ(~r, t) fulfills equation (B.1), then Ψ(~r, t) is called a normalized
wave function. Equation (B.1) is the normalization condition and implies the fact
that the particle must be located somewhere in 3 space. Another restriction on
wave functions is that they be single valued and continuous [116]. Wave function
are easily normalized by applying a normalization constant which is found by
An =
( ∞∫
−∞
Ψ∗(~r, t) Ψ(~r, t)dτ
)−1/2
(B.2)
and then the normalized wave function is simply
Ψn(~r, t) = AnΨ(~r, t) (B.3)
B.1 Observables and expectation values
For every observable dynamic variable there exists a Hermitian operator in quan-
tum mechanics. Table B.1 shows a few dynamical variables and their correspond-
ing quantum mechanical operators.
Observable Symbol Operatorr r rp p −i~∇E E i~ ∂
∂t
Table B.1: Position, momentum, and total energy and their corresponding oper-ator symbols and operators, where ∇ = ( ∂
∂x+ ∂
∂y+ ∂
∂z)
The expectation value, 〈ξ〉, of any dynamical variable ξ, is calculated from the
wave function according to
〈ξ〉 =
∫Ψ∗(~r, t)ξopΨ(~r, t)dτ∫Ψ∗(~r, t)Ψ(~r, t)dτ
(B.4)
where ξop is the operator of the dynamical variable ξ. The expectation value of a
136
dynamical variable is also referred to as an average value or ensemble average.
B.2 Dirac Notation
The integrals in equation (B.4) show up often in quantum mechanics and are given
a short hand notation.... and is denoted by the brackets 〈...〉. We have introduced
the Dirac bra-ket notation for representing the integration. Equation
〈ξ〉 =〈Ψ|ξop|Ψ〉〈Ψ|Ψ〉 (B.5)
B.3 Time-dependant Schrodinger equation
The wave function evolves in time according to the time-dependant Schrodinger
equation. The Schrodinger equation is not derivable from elementary principles,
rather it is stated as a postulate.
We can gain some insight into the time-dependant Schrodinger equation by con-
sidering the following pseudo-derivation. Starting with a classic single particle
system we describe the total energy as the sum of the kinetic and potential energy
E =p2
2m+ V (r) (B.6)
where p is the momentum, m is the particle’s mass and V (r) is the potential energy
as a function of position. If we replace the classic variables in equation (B.6) with
the corresponding operators from table B.1 operating on the wave function we
arrive at the single-particle time-dependant Schrodinger equation.
i~∂Ψ(~r, t)
∂t= HspΨ(~r, t) (B.7)
where Hsp is the single-particle hamiltonian
137
Hsp = − ~2
2m∇2 + V (~r) (B.8)
note that there is nothing in (B.7) that accounts for spin or relativistic effects.
B.4 Time-independent Schrodinger equation
If the potential V is independent of time, the total wave function can be separated
into temporal and spatial terms.
Ψ(~r, t) = ψ(~r)υ(t) (B.9)
plugging equation (B.9) into the time-dependant Schrodinger equation we get
ψ(~r)i~dυ(t)
dt= υ(t)Hspψ(~r) (B.10)
Moving time dependant terms to the left and spatial terms to the right
i~
υ(t)
dυ(t)
dt=
1
ψ(~r)Hspψ(~r) (B.11)
both sides must be equal, so the introduction of a separation constant, E, results
in two ordinary differential equations
1
υ(t)
dυ(t)
dt= −iE
~(B.12)
Hspψ(~r) = Eψ(~r) (B.13)
Equation (B.12) is the time dependant part which has a solution of
138
υ(t) = e−iEt/~ (B.14)
Now we can rewrite the total wave function as
Ψ(~r, t) = ψ(~r) e−i(E/~)t (B.15)
Equation (B.13) is the the time-independent Schrodinger equation. The time-
independent Schrodinger equation is in the form of an eigenvalue equation.
motivation for solving. Solving the Schrodinger equation for the wave function
of a general system of electrons and nuclei, leads the modeler to any property of
interest. This is of course great interest to the scientist and philosopher. But
actually solving the Schrodinger is a beast of a problem.
B.5 General Hamiltonian
So far we have been looking at a single particle problem with no reference to the
type of particle or the type of potential. In a condensed matter system we are
dealing with collections of many electrons and nuclei. In order to describe this
general system we must define the many particle hamiltonian, which in a simplified
form is
H = Tn(R) + Te(r) + VeN(r,R) + VNN (R) + Vee(r) (B.16)
where r = {~r1, ~r2, . . . , ~ri} are the electron coordinates, and R = {~R1, ~R2, . . . , ~RI}
are the nuclear coordinates.
Tn(R) = −~2∑I
∇2
I
2MIis the kinetic energy of the nuclei
Te(r) = − ~2
2me
∑i
∇2i is the kinetic energy of the electrons
139
VeN(r,R) = 12
∑Ii
ZIe2
RIiis the electron nuclei Coulomb attraction
VNN(R) = 12
∑I 6=J
ZIZJe2
RIJis the nuclei nuclei repulsion
Vee(r) = 12
∑i6=j
e2
Rijis the electron electron repulsion
The corresponding many-particle wave function for a general system is a function
of all coordinates.
Ψ = Ψ(~r1, ~r2, . . . , ~ri, ~R1, ~R2, . . . , ~RI) (B.17)
B.6 Complexity
Solving the Schrodinger equation for this system and describing the eigenstates of
this many body problem is enormously complex. The complexity of this problem
is somewhat elucidated by walking through the analytic solution for the simplest
atomic system, the hydrogen atom, see [116]. In fact this is the only atomic system
that can be treated analytically, moving up to helium, or even a hydrogen dimer
proves impossible to solve. In order to make these and other more complex systems
tractable within a quantum mechanical treatment, a series of approximations must
be applied, the two most important are the Born-Oppenheimer and one-electron
approximation.
B.7 Born-Oppenheimer approximation
The Born-Oppenheimer approximation underlies almost all atomistic models. Col-
loquially it is explained by considering the nuclei electron mass ratio (which is
approximately 1820 : 1). At any given time the nuclei are moving at a much
slower pace relative to the electrons. The electronic wave function is assumed to
instantaneously (adiabatically), adjust to any nuclear movement. This thinking
lead Born and Oppenheimer to posit a decoupling of electronic and ionic degrees
of freedom [14].
140
Ψ({ri}, {RI}) ≈nuclei︷ ︸︸ ︷
χ({Ri})electrons︷ ︸︸ ︷
ψ({ri}; {RI}) (B.18)
Here we see that the electronic wave function depends on the electronic coordinates
and parametrically on the nuclear coordinates, denoted by the semi-colon. The
electronic hamiltonian for the electronic wave function is
He = Te(r) + VeN(r;R) + Vee(r) (B.19)
The nuclear part of the total wavefunction χ({Ri}) is not typically solved via
the Schrodiger equation. Rather the nuclei are usually treated as classic particles
coulombically interacting with one another. This approximation is good if the de
Broglie wavelength (Λ) of the nuclei is much less then the average nearest neighbor
distance (a), where the de Broglie wavelength is given as
Λ =
√2π~2
MkBT(B.20)
where M is the nuclear mass, T is the temperature and kB is Boltzmann’s constant.
This approximation fails when one is interested in considering phenomena includ-
ing but not limited to: excited state transitions, vibronic coupling, electron-hole
pair excitations [87, 123, 46]
B.8 One-Electron Approximation
The one-electron approximation is attributed to Hartree, the idea is that instead
of a total electronic wave function ψ({ri}; {RI}), one can assign to each electron
an individual wave function and energy. The hamiltonian for the one-electron
wavefunctions of the three terms in (B.19), the first two depend on the coordinates
of only one electron.
141
H1−e = Te(r) + VeN(r;R) (B.21)
B.9 Variational Method
We have already stated how it is futile to try to solve the time-independent
Schrodinger equation beyond the hydrogen atom. Solving the TISE for an ex-
tended system can be done approximately via the variational method otherwise
known as the Rayleigh-Ritz method.
With respect to solving the TISE we will restate it here for the ground state of a
given system
Hψo = εoψo (B.22)
where ψo and εo are the exact ground state wave function and energy. Multiplying
both sides of equation (B.22) by ψ∗o then integrating over space and solving for
the ground state energy we get
εo =〈ψo|H|ψo〉〈ψo|ψo〉
(B.23)
According to the variational theorem if any trial wave function Ψ is substituted
into equation (B.23) in lieu of ψo, the energy expectation value will be greater
than that of the true wave function ψo [5].
εtrial ≥ εo (B.24)
The equality holds only if Ψ = ψo. The trial function Ψ may contain variational
parameters. A particularly intuitive and productive trial function consists of a
linear combination of atomic orbitals (LCAO). The trial function generally takes
142
the form
Ψ =n∑
i=1
ciφi (B.25)
Where n is the number of basis functions, the c’s are the n unknown expansion
coefficients and φi are atomic like orbitals. The goal is to minimize εtrial and thus
attain the best emulation of the true wave function for the assumed basis. This is
done by by the method of variations and leads to n simultaneous linear equations
of the form
∂εtrial
∂cn= 0 (B.26)
B.10 Extended example
By walking through an explicit example of this problem we will see the method-
ology and elucidate its transformation into a generalized eigenvalue problem. To
keep things simple we will consider the simplest non-trivial trial function. This
wave function has the form
Ψ = c1φ1 + c2φ2 (B.27)
Taking this trial wave function and plugging it into equation (B.23).
εtrial =
∫(c∗1φ
∗1 + c∗2φ
∗2)H(c1φ1 + c2φ2)dτ∫
(c∗1φ∗1 + c∗2φ
∗2)(c1φ1 + c2φ2)dτ
(B.28)
Expanding the numerator and denominator
εtrial =c∗1c1H11 + c∗1c2H12 + c∗2c1H21 + c∗2c2H22
c∗1c1S11 + c∗1c2S12 + c∗2c1S21 + c∗2c2S22(B.29)
143
Hij and Sij are referred to as Hamiltonian and overlap matrix elements respec-
tively, and are are defined as
Hij =
∫φ∗
i Hφjdτ = 〈φi|H|φj〉 (B.30a)
Sij =
∫φ∗
iφjdτ = 〈φi|φj〉 (B.30b)
We can again simplify equation B.29.
εtrial =
n∑i,j=1
c∗i cjHij
n∑i,j=1
c∗i cjSij
(B.31)
Now applying the method of variations.
∂εtrial
∂c∗i=
n∑j=1
cjHij
n∑j=1
c∗i cjSij
−
n∑j=1
c∗i cjHij
(n∑
j=1
c∗i cjSij
)2
n∑
j=1
cjSij = 0 (B.32)
Note that this leads to n equations and the sums is now over j only. Rearranging
and multiplying both sides byn∑
j=1
c∗i cjSij
n∑
j=1
cjHij =
n∑j=1
c∗i cjHij
n∑j=1
c∗i cjSij
n∑
j=1
cjSij (B.33)
Notice that the fraction on the right hand side of equation (B.33) is simply the
energy expectation value from equation (B.23).
n∑
j=1
cjHij = εtrial
n∑
j=1
cjSij (B.34)
144
At this point to put the equations in a more understandable form we can expand
them with our sample basis function. These n = 2 equations are
c1H11 + c2H12 = εtrial(c1S11 + c2S12) (B.35a)
c1H21 + c2H22 = εtrial(c1S21 + c2S22) (B.35b)
(we could show how the rearrangement of these equations leads to the secular
determinant, and then substituting in to find the a’s) Putting these equations in
matrix format shows the form of the generalized eigenvalue problem.
H11 H12
H21 H22
c1
c2
= εtrial
S11 S12
S21 S22
c1
c2
(B.36)
Hc = εSc (B.37)
Equation B.37 is simply a generalized eigenvalue problem. H and S are the Hamil-
tonian and overlap matrices, and c is a column vector of the expansion coefficients.
There will be as many eignevalues and eigenvectors as there are terms in the ex-
pansion n.
It is now clear how minimizing the energy function by the method of variations is
equivalent to solving the generalized eigenvalue problem.
145
CNon-orthogonal Hellmann-Feynman
forces
We use the Hellmann-Feynman theorem to calculate electronic forces on the nuclei
in our molecular dynamics code. Since we are working with a non-orthogonal basis
the derivation is a bit more convoluted than that for the orthogonal basis.
C.1 Model Review
We begin this derivation with a little restatement of the nuts and bolts of the
system we are working with. Since we are working with in the framework of the
variational principle we write our wave functions as linear combinations as
146
Ψλ =∑
p
cλpφp (C.1)
The wave-function Ψλ corresponds to state λ, the index p in our case goes over
number of electrons × size of basis. cλp is just the element of the eigenvector
corresponding to the λth state. Also keep in mind the non-orthogonality of our
basis and the definition of the overlap matrix elements.
〈φp|φq〉 = Spq (C.2)
After the diagonalization of the generalized eigenvalue problem we have all neces-
sary information to proceed with calculating the forces via the Hellmann-Feynman
theorem which follows.
C.2 Force derivation
The electron energy for state λ is simply
Eλ = 〈Ψλ|H|Ψλ〉 (C.3)
Using equation (C.1) to express the wave-function we can write Eλ as
Eλ =∑
p,q
cλ∗p cλqHpq (C.4)
Taking the partial derivative with respect to a cartesian direction α = (x, y, z),
for a particular atom i (Riα) we get
∂Eλ
Riα
=∂ 〈Ψλ|H|Ψλ〉
∂Riα
(C.5)
147
or in terms of the expansion coefficients
∂Eλ
Riα
=∑
p,q
[∂cλ∗p
∂Riα
Hpqcλq + cλ∗p
∂Hpq
∂Riα
cλq + cλ∗p Hpq
∂cλ∗q
∂Riα
](C.6)
Rearranging this in the form
=∑
p,q
cλ∗p
∂Hpq
∂Riα
cλq +∑
p
∂cλ∗p
∂Riα
∑
q
Hpqcλq +
∑
q
∂cλq∂Ri
α
∑
p
Hpqcλ∗p (C.7)
Shows that the two terms underlined are simply the left hand side of equation
(B.34) repeated here for convenience
∑
p
Hpqcλp = Eλ
∑
p
Spqcλp (C.8)
Switching these underlined terms with their counter parts just noted leaves us
with
∂Eλ
Riα
=∑
p,q
cλ∗p
∂Hpq
∂Riα
cλq +∑
p
∂cλ∗p
∂Riα
Eλ∑
q
Spqcλq +
∑
q
∂cλq∂Ri
α
Eλ∑
p
Spqcλ∗p (C.9)
Consider the last two terms on the right hand side. Moving the sums to the left
and isolating the expansion coefficients and their derivatives leaves
∂Eλ
Riα
=∑
p,q
cλ∗p
∂Hpq
∂Riα
cλq +∑
p,q
EλSpq
(∂cλ∗p
∂Riα
cλq +∂cλq∂Ri
α
cλ∗p
)
︸ ︷︷ ︸∂cλ∗
p cjq
∂Riα
(C.10)
As noted by the underbrace the term in the brackets is simply the partial derivative
of the product of the expansion coefficient and its complex conjugate with respect
to the ionic degree of freedom. Now putting the double sum out front we have a
148
clean form given as
∂Eλ
Riα
=∑
p,q
[cλ∗p
∂Hpq
∂Riα
cλq + EλSpq
∂cλ∗p cjq
∂Riα
](C.11)
The evaluation of this term is complicated by the fact of the dependance of
the expansion coefficients and the ionic coordinate. This would require a re-
diagnolization each time this force is computed. This is an expensive operation
and is to be avoided.
To make progress we go a little off course by looking at the partial derivative of
the overlap with respect to the ionic coordinate. First we restate the condition
that any two eigenvectors of a linear operator are orthogonal to one another. This
is succinctly expressed by
〈Ψi|Ψj〉 = δij (C.12)
Considering the non-trivial example (i.e. i=j), and using the definition of the
wave-function given in equation (C.1) we have
〈Ψλ|Ψλ〉 =∑
p,q
cλ∗p cλqSpq = 1 (C.13)
Now taking the partial derivative of the overlap with respect to the ionic coordinate
Riα, and not fully expanding the term in the expansion coefficients we have
∑
p,q
[∂cλ∗p c
λq
∂Riα
Spq + cλ∗p cλq
∂Spq
∂Riα
]= 0 (C.14)
Rearranging this we have
149
∑
p,q
Spq
∂cλ∗p cλq
∂Riα
= −∑
p,q
cλ∗p cλq
∂Spq
∂Riα
(C.15)
Taking a look back at equation (C.11) we see that we can replace the partial
over the expansion coefficients with the partial over the overlap matrix elements.
Making this substitution equation (C.11) is now written as
∂Eλ
Riα
=∑
p,q
[cλ∗p
∂Hpq
∂Riα
cλq −Eλcλ∗p cλq
∂Spq
∂Riα
](C.16)
or more compactly
∂Eλ
Riα
=∑
p,q
cλ∗p cλq
[∂Hpq
∂Riα
−Eλ∂Spq
∂Riα
](C.17)
These Hamiltonian and overlap derivatives are evaluated via a finite difference
method in our code. We can now express the force on atom i in the α direction
with the negative of the derivative of the energy as
F iα = −
∑
λ
fλ∂Eλ
Riα
= −∑
λ
fλ
∑
p,q
cλ∗p cλq
[∂Hpq
∂Riα
−Eλ∂Spq
∂Riα
](C.18)
Where as mentioned earlier fλ is the occupation of state λ. This result can be
equivalently and succinctly stated through the use of the density matrix which is
defined by
ρp,q =∑
λ
fλcλ∗p c
λq (C.19)
and the energy matrix σ defined by
σp,q =∑
Eλfλcλ∗p c
λq (C.20)
150
Equation (A.9) can now be stated as
F iα = −Tr
[ρ∂Hpq
∂Riα
]+ Tr
[σ∂Spq
∂Riα
](C.21)
where Tr is the trace of the matrix.
151
DMultipole Expansion
We seek a power series expansion of the Columbic energy between all charges in
one molecule and those in another. This expansion is known as the multipole
expansion and we present it here in the formalism of Margenau [112].
Molecule A has i charged particles with position ~ri and charge qi with a origin at
its nucleus, and Molecule B has j charged particles with position ~rj and charge qj
with a origin at its nucleus. The vector between the nuclei is ~R. Components of
the vectors ~R are X, Y and Z and for the charge coordinates the components of
~rj will be referenced as xj , yj and zj .
The total charge on molecules A and B is simply the sum of their charges.
152
+
−
−
+
−
−
−−
~r1~r1
~r2
~r2
~ri ~rj~R
A B
Figure D.1: Two separated charge clouds A and B
q =∑
i
qi (D.1)
q′
=∑
j
qj (D.2)
The prime denotes molecule 2 were as no prime is simply molecule 1. This is used
through out this derivation.
The dipole vector is defined as
p1 =∑
i
qi~ri (D.3)
A given component of the dipole vector, for example the z component is
p1z =∑
i
qizi (D.4)
With these definitions of the system we can start by first defining the Columbic
potential seen at the origin of molecule 2 due to the charges in molecule 1. This
potential can be written as
153
ϕ =∑
i
qi
|~R− ~ri|(D.5)
Expanding this in a multi-variable Taylor series in the components of ~ri about
zero, and assuming |~ri| ≪ |~R|, we arrive at
ϕ =1
r
∑
i
qi +1
r2
(X
r
∑
i
qixi +Y
r
∑
i
qiyi +Z
r
∑
i
qizi
)+
1
r3
[1
2
(3X2
r2− 1
)∑
i
qix2i +
1
2
(3Y 2
r2− 1
)∑
i
qiy2i +
1
2
(3Z2
r2− 1
)∑
i
qiz2i +
3XY
r2
∑
i
qixiyi +3XZ
r2
∑
i
qixizi +3Y Z
r2
∑
i
qiyizi
]+ . . . h.o.t′s (D.6)
r is the magnitude of the vector ~R.
Now we write the energy terms associated with each charge in molecule 2 in-
teracting with the potential developed in the previous equation. This is written
as
V =∑
j
qj ϕ(|~R+ ~rj|) (D.7)
expanding this in a Taylor series again
V =∑
j
qjϕ+∑
j
qj
(xj∂ϕ
∂X+ yj
∂ϕ
∂Y+ zj
∂ϕ
∂Z
)+
1
2!
∑
j
qj(x2j
∂2ϕ
∂X2+ . . .+ 2yizi
∂2ϕ
∂Y ∂Z) + . . . h.o.t′s (D.8)
This expression can be simplified considerably if one picks the internuclear axis
154
along one of the cartesian axis. We will do this and state that Z = r and X =
Y = 0. Equation (D.9) is the form and the terms are given in table D.1.
V =∞∑
n
Pn (D.9)
n Pn
1 r−1qq′
2 r−2(q′pz − qp′
z)3 r−3(q′w3 + qw
′
3)4 r−4(q′w4 + qw
′
4)5 r−5(q′w5 + qw
′
5)6 r−3
∑ij
qiqj(xixj + yiyj − 2zizj)
7 r−4∑ij
qiqj [r2i rz − z2
j rz + (2xixj + 2yiyj − 2zizj)(zi − zj)]
......
∞ ...
Table D.1: Tabulation of selected terms in the multipole expansion
The terms wn and w′
n, in terms 3-5 above are defined by
wn ≡∑
i
qirni Pn(cosθi) (D.10)
w′
n ≡∑
j
qjrnj Pn(cosθj) (D.11)
Where Pn is the Legendre polynomial of order n.
155
EPerturbation Theory
Here we show the derivation of the non-degenerate perturbation theory.
We start with a system that has a known full solution
H(0) |n〉 = E(0)n |n〉 (E.1)
The eigenstates |n〉 forms a complete orthonormal basis. The perturbation in this
derivation will be done for the ground state energy and wave functions n = 0, but
the result is true for any n.
If this system is slightly perturbed by for example an external field it is assumed
156
that the solution is only slightly different that the unperturbed solution.
We introduce λ which is....
H = λ0H(0) + λ1V (1) + λ2V (2) + . . . (E.2)
ψ0 = λ0ψ(0)0 + λ1ψ
(1)0 + λ2ψ
(2)0 + . . . (E.3)
E0 = λ0E(0)0 + λ1E
(1)0 + λ2E
(2)0 + . . . (E.4)
Now we use these definitions in the Schrodinger equation Hψ0 = E0ψ0, and group-
ing terms by in their contribution in λi we get
λ0(H(0)ψ
(0)0 −E
(0)0 ψ
(0)0
)+
λ1(H(0)ψ
(1)0 + V (1)ψ
(0)0 − E
(0)0 ψ
(1)0 − E
(1)0 ψ
(0)0
)+
λ1(V (1)ψ
(1)0 + V (2)ψ
(0)0 + H(0)ψ
(2)0 − E
(1)0 ψ
(1)0 −E
(0)0 ψ
(2)0 − E
(2)0 ψ
(0)0
)+ . . . = 0
(E.5)
Each term in this series has to be zero individually because of the arbitrariness of
the variable λi. This can be written as i equations, which are just each term in λi
equating to zero.
H(0)ψ(0)0 = E
(0)0 ψ
(0)0 (E.6)
(H(0) − E
(0)0
)ψ
(1)0 =
(E
(1)0 − V (1)
)ψ
(0)0 (E.7)
157
(H(0) − E
(0)0
)ψ
(2)0 =
(E
(2)0 − V (2)
)ψ
(0)0 +
(E
(1)0 − V (1)
)ψ
(1)0 (E.8)
E.1 First order correction to energy
We begin this by writing the perturbed wavefunction as a linear combination of
the unperturbed eigenvectors
ψ(1)0 =
∑
n
anψ(0)n =
∑
n
an |n〉 (E.9)
This is plugged into equation (E.7)
∑
n
an
(H(0) − E
(0)0
)|n〉 =
(E
(1)0 − V (1)
)|0〉 (E.10)
The left hand side can be expanded to
∑
n
anH(0) |n〉 −
∑
n
anE(0)0 |n〉 (E.11)
The first term is simply the left hand side of the the Schrodinger equation summed
over all states n and can be substituted with the right hand side of the Schrodinger
equation summed over all states
∑
n
anH(0) |n〉 =
∑
n
anE(0)n |n〉 (E.12)
Using this identity and substituting it into equation (E.10) we get
∑
n
an
(E(0)
n − E(0)0
)|n〉 =
(E
(1)0 − V (1)
)|0〉 (E.13)
multiplying both sides by the bra 〈0|
158
∑
n
an
(E(0)
n −E(0)0
)〈0|n〉 = E
(1)0 〈0|0〉 − 〈0|V (1)|0〉 (E.14)
Considering the left hand side first, we note that because of the orthonormality of
the set
〈a|b〉 = δab (E.15)
The only remaining non-zero term due to orthogonality in the sum is when n = 0,
but the difference of the eigenvalue E(0)0 with its self is of course zero, leaving the
whole left hand side zero. The right hand side of (E.14) is then rearranged and
we see that the first order correction to the energy is
E(1)0 = 〈0|V (1)|0〉 (E.16)
or in the matrix element shorthand.
E(1)0 = H
(1)00 (E.17)
This is simply the expectation value of the perturbation acting on the unperturbed
state |0〉.
E.2 First order correction to the wavefunction
We have in the previous section proposed the form for the corrected wavefunction
as a linear combination of the unperturbed state, see equation (E.9). We now
need to find a definition for the expansion coefficients an.
Starting with equation (E.13), we multiply both sides by the bra 〈k|.
159
∑
n
an 〈k|(E(0)
n −E(0)0
)|n〉 =
(E
(1)0 − V (1)
)|0〉 (E.18)
Again invoking the orthonormality constraint shown in equation (E.15), the only
surviving term of the sum in the previous equation is the kth
ak
(E
(0)k −E
(0)0
)= E
(1)0 〈k|0〉 − 〈k|V (1)|0〉 (E.19)
now considering the case when k 6= 0, the first term on the right hand side is zero
and we can solve the previous equation for ak.
ak =H
(1)k0
E(0)0 − E
(0)k
(E.20)
writing the
ψ(1)0 =
′∑
k
(H
(1)k0
E(0)0 − E
(0)k
)ψ
(0)0 (E.21)
The prime denotes the sum does not include zero.
E.3 Second order correction to the energy
following an analogous process for the first order correction to the energy we define
the second order correction to the wavefunction again as a linear combination of
the unperturbed state.
ψ(2)0 =
∑
n
bnψ(0)n =
∑
n
bn |n〉 (E.22)
This form is substituted into equation (E.8), which is repeated here
160
(H(0) − E
(0)0
)ψ
(2)0 =
(E
(2)0 − V (2)
)ψ
(0)0 +
(E
(1)0 − V (1)
)ψ
(1)0
∑
n
bn
(H(0) −E
(0)0
)|n〉 =
(E
(2)0 − V (2)
)|0〉 +
∑
n
an
(E
(1)0 − V (1)
)|n〉 (E.23)
multiplying through by the bra 〈0|
∑
n
bn
(H(0) −E
(0)0
)〈0|n〉 =
E(2)0 〈0|0〉 − 〈0|V (2)|0〉 +
∑
n
an 〈0|(E
(1)0 − V (1)
)|n〉 (E.24)
The left hand side is zero via the same argument used in the first order correction
to the energy. The first term on the right is simply the second order correction to
the energy because of orthonormality. The last term on the right hand side can
be simplified significantly. First lets take it apart by isolating the n = 0 term in
the sum and taking the sum over everything but zero.
∑
n
an 〈0|(E
(1)0 − V (1)
)|n〉 =
a0
(E
(1)0 − 〈0| V (1) |0〉
)
︸ ︷︷ ︸=0
+
′∑
n
anE(1)0 〈0|n〉
︸ ︷︷ ︸=0
−′∑
n
an 〈0| V (1) |n〉 (E.25)
With these simplifications we can rewrite equation (E.24) and solve for the second
order correction to the energy.
161
E(2)0 = 〈0| V (2) |0〉 +
′∑
n
an 〈0| V (1) |n〉 (E.26)
Using the the definition of the expansion coefficients derived earlier and stated in
equation (E.20) and the shorthand matrix notation equation (E.26) can be stated
as
E(2)0 = V
(2)00 +
′∑
n
V(1)n0
E(0)0 −E
(0)n
V(1)0n (E.27)
If the operator V is hermitian than V(1)n0 V
(1)0n = |V (1)
0n |2 and we can write our final
form of the second order correction to the energy as
E(2)0 = V
(2)00 +
′∑
n
|V (1)0n |2
E(0)0 − E
(0)n
(E.28)
162
FOptimization
A major part of this work was in the fitting of the van der Waals energy term. This
process utilized the hybrid method of Powell as implemented in the MINPACK
library from Argonne National Laboratory. Here we give the basic outline of how
this method works. First we start with the general steps to a minimization routine.
1. The vector (~xo) contains the initial guess of the solution
2. Compute a search direction (~Sk)
3. Compute length of step (αk) in search direction
4. Advance system to new point ~xk+1 = ~xk + αk~Sk
163
5. Check for convergence δΠδ~x< ǫ
6. If system is not converged increment k and got to step 2
In the steepest decent minimization algorithm the search direction is simply the
gradient of the function evaluated at the initial guess. This is followed by a line
minimization to find αk. This procedure guarantees that the gradient at the
new point ~xk+1 is conjugate to the previous search direction ~Sk. This makes this
method simple in that the search direction is known and preforming step 3 can be
done analytically with an assumed quadratic form. Yet this method is inefficient
and has very slow convergence. A more robust search algorithm is the conjugate
gradient search algorithm which is basically the same but finds its search directions
in another fashion.
We can understand this method through an example. Imagine that we have pre-
formed the first step of the steepest descent method and the new position is ~xk+1.
We will call the old search direction, ~Sk, ~u. We want to find a new search direction
~v such that the gradient ∇Π remains conjugate to ~u as we move along this new
direction ~v [139]. We can find this direction by satisfying this condition
0 = ~u · A · ~v (F.1)
Were A is the Hessian matrix of the function at the current solution vector ~xk.
This is defined by
[A]ij =δ2Π
δ~xiδ~xj
∣∣∣∣∣~x=~xk
. (F.2)
Computing the Hessian matrix can be prohibitive either from a computational
or time point of view so it is avoided. The new search direction ~v can be found
approximately through either the Fletcher-Reeves, or the Polak and Ribiere meth-
164
ods which both avoid computing the Hessian matrix. Another conjugate gradient
scheme is Powell’s method [45]. Starting with a set of directions ~ui initialized to
the basis vectors
~ui = ei i = 1 . . .N (F.3)
N is the dimension of the system. The recipe for this algorithm is the following
steps.
1. Save starting position as ~xo
2. For i = 1 to N move ~xi−1 to the minimum along direction ~ui and call this
point ~xi
3. For i = 1 to N − 1, set ~xi = ~xi+1
4. Set ~uN = ~xN − ~xo
5. Move ~xN to the minimum along direction ~uN and call this point ~xo
6. Repeat until converged
Powell proved that it takes N iterations of the above method and N(N + 1) total
line minimizations to exactly minimize a quadratic form.
165
Bibliography
[1] R. Ahlrichs, R. Penco, and G. Scoles, Intermolecuar forces in simple systems,
Chemical Physiscs 19 (1977), 119–130.
[2] R. Al-Jishi and G. Dresselhaus, Lattice-dynamical model for graphite, Phys-
ical Review B 26 (1982), no. 8, 4514–4522.
[3] M.P. Anantram and F Leonard, Physics of carbon nanotube electronic de-
vices, Rep. Prog. Phys. 69 (2006), 507–561.
[4] Neil W. Ashcroft and N. David Mermin, Solid state physics, Saunders Col-
lege Publishing, 1976.
[5] Peter Atkins and Ronald Freidman, Molecular quantum mechanics, Oxford
University Press, 2005.
[6] R.M Barrer, An analysis by adsorption of the surface structure of graphite,
Proceeding of the royal society of London A 161 (1937), 476–493.
[7] Y. Baskin and L. Meyer, Lattice constants of graphite at low temperatures,
Physical Review 100 (1955), no. 2, 544.
[8] V. Bayot, L. Piraux, J.P. Michenaud, and J.P. Issi, Two-dimensional weak lo-
calization in partially graphitic carbons, Physical Review B 41 (1990), no. 17,
11770–11779.
166
[9] L. Benedict, N. Chopra, M.L. Cohen, A. Zettl, S. Louie, and V Crespi, Mi-
croscopic determination of the interlayer binding energy in graphite, Chem-
ical Physics Letters 286 (1998), no. 5, 490–496.
[10] L. Bergstrom, Hamaker constants of inorganic materials, Advances in Col-
loid and Interface Science 70 (1997), 125–169.
[11] J.D. Bernal, Proceeding of the royal society of london series A 160 (1924),
749.
[12] D.S. Bethune, C.H. Klang, M.S. de Vries, G. Gorman, R. Savoy, J. Vazquez,
and R. Beyers, Cobalt-catalysed growth of carbon nanotubes with single-
atomic-layer walls, Nature 363 (1993), 605–607.
[13] O.L. Blakslee, D.G. Proctor, E.J. Seldin, G.B. Spence, and T. Weng, Elas-
tic constants of compression-annealed pyrolytic graphite, Journal of Applied
Physics 41 (1970), no. 8, 3373–3382.
[14] M. Born and R. Oppenheimer, Zur quantentheorie der molekeln, Ann. Phys.
84 (1927), 457.
[15] D. Bozovic, M. Bockrath, J.H. Hafner, C.M. Lieber, H. Park, and M. Tin-
kham, Plastic deformations in mechanically strained single-walled carbon
nanotubes, Physical Review B 67 (2003), no. 3, 033407.
[16] R.O. Brennan, The interlayer binding in graphite, Journal of Chemical
Physics 20 (1952), no. 1, 40–48.
[17] D.W. Brenner, O.A. Shenderova, J.A. Harrison, S.J. Stuart, B. Ni, and S.B
Sinnott, A second-generation reactive empirical bond order (rebo) potential
energy expression for hydrocarbons, Journal of Physics: condensed matter
14 (2002), no. 4, 783–802.
[18] D.W. Brenner, O.A. Shenderova, J.A. Harrison, S.J. Stuart, B. Ni, and S.B.
Sinnott, A second-generation reactive empirical bond order (rebo) potential
167
energy expression for hydrocarbons, Journal of Physics: condensed matter 4
(2004), 783–802.
[19] O. Breuer and U. Sundararaj, Big returns from small fibers: A review of
polymer/carbon nanotube composits, Polymer Composites 25 (2004), no. 6,
630–645.
[20] A. Buldum and Jian Ping Lu, Atomic scale sliding and rolling of carbon
nanotubes, Physical Review Letters 83 (1999), no. 24, 5050–5053.
[21] Dr. Christian Burger, Figure of rolled nanotube,
http://ms.cc.sunysb.edu/ cburger.
[22] W.D. Callister, Materials science and engineering an introduction, 6 ed.,
Wiley, 2003.
[23] J.C. Charleir, X. Gonze, and J.P. Michenauid, Graphite interplanar bonding:
electronic delocalization and van der waals interaction, Europhysics letters
28 (1994), no. 6, 403–408.
[24] J.-C. Charlier, X. Gonze, and J.-P. Michenaud, First-principles study of the
stacking effect on the electronic properties of graphite(s), Carbon 32 (1994),
no. 2, 289 – 299.
[25] J.-C. Charlier and J.-P. Michenaud, Energetics of multilayered carbon
tubules, Physical Review Letters 70 (1993), no. 12, 1858–1861.
[26] L. Chen, C.L. Cheung, P.D. Ashby, and C.M. Lieber, Single-walled car-
bon nanotube afm probes: Optimal imaging resolution of nanoclusters and
biomolecules in ambient and fluid environments, Nano Letters 4 (2004),
no. 9, 1725–1731.
[27] B. Chu, Molecular forces, Interscience publishers, 1967.
168
[28] J.F. Colomer, J.M. Benoit, C. Stephan, S. Lefrant, G. Van Tendeloo, and
J. B.Nagy, Characterization of single-wall carbon nanotubes produced by ccvd
method, Chemical Physics Letters 345 (2001), 11–17.
[29] J. Cumings and A. Zettl, Low-friction nanoscale linear bearing realized from
multiwall carbon nanotubes, Science 289 (2000), no. 5479, 602–604.
[30] R.R. Dagastine, D.C. Prieve, and L.R. White, Calculations of van der waals
forces in 2-dimensionally anisotropic materials and its application to carbon
black, Journal of Colloid and Interface Science 249 (2002), 78–83.
[31] D.P. DiVincenzo, E.J. Mele, and N.A.W, Density-functional study of inter-
planar binding in graphite, Physical Review B 27 (1983), no. 4, 2458–2469.
[32] T. Dumitrica and R. Allen, Femtosecond-scale response of gaas to ultrafast
laser pulses, Physical Review B 66 (2002), no. 081202, 1–4.
[33] T. Dumitrica, A. Burzo, Y. Dou, and R.E. Allen, Response of si and insb to
ultrafast laser pulses, Physica Status Solidi (B) 10 (2004), no. 2331.
[34] T. Dumitrica, M. Hua, and B.I. Yakobson, From the Cover: Symmetry-,
time-, and temperature-dependent strength of carbon nanotubes, PNAS 103
(2006), no. 16, 6105–6109.
[35] T. Dumitrica and R.D. James, (unpublished).
[36] I.E. Dzyaloshinskii, E.M. Lifshitz, and L.P. Pitaevskii, The general theory
of van der waals forces, Adv. Phys. 10 (1961), no. 38, 165–209.
[37] T.W. Ebbesen and P.M. Ajayan, Large-scale synthesis of carbon nanotubes,
Nature 358 (1992), 220–221.
[38] M. Elstner, P. Hobza, T. Frauenheim, S. Suhai, and E. Kaxiras, Hydro-
gen bonding and stacking interactions of nucleic acid base pairs: A density-
functionsl-theory based treatment, Journal of Chemical Physiscs 114 (2001),
no. 12, 5149–5155.
169
[39] S.T. Epstein, The force concept in chemistry, Van Nostrand Reinhold Com-
pany, 1981.
[40] H. Eschrig, Readily computable expressions for lcao two-centre integrals over
slater-type orbitals with arbitrarily high quantum numbers, Physica Status
Solidi (B): Basic Research 96 (1979), no. 1, 329 – 342.
[41] M. R. Falvo, J. Steele, R. M. Taylor, and R. Superfine, Gearlike rolling mo-
tion mediated by commensurate contact: carbon nanotubes on hopg, Phys-
ical Review B 62 (2000), no. 16, 10665–10667.
[42] M. R. Falvo, R. M. Taylor, A. Helser, V. Chi, F.P. Brooks, S. Washburn,
and R. Superfine, Nanometre-scale rolling and sliding of carbon nanotubes,
Nature 397 (1999), 236–238.
[43] M.R. Falvo, G.J. Clary, R.M. Taylor, V. Chi, F.P. Brooks, S. Washburn, and
R. Superfine, Bending and buckling of carbon nanotubes under large strain,
Nature 389 (1997), 582–584.
[44] R. P. Feynman, Forces in molecules, Physical Review 56 (1939), no. 340.
[45] R. Fletcher, Powell, and M.J. Davidson, A rapid convergent descent method
for minimization, Computer Journal 6 (1964), no. 2.
[46] Rene Franchy, Surface and bulk photochemistry of solids, Rep. Prog. Phys.
61 (1998), no. 41, 691753 (English).
[47] R.E. Franklin, The structure of graphitic carbons, Acta Crystallographica 4
(1951), 253–261.
[48] R.H. French, Origins and applications of london dispersion forces and
hamaker constants in ceramics, Journal of the American Ceramic Society
83 (2000), 2117–2146.
170
[49] M.E. Garcia, T. Dumitrica, and H.O. Jeschke, Laser induced coherent
phonons in graphite and carbon nanotubes: model and simulations, Applied
Physics A 79 (2004), 855–857.
[50] W. B. Gauster and I. J. Fritz, Pressure and temperature dependences of
the elastic constants of compression-annealed pyrolytic graphite, Journal of
Applied Physics 45 (1974), no. 8, 3309–3314.
[51] Maohui Ge and Klaus Sattler, Vapor-condensation generation and stm anal-
ysis of fullerene tubes, Science 260 (1993), no. 5107, 515 – 518.
[52] P. Giannozzi, R. Car, and G. Scoles, Oxygen adsorption on graphite and
nanotubes, Journal of Chemical Physics 118 (2003), no. 3, 1003–1006.
[53] L.A. Girifalco, M. Hodak, and R.S. Lee, Carbon nanotubes, buckyballs, ropes
and a universal graphitic potential, Physical Review B 62 (2000), no. 19,
13104–13110.
[54] L.A. Girifalco and R.A. Lad, Energy of cohesion, compressibility, and the po-
tential energy functions of the graphite system, Journal of Chemical Physics
25 (1956), 693.
[55] J.S. Graves, Electronic and structural response of semiconductors to ultra-
intense laser pulses, Ph.D. thesis, Texas A&M University, 1997.
[56] L. Guan, K. Suenaga, Z. Shi, Z. Gu, and S. Iijima, Direct imaging of the
alkali metal site in k-doped fullerene peapods, Physical Review Letters 94
(2005), no. 4.
[57] T. Guo, P. Nikoleav, A.G. Rinzler, D. Tomanek, D.T. Colbert, and R. Smal-
ley, Self-assembly of tubular fullerenes, Journal of Physical Chemistry 99
(1995), 10694–10697.
[58] J.H. Hafner, C.L. Cheung, and C.M. Lieber, Growth of nanotubes for probe
microscopy tips, Nature 398 (1999), 761–762.
171
[59] Thomas A. Halgren, Representation of van der waals (vdw) interactions in
molecular mechanics force fields. potential form, combination rules, and vdw
parameters, Journal of the American Chemical Society 114 (1992), no. 20,
7827 –.
[60] N . Hamada, S. Sawada, and A. Oshiyama, New one-dimensional conductors:
Graphitic microtubules, Physical Review Letters 68 (1992), no. 10, 1579–
1581.
[61] H.C. Hamaker, The london - van der waals attraction between spherical par-
ticles, Physica 4 (1937), no. 10, 1058–1072.
[62] M. Hanfland, H. Beister, and K. Syassen, Graphite under pressure: equation
of state and first-order raman modes, Physical Review B 39 (1989), no. 17,
12598–12603.
[63] M. Hasegawa and K. Nishidate, Semiempirical approach to the energetics
of interlayer binding in graphite, Physical Review B 70 (2004), no. 205431,
1–7.
[64] Martin Head-Gordan, Quantum chemisty and molecular process, Journal of
Physical Chemistry 100 (1996), 13213–13225.
[65] M.T. Heath, Scientific computing: An introductory survey, 2 ed., McGraw-
Hill, 2002.
[66] H. Hellmann, Einfuhrung in die quantumchemie, Ph.D. thesis, Deuticke,
Leipzig, 1937.
[67] S. Hembacher, F.J. Giessibl, J. Mannhart, and C.F. Quate, Revealing the
hidden atom in graphite by low temperature atomic force microscopy, Pro-
ceedings of the National Academy of Sciences 100 (2003), 12539–12542.
172
[68] J. Hepburn and G. Scoles, A simple but reliable method for the prediction of
intermolecuar potentials, Chemical Physiscs Letters 36 (1975), no. 4, 451–
456.
[69] E. Hernandez, C. Goze, P. Bernier, and A. Rubio, Elastic properties of c
and bxcynz composite nanotubes, Physical Review Letters 80 (1998), no. 20,
4502–4505.
[70] T. Hertel, R.E. Walkup, and P. Aouris, Deformation of carbon nanotubes by
surface van der waals forces, Physical Review B 58 (1998), 13870–13873.
[71] P. Hohenberg and W. Kohn, Inhomogeneous electron gas, Physical Review
136 (1964), no. 3B, 864–871.
[72] D. Hough and L.R. White, The calculation of hamaker constants from lifshitz
theory with applications to wetting phenomena, Advances in Colloid and
Interface Science 14 (1980), 3–41.
[73] M. Hua and T. Dumitrica, (unpublished).
[74] S. Iijima, Helical microtubules of graphitic carbon, Nature 354 (1991), 56–58.
[75] S. Iijima, P. M. Ajayan, and T. Ichihashi, Growth model for carbon nan-
otubes, Physical Review Letters 69 (1992), no. 21, 3100–3103.
[76] S. Iijima and T. Ichihashi, Single-shell carbon nanotubes of 1-nm diameter,
Nature 363 (1993), 603–605.
[77] Carbon Nanotechnologies Inc., Company website accessed 6-2-06,
www.cnanotech.com.
[78] Nantero Inc., Company website accessed 6-17-06, www.nantero.com.
[79] E.R. Johnson and A.D. Becke, A post-hartree-fock model of intermolecular
interactions, The Journal of Chemical Physics 123 (2005), no. 024101, 1–7.
173
[80] C. Journet, W. K. Maser, P. Bernier, A. Loiseau, M. Lamy de la Chapelle,
S. Lefrant, P. Deniard, R. Lee, and J.E. Fischer, Large-scale production
of single-walled carbon nanotubes by the electric-arc technique, Nature 388
(1997), 756–757.
[81] G. Scuseria K. Kudin and B. Yakobson, C2f, bn, and c nanoshell elasticity
from ab initio computations, Physical Review B 64 (2001), no. 235406.
[82] Y. Kang and M.S. Jhon, Additivity of atomic static polarizabilities and dis-
persion coefficients, theoretical chemistry accounts: Theory, computation,
and modeling, Theoretica Chimica Acta 61 (1982), no. 1, 41 – 48.
[83] B.T. Kelly, Physics of graphite, Applied Science Publishers, 1981.
[84] C.H. Kiang, M. Endo, P.M. Ajayan, G. Dresselhaus, and M.S. Dresselhaus,
Size effects in carbon nanotubes, Phys. Rev. Lett. 81 (1998), no. 9, 1869–
1872.
[85] P. Kim and C.M. Lieber, Nanotube nanotweezers, Science 286 (1999),
no. 5447, 2148–2150.
[86] J. M. Kinaret, T. Nord, and S. Viefers, A carbon-nanotube-based nanorelay,
Applied Physics Letters 82 (2003), no. 8, 1287–1289.
[87] J. T. Kindt, J. C. Tully, M. Head-Gordon, and M. A. Gomez, Electron-
hole pair contributions to scattering, sticking, and surface diffusion: CO on
Cu(100), Journal of Chemical Physics 109 (1998), 3629–3636.
[88] J.G. Kirkwood, Z. Phys. 44 (1937), 57–60.
[89] C. Kittel, Introduction to solid state physics, Wiley, 1995.
[90] W. Kohn, A.D. Becke, and R.G. Parr, Density functional theory of electronic
structure, Journal of Physical Chemistry 100 (1996), 12974–12980.
174
[91] W. Kohn, Y. Meir, and D. Makarov, Van der waals energies in density
functional theory, Physical Review Letters 80 (1998), no. 19, 4153–4156.
[92] W. Kohn and L.J. Sham, Self-consistent equations including exchange and
correlation effects, Physical Review 140 (1965), no. 4A, 1133–1138.
[93] A. Koide, W.J. Meath, and A.R. Allnatt, Second order charge overlap ef-
fects and damping functions for isotropic atomic and molecular interactions,
Chemical Physics 58 (1981), 105–119.
[94] A.N. Kolmogorov and V.H. Crespi, Smoothest bearings: Interlayer sliding
in multiwalled carbon nanotubes, Physical Review Letters 85 (2000), no. 22,
4727–4730.
[95] , Nanotube-substrate interactions: distinguishing carbon nanotubes by
the helical angle, Physical Review Letters 92 (2004), no. 8.
[96] , Registry-dependant interlayer potential for graphitic systems, Phys-
ical Review B 71 (2005), no. 235415, 1–6.
[97] H.L. Kramer and D.R. Herschbach, Combination rules for van der waals
force constants, The Journal of Chemical Physics 53 (1970), no. 7, 2792–
2800.
[98] G. Kraus, Progress report no. 2, Army air force contract af33(610) 231,
University of Cincinnati, 1953.
[99] H.W. Kroto, J.R. Heath, S.C. O’Brien, R.F. Curl, and R.E. Smalley, c60:
Buckminsterfullerene, Nature 318 (1985), 162–163.
[100] Y.K. Kwon and D. Tomanek, Orientational melting in carbon nanotube
ropes, Physical Review Letters 84 (2000), no. 7, 1483–1486.
[101] Naval Research Laboratory, Pre defined k-point sets for
crystalline calculations: hexagonal lattices, http://cst-
www.nrl.navy.mil/bind/kpts/hex/index.html.
175
[102] Dieter Langbein, Theory of van der waals attraction, Springer Tracts in
Mondern Physics, vol. 72, Springer-Verlag, 1974.
[103] C. D. Latham, M. Haugk, R. Jones, T. Frauenheim, and P. R. Briddon,
Density-functional calculations of carbon diffusion in gaas, Physical Review
B 60 (1999), no. 15117.
[104] S.B. Legoas, V.R. Coluci, S.F. Braga, P.Z. Coura, S.O. Dantas, and D.S
Galvao, Gigahertz nanomechanical oscillators based on carbon nanotubes,
Nanotechnology 15 (2004), 184–189.
[105] J.L. Li, J. Chun, N.S. Wingreen, R. Car, I.A. Aksay, and D. Saville, Use of
dielectric functions in the theory of dispersive forces, Physical Review B 71
(2005), no. 235412, 1–6.
[106] E.M. Lifshitz, The theory of molecular attractive forces between solids, Sov.
Phys. JETP 2 (1956), 73–83.
[107] F. London, Zur theorie und systematic der molekularkrafte, Z. Phys. 63
(1930), 245–279.
[108] , The general theory of molecular forces, Transactions of the Faraday
Society 33 (1937), 8–26.
[109] M.J. Lopez, A. Rubio, J.A. Alonso, L.C. Qin, and S. Iijima, Novel polygo-
nized single-wall carbon nanotube bundles, Physical Review letters 86 (2001),
no. 14, 3056–3059.
[110] J. Mahanty and B.W. Ninham, Dispersion forces, Academic Press, 1976.
[111] G.C. Maitland, M. Rigby, E.B. Smith, and W.A. Wakeham, Intermolecular
forces: Their origin and determination, Clarendon Press, 1981.
[112] H. Margenau and N.R. Kestner, Theory of intermolecular forces, Pergamon
Press, 1969.
176
[113] R. Martel, H.R. Shea, and P. Avouris, Rings of single-walled carbon nan-
otubes, Nature 398 (1999), 299.
[114] R.M. Martin, Electronic structure: basic theroy and practical methods, Cam-
bridge University Press, 2005.
[115] J. McMurry, Organic chemistry, 6 ed., Brooks and Cole, 2004.
[116] Donald McQuarrie and John Simon, Physical chemistry: a molecular ap-
proach, Viva Books, 2005.
[117] E.J. Meijer and M. Sprik, A density-functional study of intermolecular in-
teractions of benzene, Journal of Chemical Physics 105 (1996), no. 19, 8684–
8689.
[118] Kenneth J. Miller, Additivity methods in molecular polarizability, Journal of
the American Chemical Society 112 (1990), no. 23, 8533 – 8542.
[119] T. Mishina, K. Nitta, and Y. Masumoto, Coherent lattice vibration of in-
terlayer shearing mode of graphite, Physical Review B 62 (2000), no. 4,
2908–2911.
[120] H.J. Monkhorst and J.D. Pack, Special points for brillouin-zone integrations,
Physical Review B 13 (1976), no. 5188-5192.
[121] M. Monthioux and V.L. Kuznetsov, Who should be given the credit for the
discovery of carbon nanotubes?, Carbon 44 (2006), 1621–1623.
[122] J. J. More, B. S. Garbow, and K. E. Hillstrom, User guide for MINPACK-1,
Tech. Report ANL-80-74, Argonne National Laboratory, Argonne, IL, 1980.
[123] M Morin, N. J. Levinos, AL. Harris, and M. A. Gomez, Vibrational energy
transfer of CO/Cu(100): Nonadiabatic vibration/electron coupling, Journal
of Chemical Physics 96 (1992), 3950–3956.
177
[124] M.B. Nardelli, B. I. Yakobson, and J. Bernholc, Brittle and ductile behavior
in carbon nanotubes, Physical Review Letters 81, no. 21, 4656–4659.
[125] G.C. Nasreen, L.X. Benedict, V.H. Crespi, M.L. Cohen, S.G. Louie, and
A. Zettl, Fully collapsed carbon nanotubes, Nature 377 (1995), 135–138.
[126] R. Neitola, H. Ruuska, and T.A. Pakkanen, Ab-initio studies on nanoscale
friction between graphite layers: effect of model size and level of theory,
Journal of Physical Chemistry B 109 (2005), 10348–10354.
[127] R.J. Nemanich, G. Lucovsky, and S.A. Solin, Infrared active optical vibra-
tions of graphite, Solid state communications 23 (1977), 117–120.
[128] R. Nicklow, N. Wakabayashi, and H.G. Smith, Lattice dynamics of pyrolytic
graphite, Physical Review B 5 (1972), no. 12, 4951–4962.
[129] A. Oberlin, M. Endo, and T. Koyama, Filamentous growth of carbon through
benzene decomposition, J. Cryst. Growth 32 (1976), 335349.
[130] F. Ortmann, F. Bechstedt, and W.G. Schmidt, Semiempirical van der waals
correction to the density functional description of solids and molecular struc-
tures, Physical Review B 73 (2006), no. 205101, 1–10.
[131] A.H. Palser, Interlayer interactions in graphite and carbon nanotubes, Phys.
Chem. Chem. Phys. 1 (1999), 4459–4462.
[132] G.D. Parfitt and N.H. Picton, Stability of dispersion of graphitized carbon
blacks in aqueous solutions of sodium dodecyl sulphate, Transactions of the
Faraday Society 64 (1968), 1955–1964.
[133] Linus Pauling, The nature of the chemical bond, 3 ed., Cornell University
Press, 1960.
[134] H. Y. Peng, N. Wang, Y. F. Zheng, Y. Lifshitz, J. Kulik, R. Q. Zhang, C. S.
Lee, and S. T. Lee, Smallest diameter carbon nanotubes, Applied Physics
Letters 77 (2000), no. 18, 2831–2833.
178
[135] J.M. Perez-Jorda and A. D. Becke, A density-functional study of van der
waals forces: rare gas diatomics, Chemical Physics Letters 233 (1995), no. 1-
2, 134–137.
[136] G. Pilcher and H.A. Skinner, Valence-states of boron, carbon, nitrogen and
oxygen, Journal of Inorganic Nuclear Chemistry 24 (1962), 937–952.
[137] D. Porezag, Th. Frauenheim, and Th. Kohler, Construction of tight-binding-
like potentials on the basis of density-functional theory: Application to car-
bon, Physical Review B 51 (1995), no. 19, 12947–12957.
[138] M.J. Powell, A hybrid method for non-linear equations, in numerical methods
for non-linear algebraic equations, Gordon and Breach, 1970.
[139] W. Press, S. Teukolsky, W. Vetterling, and B. Flannery, Numerical recipes
in c: The art of scientific computing, 2 ed., Cambridge University Press,
1992.
[140] Peter Pulay, Modern theoretical chemistry iv, Applications of Electronic
Structure Theory, vol. 4, Plenum Press, 1977.
[141] L.V. Radushkevich and V.M. Lukyanovich, O strukture ugleroda, obrazu-
jucegosja pri termiceskom razlozenii okisi ugleroda na zeleznom kontakte,
Zurn Fisic Chim 26 (1952), 8895.
[142] A. Rochefort, P. Avouris, F. Lesage, and D.R. Salahub, Electrical and me-
chanical properties of distorted carbon nanotubes, Physical Review B 60
(1999), no. 13824.
[143] J.H. Rose, J.R. Smith, and J. Ferrante, Universal features of bonding in
metals, Physical Review B 28 (1983), no. 4, 1835–1845.
[144] T. Rueckes, K. Kim, E. Joselevich, G.Y. Tseng, C. Cheung, and C.M.
Lieber, Carbon nanotube-based nonvolatile random access memory for molec-
ular computing, Science 289 (2000), no. 5476, 94–97.
179
[145] R.S. Ruoff, J. Tersoff, D.C. Lorents, S. Subramoney, and B. Chan, Radial
deformation of carbon nanotubes by van der waals forces, Nature 364 (1993),
514–516.
[146] H. Rydberg, M. Dion, N. Jacobson, E. Schrodier, P. Hyldgaard, S.I. Simak,
D.C. Langreth, and B.I. Lundqvist, Van der waals density functional for
layered structures, Physical Review Letters 91 (2003), no. 2, 1–4.
[147] H. Rydberg, B.I Lundqvist, D.C. Langreth, and M Dion, Tractable nonlocal
correlation density functionals for flat surfaces and slabs, Physical Review
B 62 (2000), 6997–7006.
[148] R. Saito, G. Dresselhaus, and M.S. Dresselhaus, Physical properties of carbon
nanotubes, Imperial College Press, 1998.
[149] R. Saito, M. Fujita, G. Dresselhaus, and M. S Dresselhaus, Electronic struc-
ture of chiral graphene tubules, Applied Physics Letters 60 (1992), no. 18,
2204–2206.
[150] S. Sapmaz, Ya. M. Blanter, L. Gurevich, and H. S. J. van der Zant, Carbon
nanotubes as nanoelectromechanical systems, Physical Review B (Condensed
Matter and Materials Physics) 67 (2003), no. 23, 235414.
[151] M.C. Schabel and J.L. Martins, Energetics of interplanar binding in graphite,
Physical Review B 46 (1992), no. 11, 7185–7188.
[152] A. Sieck, D. Porezag, T. Frauenheim, M. R. Pederson, and K. Jackson, Struc-
ture and vibrational spectra of low-energy silicon clusters, Physical Review
A 56 (1997), no. 4890.
[153] J.C. Slater and J.G. Kirkwood, The van der waals forces in gases, Physical
Review 37 (1931), 682–697.
[154] J.C. Slater and G.F. Koster, Simplified lcao method for the periodic potential
problem, Physical Review 94 (1954), 1498 – 1524.
180
[155] M. Su, B. Zheng, and J. Liu, A scalable cvd method for the synthesis of single-
walled carbon nanotubes with high catalyst productivity, Chemical Physics
Letters 322 (2000), 321–326.
[156] A. Szabados, L.P. Biro, and P.R. Surjan, Intertube interactions in carbon
nanotube bundles, Physical Review B 73 (2006), no. 195404, 1–9.
[157] K.T. Tang and J.P. Toennies, An improved simple model for the van der
waals potential based on universal damping functions for the dispersion co-
efficients, Journal of Chemical Physiscs 80 (1983), no. 8, 3726–3741.
[158] S.J. Tans, A. Verschueren, and C. Dekker, Room-temperature transistor
based on a single carbon nanotube, Nature 393 (1998), 49.
[159] J. Tersoff and R.S. Ruoff, Structural properties of a carbon nanotube crystal,
Physical Review Letters 73 (1994), no. 5.
[160] A. Thess, R. Lee, P. Nikolaev, H. Dai, P. Petit, J. Robert, C. Xu, Y.H Lee,
S.G. Kim, A.G. Rinzler, D.T. Colbert, G.E. Scuseria, D. Tomanek, J.E.
Fischer, and R.E. Smalley, Crystalline ropes of metallic carbon nanotubes,
Science 273 (1996), 483–487.
[161] S.B. Trickey, F. Muller-Plathe, and G.H.F. Diercksen, Interplaner binding
and lattice relaxation in a graphite dilayer, Physical Review B 45 (1992),
no. 8, 4460–4468.
[162] F. Tuinstra and J.L. Koenig, Raman spectrum of graphite, Journal of Chem-
ical Physics 53 (1970), no. 3, 1126–1130.
[163] H. Ulbricht, G. Moos, and T. Hertel, Interaction of c60 with carbon nanotubes
and graphite, Physical Review Letters 90 (2003), no. 9.
[164] Loup Verlet, Computer “experiments” on classical fluids. i. thermodynamical
properties of leonard-jones molecules, Physical Review 159 (1967), no. 1.
181
[165] P.R. Wallace, The band theory of graphite, Physical Review 71 (1947), 622–
534.
[166] R.J. Wheatley and W.J. Meath, Dispersion energy damping functions, and
their relative scale with interatomic separation, for (h,he,li)-(h,he,li) inter-
actions, Molecular Physics 80 (1993), no. 1, 25–54.
[167] Qin Wu and Weitao Yang, Empirical correction to density functional theory
for van der waals interactions, Journal of Chemical Physics 116 (2002),
no. 2, 515 – 524.
[168] X. Wu, M.C. Vargas, S. Nayak, V. Lotrich, and G. Scoles, Towards extend-
ing the applicability fo density functional theory to weakly bound systems,
Journal of Chemical Physiscs 115 (2001), no. 19, 8748–8757.
[169] B. I. Yakobson, C. J. Brabec, and J. Bernholc, Nanomechanics of car-
bon tubes: Instabilities beyond linear response, Physical Review Letters 76
(1996), no. 14, 2511–2514.
[170] B.I. Yakobson and L.S. Couchman, Persistence length and nanomachanics
of random bundles of nanotubes, Journal of nanoparticle research 8 (2006),
105–110.
[171] Z. Yao, C.L. Kane, and C. Dekker, High-field electrical transport in single-
wall carbon nanotubes, Physical Review Letters 84 (2000), no. 13, 2941–
2944.
[172] M. T. Yin and Marvin L. Cohen, Structural theory of graphite and graphitic
silicon, Physical Review B 29 (1984), no. 12, 6996–6998.
[173] M.F. Yu, B.S. Files, S. Arepalli, and R.S. Ruoff, Tensile loading of ropes
of single wall carbon nanotubes and their mechanical properties, Physical
Review Letters 84 (2000), no. 24, 5552–5555.
182
[174] M.F. Yu, O. Lourie, M.J. Dyer, K. Moloni, T.F. Kelly, and R.S. Ruoff,
Strength and breaking mechanism of multiwalled carbon nanotubes under
tensile load, Science (2000), no. 287, 637–640.
[175] R. Zacharia, H. Ulbricht, and T. Hertel, Interlayer cohesive energy of
graphite from thermal desorption of polyaromatic hydrocarbons, Physical
Reivew B 69 (2004), no. 155406, 1–7.
[176] X. Zhao, Y. Liu, S. Inoue, T. Suzuki, R. O. Jones, and Y. Ando, Smallest
carbon nanotube is 3 [angstrom] in diameter, Physical Review Letters 92
(2004), no. 12, 125502.
[177] Q. Zheng and Q. Jiang, Multiwalled carbon nanotubes as gigahertz oscilla-
tors, Physical Review Letters 88 (2002), no. 4, 045503.
[178] Q. Zheng, J.Z. Liu, and Q. Jiang, Excess van der waals interaction energy
of a multiwalled carbon nanotube with an extruded core and the induced core
oscillation, Physical Review B 65 (2002), no. 24, 245409.
[179] H.W. Zhu, C.L. Xu, D.H. Wu, B.Q. Wei, R. Vajtai, and P.M. Ajayan, Direct
synthesis of long single-walled carbon nanotube strands, Science 296 (2002),
no. 5569, 884–886.
[180] U. Zimmerli, M. Parrinello, and P. Koumoutsakos, Dispersion corrections
to density functionals for water aromatic interactions, Journal of Chemical
Physiscs 120 (2004), no. 6, 2693–2699.
183