New domains in automatic mechanism generationcj82qg12w/... · NEW DOMAINS IN AUTOMATIC MECHANISM...
Transcript of New domains in automatic mechanism generationcj82qg12w/... · NEW DOMAINS IN AUTOMATIC MECHANISM...
NEW DOMAINS IN AUTOMATIC MECHANISM GENERATION
A Dissertation Presented
By
Belinda Leigh Slakman
To
The Department of Chemical Engineering
In partial fulfillment of the requirementsFor the degree of
Doctor of Philosophy
In the Field of
Chemical Engineering
Northeastern UniversityBoston, Massachusetts
August 2017
ii
Abstract
Deeper understanding of complex chemical systems can be aided by detailed kinetic mod-
eling, in which processes are broken down into their individual elementary reactions. An
important industrial goal is to move from postdictive to predictive modeling, where new
chemical vapor deposition (CVD) precursors, for example, can be tested for efficiency
without performing tedious and expensive experiments. Some of these microkinetic mod-
els may contain hundreds of reacting chemical species, and thousands of reactions; thus,
it is desirable to build the models automatically with a computer to speed up model gen-
eration and reduce errors. Automatic mechanism generation is now commonly used for
applications such as combustion, but extension to other systems presents challenges. This
dissertation describes the extension of the Reaction Mechanism Generator (RMG) soft-
ware to two less-studied chemical systems: the oxidation of liquid fuels and the gas-phase
decomposition of silicon hydrides.
To model liquid fuel oxidation, the software’s existing gas-phase thermodynamics
and kinetics databases needed to be supplemented, or corrected to account for solvated re-
actions. Existing correlations and data for solvation thermodynamics and diffusion were
improved and added to RMG. Solvation kinetics data were obtained by developing a ma-
chine learning algorithm to systematically predict the change in barrier height when going
from gas-phase to various solvents. The algorithm was trained with quantum chemistry cal-
culations on a simple set of hydrogen abstraction and intra-hydrogen migration reactions.
The method was used to change the rates in a model for the oxidation of dodecane/methyl
oleate blends, showing a marked change in the models prediction for the fuel’s induction
period.
iii
The second part of this dissertation involves gas-phase silicon hydride decomposi-
tion, for the application to CVD. Thermodynamic and kinetic data were added from liter-
ature to RMG’s database. Specifically focusing on radical reaction types, additional data
were calculated via quantum chemistry for hydrogen bond increment (HBI) values of sili-
con hydride species, as well as hydrogen abstraction reaction rates. A SiH4 decomposition
model was built with the updated RMG and compared to experiment, with good agreement.
This work provides new insight on both of these chemical systems and contributes
new calculated thermodynamics and kinetics parameters. Importantly, it also guides future
developers in adding capabilities for new phases or elements to mechanism generation
software.
iv
ACKNOWLEDGEMENTS
My dissertation work would not have been possible without the support of many
people, near and far.
Thank you to my advisor, Dr. Richard West, for supporting me these past five years.
Your intelligence and insights have been invaluable, but at the same time, you have always
let me run with the ideas I have had and let me make mistakes on my own- the marks of
a great advisor. Thanks for giving me the opportunities to travel and present my research,
teach, and mentor. I also want to thank my other committee members: Dr. Anand Asthagiri,
Dr. Carolyn Lee-Parsons, Dr. Mary Jo Ondrechen, and Dr. Harsono Simka for your time and
helpful discussions over the years. I would especially like to thank Harsono for mentoring
me throughout two internships at Intel Corporation and beyond, and for all of your personal
and professional advice.
I want to thank my other colleagues at Intel, particularly Karson Knutson, Dr. Har-
inath Reddy, and other members of the TCAD-IPAG group. I learned a lot from all of you,
and thanks for giving me the opportunity to come back and work with you a second time.
This dissertation would surely not have been possible without the hard work of past
and present RMG developers in the Green Group at MIT. I would especially like to thank
Dr. William Green for his insights over the years, and Dr. Amrit Jalan and Yunsie Chung
for helpful discussions on solvation in RMG.
This work could not have been completed without the help of Research Comput-
ing at Northeastern University, especially Nilay Roy, who helped maintain the Discovery
cluster.
v
I want to thank Michael Li, and my instructors and colleagues at The Data Incuba-
tor, for teaching me about data science, and providing guidance and friendship during the
spring of 2017. I would also like to acknowledge Lilian Tsang and the organizers of the
Combustion Energy Frontier Research Center summer school, which I attended in 2013
and 2014.
I would like to thank the current and former Northeastern Department of Chemical
Engineering staff for their support: Jessica, Brandon, Francesca, Kelly, Sarah and espe-
cially Pat and Rob. I also have to thank my 11 classmates and friends who started this
journey with me: Chris, Dan, Hunter, Luting, Mark, Negar, Oljora, Sue, Sydney, Tanya
and Taylor.
To Dr. Pierre Bhoorasingh and Dr. Fariba Seyedzadeh Khanshan: I couldn’t have
done this work without your guidance, good example, and levity, and thank you for the
friendship and advice you continue to provide. Thank you to Jason for being our “scientist”
and for all of your research assistance. I’d also like to thank Nate for letting me vent, not
just about kinetics. I also want to thank current West group members Yawei, Mike, Rasha
and Krishna and past members Jacob, Victor, Elliot, Claudia, and Drew for your useful
discussions and companionship; it’s been a pleasure to work with all of you.
I have to thank all of my friends, in Boston and beyond. In particular, I want to
thank Amanda for being my roommate for 4 years, Ina for sharing in the best friendship
that has ever come from Craigslist (#18westwood), Katarina for reminding me that my
Ph.D. project is not the only measure of my worth, and Jennifer for judgment-free support
of my decisions.
Last, and most importantly, I have to thank my family, who has patiently supported
me for the past 28 years. Mom, Dad, Jordan and Casey, thank you for standing with me as
I take this next step. I am lucky to have such love in my life.
vi
Contents
1 Introduction 1
1.1 Automatic mechanism generation . . . . . . . . . . . . . . . . . . . . . . 2
1.1.1 Reaction Mechanism Generator (RMG) . . . . . . . . . . . . . . . 6
1.2 Parameter estimation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
1.2.1 Thermodynamics . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
1.2.2 Kinetics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
1.2.3 Transition state geometries . . . . . . . . . . . . . . . . . . . . . . 13
1.3 Machine learning with decision trees . . . . . . . . . . . . . . . . . . . . . 13
1.3.1 Decision trees in chemistry and biology . . . . . . . . . . . . . . . 14
1.4 Dissertation overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
2 Automatic calculation of solvation thermodynamics 17
2.1 Background . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
2.1.1 Explicit thermodynamic calculations . . . . . . . . . . . . . . . . . 18
2.1.2 Estimation techniques . . . . . . . . . . . . . . . . . . . . . . . . 18
2.2 Methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
2.2.1 Adaptation of solvation thermodynamics from RMG-Java . . . . . 22
2.2.2 New additions in RMG-Py . . . . . . . . . . . . . . . . . . . . . . 23
2.3 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
2.3.1 Estimation of solute descriptors . . . . . . . . . . . . . . . . . . . 27
2.3.2 Calculation of solvation thermodynamics . . . . . . . . . . . . . . 28
vii
2.4 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
2.5 Recommendations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
2.5.1 Temperature dependence of solvation thermodynamics . . . . . . . 30
2.5.2 Calculation of solvation thermodynamics for lone pair species . . . 30
2.5.3 Improved benchmarking of group additivity values . . . . . . . . . 30
2.5.4 Expansion and enhancement of solvents . . . . . . . . . . . . . . . 31
3 Implementing kinetic solvent effects in automatic mechanism generation 32
3.1 Background . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
3.1.1 Experimental techniques for determining reaction rates in liquids . 34
3.1.2 Computational chemistry . . . . . . . . . . . . . . . . . . . . . . . 35
3.1.3 Kinetic solvent effects within reaction families . . . . . . . . . . . 38
3.2 Methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
3.2.1 Diffusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
3.2.2 Intrinsic kinetics . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
3.2.3 Fuel oxidation model modification . . . . . . . . . . . . . . . . . . 54
3.2.4 Reactor simulations . . . . . . . . . . . . . . . . . . . . . . . . . . 54
3.3 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
3.3.1 Solvation kinetics trends . . . . . . . . . . . . . . . . . . . . . . . 55
3.3.2 New reactor simulations . . . . . . . . . . . . . . . . . . . . . . . 58
3.4 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
3.5 Recommendations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61
3.5.1 On-the-fly estimation of solvation kinetics . . . . . . . . . . . . . . 61
3.5.2 Benchmarking the estimates . . . . . . . . . . . . . . . . . . . . . 62
3.5.3 Check thermodynamic consistency with LSERs . . . . . . . . . . . 62
3.5.4 Data-driven approaches . . . . . . . . . . . . . . . . . . . . . . . . 63
viii
4 Automated silicon hydride mechanism generation 66
4.1 Background . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67
4.1.1 Experimental work on SiH4 chemistry . . . . . . . . . . . . . . . . 67
4.1.2 Detailed mechanisms for SiH4 CVD . . . . . . . . . . . . . . . . . 68
4.1.3 Importance of radical chemistry in silicon hydride thermal decom-
position . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70
4.2 Methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70
4.2.1 RMG source code . . . . . . . . . . . . . . . . . . . . . . . . . . 71
4.2.2 Updating RMG’s database . . . . . . . . . . . . . . . . . . . . . . 71
4.2.3 RMG model generation . . . . . . . . . . . . . . . . . . . . . . . . 74
4.2.4 Reactor modeling . . . . . . . . . . . . . . . . . . . . . . . . . . . 74
4.3 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
4.3.1 Kinetics of hydrogen abstraction reactions . . . . . . . . . . . . . . 75
4.3.2 Calculated thermodynamic data . . . . . . . . . . . . . . . . . . . 76
4.3.3 RMG generated mechanisms . . . . . . . . . . . . . . . . . . . . . 79
4.4 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86
4.5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
4.6 Recommendations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88
4.6.1 Expansion of thermodynamic libraries for radical species . . . . . . 88
4.6.2 Calculation of rates . . . . . . . . . . . . . . . . . . . . . . . . . . 89
4.6.3 Sensitivity analysis . . . . . . . . . . . . . . . . . . . . . . . . . . 90
4.6.4 Surface chemistry . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
5 Conclusion 92
5.1 Liquid-phase fuel oxidation . . . . . . . . . . . . . . . . . . . . . . . . . . 92
5.2 Thermal decomposition of silicon hydrides . . . . . . . . . . . . . . . . . 94
5.3 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95
ix
References 96
A Supplementary Info for Solvation Kinetics 114
A.1 Solvation kinetics molecular structure group trees and values . . . . . . . . 114
A.2 Script for modifying Chemkin files for solvation kinetics corrections . . . . 116
A.3 Modified Cantera input file for n-dodecane/ methyl oleate oxidation . . . . 116
A.4 Cantera script to simulate liquid fuel oxidation reactor . . . . . . . . . . . . 116
A.5 Code for automatic tree building . . . . . . . . . . . . . . . . . . . . . . . 116
B Supplementary Info for Silicon Hydrides 117
B.1 Geometries of reactants and transition states for hydrogen abstraction reac-
tions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117
B.2 Geometries of silicon hydride species . . . . . . . . . . . . . . . . . . . . 119
B.3 Largest SiH4 decomposition mechanism . . . . . . . . . . . . . . . . . . . 122
B.4 Cantera script for simulating reactor . . . . . . . . . . . . . . . . . . . . . 122
B.5 Code for residence time comparison . . . . . . . . . . . . . . . . . . . . . 122
x
List of Figures
1.1 Rules for the generation of an elementary reaction network in GRACE . . . 3
1.2 Lumped approach for the primary oxidation of n-pentane in MAMOX . . . 4
1.3 Template and recipe for the hydrogen abstraction reaction family . . . . . . 7
1.4 Part of hydrogen abstraction hierarchical tree in RMG . . . . . . . . . . . . 7
2.1 Examples of molecular structure fragments . . . . . . . . . . . . . . . . . 20
2.2 Entry in the solute group database . . . . . . . . . . . . . . . . . . . . . . 22
2.3 Molecular structure fragments in g-decalactone . . . . . . . . . . . . . . . 27
2.4 Comparison of calculated and experimental values for ∆Hsolv and ∆Gsolv . 28
3.1 Example of a potential energy surface . . . . . . . . . . . . . . . . . . . . 33
3.2 Concept of peroxyl radical clock . . . . . . . . . . . . . . . . . . . . . . . 35
3.3 Comparison between continuum and hybrid implicit/explicit solvation models 36
3.4 The solvent effect on hydrogen abstraction from α-tocopherol . . . . . . . 38
3.5 Example of the PCET mechanism . . . . . . . . . . . . . . . . . . . . . . 41
3.6 β-scission rates correlate with Dimroth-Reichardt parameter ET . . . . . . 43
3.7 Diels-Alder reaction of cyclopentadiene and (-)-menthyl acrylate . . . . . . 44
3.8 Acetylation mechanism . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
3.10 Possible mechanisms in the hydrolysis of formamide . . . . . . . . . . . . 48
3.11 ∆EA for the reaction XH + ·OH←→ ·X + H2O . . . . . . . . . . . . . . . 56
3.12 Illustration of the first few levels of group trees for hydrogen abstraction . . 57
xi
3.13 Comparison of experiments, original model and updated model with ki-
netic solvent effects for 0% methyl oleate. . . . . . . . . . . . . . . . . . . 59
3.14 Comparison of experiments, original model and updated model with ki-
netic solvent effects for 5% methyl oleate. . . . . . . . . . . . . . . . . . . 59
3.15 Comparison of experiments, original model and updated model with ki-
netic solvent effects for 10% methyl oleate. . . . . . . . . . . . . . . . . . 60
3.16 Comparison of experiments, original model and updated model with ki-
netic solvent effects for 30% methyl oleate. . . . . . . . . . . . . . . . . . 60
3.17 Algorithm for obtaining solvation energy estimation values from a large
transition state dataset . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
4.1 G3//B3LYP calculations of ∆fH◦298 compared to high level calculations . . 77
4.2 Group additivity calculations of ∆fH◦298 and HBI corrections compared to
high level calculations . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79
4.3 Simulation results compared with SiH4 thermal decomposition experiment . 81
4.4 SiH4 concentration vs. temperature at different residence times . . . . . . . 82
4.5 Simulation results for full pressure dependent mechanisms generated by
RMG, compared with a mechanism generated without radical reaction fam-
ilies allowed . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
4.6 Flux diagram for Si at 6× 104 seconds for full, pressure dependent mech-
anism generated by RMG and simulated at 613 K . . . . . . . . . . . . . . 83
4.7 Variation in concentration profiles of SiH4 and Si2H6 with initial SiH4 con-
centration at 873 K. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84
xii
List of Tables
2.1 Radical corrections to A for solvation thermodynamics calculations . . . . 25
2.2 Comparison of group additivity and experimental solute parameters . . . . 27
3.1 Training reactions used to deduce kinetic solvent effects . . . . . . . . . . . 52
3.2 Solvents used for single-point energy calculations on training reactions . . . 53
4.1 Reaction families used to generate mechanisms for silicon hydrides in RMG 73
4.2 Hydrogen abstraction rates calculated from M062X/6-311+(3d2f) and tran-
sition state theory using Cantherm . . . . . . . . . . . . . . . . . . . . . . 75
4.3 Hydrogen bond increment (HBI) corrections calculated with G3//B3LYP . . 78
A.1 Group tree for hydrogen abstraction reactions . . . . . . . . . . . . . . . . 114
B.1 Transition state geometries for silicon hydride hydrogen abstractions . . . . 117
B.2 Geometries of silicon hydride species . . . . . . . . . . . . . . . . . . . . 119
1
1 INTRODUCTION
Understanding the oxidation of fuels in the liquid phase is important, as autoxida-
tion during storage leads to a loss of reactivity and may cause fuels to fail national standards
of oxidative stability [1]. Over time, autoxidation also causes large increases in viscosity
[2], making the fuels more difficult and less cost-effective to use. On the other hand, oxi-
dation of fuels in the liquid-phase can also be used to produce higher-value petrochemical
products [3]. Fuel oxidation is a complex process, dependent upon the detailed chemistry of
fuel components and additives. Many of these details are unknown, particularly for newer,
biologically derived fuels [4]. Building microkinetic models can be an effective method for
studying these systems; however, since these models can be quite large (174 species and
3275 reactions for a recent n-dodecane/methyl oleate autoxidation model [4]), building the
models automatically using a computer is preferable to increase speed and reduce errors.
Another complex chemical system which can benefit from detailed kinetic model-
ing is chemical vapor deposition (CVD). CVD is an important process in the semiconductor
industry for making silicon wafers. Predictive models can guide experimentalists on how
to design new silicon precursors and choose process conditions that will improve the effi-
ciency of CVD and the quality of silicon produced. Modeling just the gas-phase portion of
CVD is complex; one large manually built model, for the application of silicon nanopar-
ticle formation, contains 220 chemical species and 2600 reactions [5]. Again, building
these models by hand is slow, error-prone and important pathways may be missed, so using
automatic mechanism generation is more desirable. Mechanism generation software was
utilized in one such study of silicon hydrides in the gas phase [6]. Proper modeling of gas-
2
phase chemistry of CVD using automatic mechanism generation leads the way for similar
modeling of gas-surface chemistry, which is integral to these systems.
This dissertation will address these two chemistry domains using automatic mecha-
nism generation, and will focus on one software package in particular, the Reaction Mech-
anism Generator (RMG) [7, 8]. Studying such systems using RMG (and automatic mecha-
nism generation in general) is novel; most applications of automatic mechanism generation
are in combustion and pyrolysis of hydrocarbon fuels, with recent extensions to oxygenated
biofuels [9] and fuels containing sulfur [10]. Extensions beyond combustion and pyrolysis
of fuels include refrigerant formation from chlorinated hydrocarbons [11] and the pyrolysis
of ethyl nitrite [12].
The completed work falls at the intersection of chemical kinetics, computational
chemistry, and machine learning, with the latter two used for parameter estimation in mi-
crokinetic models. Background on each of these facets of the dissertation will be discussed
in this chapter, including a brief history of automatic mechanism generation.
1.1 Automatic mechanism generation
As early as 1979, Ugi et al. introduced the idea of using matrices to represent chem-
ical species, reactions and distances [13]. The concept was extended to mechanism gen-
eration, defined as the problem of finding all possible sets of bond/electron matrices that
fit a given reaction matrix, and then using rules based on chemical knowledge to limit the
size of the mechanism. At the same time, Yoneda created GRACE, a network generator
based upon matrix theory using square matrices [14]. The reactant and product matrices
are also divided into atom groups, consisting of a center atom and its attached hydrogen
atoms. GRACE can handle both radical and ionic reactions but does not take into account
stereochemistry. The sub-systems of GRACE include decomposing an overall reaction into
a set of elementary reactions; gathering the Arrhenius parameters for these reactions; and
setting up the the material balances for each species. This mechanism generator also in-
3
cludes additional constraints; an example of these is given in Figure 1.1, reproduced from
the paper. The authors acknowledge that the constraints must be carefully chosen by an
experienced chemist, so that one does not generate unreasonable reactions or species.
Figure 1.1: Rules imposed for the generation of an elementary reaction network, repro-duced from [14].
The matrix theory above is also applied in the 1992 network generator KING [15].
A combinatorial approach is taken, in which the set of products to be formed or possible
reactions are unknown. In this way, a large network is formed, but it may be possible
to discover new reaction types. It is suggested that constraints, including the number of
species that are allowed to interact and the number of bonds that can be broken or formed,
be implemented in order to reduce the size of the networks formed.
Ranzi et al. used a lumping method in their mechanism generator, MAMOX, de-
veloped in Fortran for oxidation and pyrolysis of fuels [16]. In this procedure, the de-
composition of the primary products are simply“lumped” and products beyond this first
decomposition are ignored. An overall rate constant is given for each type of reaction of
the primary products (see the reproduced Figure 1.2). Lumping reduces the number of
species and reactions to a manageable number, but one may miss important pathways or
products.
Alternatively, NetGen is a mechanism generator which, instead of lumping, in-
4
Figure 1.2: Lumped approach for the primary oxidation of n-pentane, reproduced from[16]. Intermediates and products are grouped together, with k1 - k9 representing lumpedrate constants for each step of the mechanism.
cludes the reactions deemed important in the network and removes the ones which are not
[17]. If a species has a production rate that is greater than some minimum rate, it is in-
cluded in the reaction network. This minimum rate is determined by a user-specified factor
multiplied by a characteristic rate, defined as
Rchar =amount of reactant converted
time it takes for conversion
If a species exceeds the minimum rate, it is included in the network and then re-
acted with all other species, and these reactions added to the mechanism. The mechanism
generation stops when no more species exist which have a rate greater than the minimum
rate and the desired conversion of a starting species has been achieved. The concept of
characteristic rate is included in several other mechanism generators, including Genesys
and RMG, to be described in the following sections.
EXGAS is a software for generating mechanisms of alkane and ether fuels [18].
The mechanisms created consist of a reaction base which involves all unimolecular or bi-
molecular reactions of molecules C0 to C2; a primary mechanism with only initial species
and oxygen as reactants; and a secondary mechanism, which is lumped. The lumping is
5
done by grouping the species formed in the primary mechanism that contain the same func-
tional groups and have the same chemical formula. A distinguishing feature of EXGAS is
that internally, non-cyclic species are represented as treelike structures. Thus, species can
be compared for redundancy using an algorithm which is called the “algorithm of canon-
icity”, based on graph theory. The concept of graph representation of species was used in
all future automatic mechanism generators (with additional layers of complexity added in
some cases).
A mechanism generator developed in 2003 by Ratkiewicz et al., COMGEN, uses
chemical graphs to represent species both internally and externally [19]. Information such
as atomic charges, valences and bond types can also be included in these graphs. To gen-
erate reactions, the program relies on reaction patterns, which are matched with the same
pattern in the reacting molecules.
The Rule Input Network Generator (RING) takes not only the initial species and
network analysis instructions as user inputs, but also the chemistry rules it uses to gener-
ate reactions [20]. The language used to provide input to RING is English-like and the
compiler “proofreads” this in order to communicate between the user and network genera-
tion. RING has the ability to represent intermediates and also has a post-processing module
which analyzes the network topologically to enable lumping of pathways and mechanisms.
This mechanism generator also implements a lumping scheme that groups together isomers
with the same functional groups. For a network representing the dehydration of fructose to
form hydroxymethylfurfural, for example, lumping reduced the number of reactions by a
factor of two. RING has the ability to represent user-defined entities that the program does
not automatically identify, such as inorganic atoms or active sites on catalysts. It can also
represent non-bonded interactions such as partial bonds and hydrogen bonds.
Genesys, developed at Ghent University, is similar to NetGen and RING in that it
uses only elementary reactions, limiting the network size by characteristic rate and uses a
rule-based approach for defining how molecules react. However, it differs in its internal
6
representation of chemical species; rather than using a graph, it utilizes the Chemistry De-
velopment Kit (CDK). The representation allows for different stereo-isomers to be uniquely
identified, which connectivity graphs cannot distinguish [21]. This capability is beneficial,
since different conformers may have different reactivity.
1.1.1 Reaction Mechanism Generator (RMG)
The mechanism generator utilized in this work is the Reaction Mechanism Gener-
ator (RMG) [7, 8, 22]. RMG is an open-source program with two versions, one written in
the Java programming language and the other in Python. The Python version will be used
and modified in this work, due to its simplicity of use and code readability as compared
to the Java version. Given a set of starting species, temperature, and pressure, RMG uses
the literature and chemical knowledge to propose a network of elementary reactions, their
rates, and thermodynamic properties.
In RMG, like several of the aforementioned mechanism generators, chemical species
are represented by graphs, with nodes as atoms and edges as bonds. Graph theory can then
be used to compare species and ensure uniqueness in the network. The network is built
by expanding a ‘core’ set of species, starting with the user input. When a reaction results
in a new species, that species is put on the ‘edge’. When a species flux reaches a critical
amount, determined by a user-defined tolerance and the characteristic rate of the core, it is
brought into the core. The simulation ends when there are no more edge species which fit
this criteria for a given simulation time or species conversion [23]. The concept of charac-
teristic rate has been previously described [17].
A unique aspect of RMG is its way of estimating reaction rates. Reaction families
in RMG have a specific “recipe” that defines the change in radicals, lone pairs and bond
order occurring in a reaction. Figure 1.3 gives an example of the template and recipe for the
hydrogen abstraction reaction family. Starred atoms indicate the reacting atoms. Kinetic
groups include the reacting atom in the family’s recipe and the structures surrounding that
7
Figure 1.3: Example of the template and recipe for the hydrogen abstraction reaction fam-ily. In the recipe, starred atoms participate in each action; ‘S’ refers to a single bond beingformed or broken, and ‘1’ indicates that the radical count increases or decreases by 1.
Figure 1.4: An example of part of the hierarchical tree for the hydrogen abstraction reactionclass from RMG documentation
8
atom. These groups are defined in RMG in a tree-like structure. The root of each tree is the
most generic group which follows the reaction recipe, while nodes make up more specific
groups. An example of a hierarchical tree is given in Figure 1.4. When a combination
of groups reacts, a rule describes its kinetic parameters. If a rule exactly matches the
combination of groups in the reaction it will be returned, but if not, the nearest neighbors
in the hierarchical trees will be averaged to determine the reaction’s kinetic parameters.
One way new kinetics can be added to RMG’s database is as part of a reaction
library. Rates in a reaction library, such as GRI-mech [24], are used without modification
whenever the reactant or product species are found in the model core. This method of
addition to RMG’s database should be used for reactions that do not belong to a reaction
family. For example, reactions with small molecules may not follow the same trend as
others in the reaction family, or reactions may not match an existing recipe at all. Also, if
a pressure-dependent reaction rate is known, this reaction should be added to a library in
order bypass RMG’s Master Equation code for pressure dependence, explained below.
Reaction kinetics added as training reactions to a reaction family are placed into
the hierarchical tree of estimate rules, using the most specific functional group definitions
possible for those reactants. They are then used when completing the more general nodes
in the tree by averaging. Training reaction rates must provide high-pressure limit kinetics
for an elementary reaction of a specific reaction family, and will influence the estimates of
other similar reactions generated by the reaction family. This is the preferred way to add
kinetic data to RMG’s rule-based estimation database [8].
In the case of unimolecular reactions, when the number of nonreactive collisions
with a third body is rate-limiting, rate coefficients are dependent on both temperature and
pressure. RMG contains a methodology for using the high-pressure limit kinetics to esti-
mate these pressure dependent rate coefficients, which is described in ref. 25.
When a new chemistry domain is studied with RMG, new reaction libraries, fami-
lies and rate estimates should be added. This poses particular challenges when extending
9
to new elements, and especially when extending to a new phase. Prior to this work, all
reaction rates included in RMG’s database, as well as its methods for rate estimation, were
based upon data for gas-phase reactions. Additionally, RMG previously contained capa-
bility for chemical species containing only carbon, hydrogen, oxygen, nitrogen, sulfur and
chlorine.
1.2 Parameter estimation
Most parameters in microkinetic models are estimated, due to a lack of known and
trusted kinetic and thermodynamic parameters from experimental or high level theoretical
calculations [26]. The following sections will outline estimation techniques that are used to
calculate these kinetics and thermodynamics values at low computational cost. These esti-
mation techniques have been incorporated into automatic mechanism generation software
and form the basis for the some of the work in this dissertation.
1.2.1 Thermodynamics
Thermodynamic values can be calculated using group additivity, with radical chem-
ical species adding more complexity. When group additivity is insufficiently accurate, the
parameters can also be calculated using quantum chemistry.
1.2.1.1 Benson’s group additivity scheme
Benson developed a scheme in which chemical species are decomposed into groups,
defined as a central atom connected to its ligands. Group values for enthalpy and entropies
of formation, and heat capacity as a function of temperature, can then be summed to obtain
these thermodynamic values for the overall species [27]. Group values exist for molecules
containing a variety of atoms and also for open-shell species, but groups do not yet exist for
every single type of chemical species that might be present in a microkinetic model. Group
values can either be derived from experimental data, or high level quantum calculations
10
can be used to calculate new group values. The accuracy of the group additivity method is
dependent on the accuracy of the methods used to derive the group values, and also how
similar the chemical species to be estimated are to the species from which the groups were
derived. For example, many group additivity values exist for stable molecules, but less for
radicals or ions, meaning that less appropriate or analogous values are sometimes applied
to calculate their thermodynamic properties [26].
1.2.1.2 Hydrogen Bond Increment method
An alternative method to Benson’s group additivity for determining the thermody-
namic properties of radical species was proposed by Lay et al. [28] and is known as the Hy-
drogen Bond Increment (HBI) method. The approach uses the thermodynamic properties
of parent molecules and a single group value to account for loss of a hydrogen atom. For
enthalpy of formation, ∆fH◦298, the group value for a radical R∗ is simply the bond strength
of the R–H bond. The HBI values for Cp(T ) and S◦298 are obtained from molecular struc-
ture differences between the radical and parent molecule using the rigid-rotor/harmonic
oscillator (RRHO) approximation, which essentially means that translations, rotations, and
vibrations of the species are treated as uncoupled when solving the Schrodinger equation.
The S◦298 value calculated for the radical does not include symmetry corrections, which
should be added in later based on the point group of the radical.
The equations for the thermodynamic properties are as follows:
∆fH◦298(R∗) = HBI(∆fH
◦298) + ∆fH
◦298(RH)−∆fH
◦298(H)
C◦p(R∗) = HBI(C◦p) + C◦p(RH)
S◦298(R∗) = HBI(S◦298) + S◦298(RH)
where R∗ is the radical chemical species, RH is the parent molecule created by saturating
the radical chemical species with hydrogen atoms, and HBI are the group values for each
11
of enthalpy, entropy and heat capacity.
1.2.1.3 Quantum Mechanics Thermodynamic Property estimation
RMG contains a module for calculating thermodynamics known as Quantum Me-
chanics Thermodynamic Property (QMTP) estimation. During the course of a simulation,
if this option is turned on, RMG will send information about the molecule’s 2D graph to
a computational chemistry program to obtain the 3D geometry and eventually the species’
vibrational frequencies and enthalpy. Other thermodynamic properties can be calculated
from the computational chemistry output files using statistical mechanics. Typically, this
option is only used for cyclic and polycyclic species. QMTP uses the RRHO approxi-
mation and it was built for semi-empirical methods such as PM3, PM6 and PM7. While
these methods are not as accurate as other quantum chemistry methods, such as density
functional theory (DFT), they are sufficient when group additivity estimates are very inac-
curate, at low computational cost [29].
1.2.2 Kinetics
Similar to thermodynamics, unknown reaction kinetics can be calculated using esti-
mation techniques. Kinetic parameters can also be calculated directly using transition state
theory; these are used to formulate rate rules for a given type of reaction [30].
1.2.2.1 Evans-Polanyi
One estimation correlation, which relates reaction rates to enthalpy of reaction, is
the Evans-Polanyi relationship [31]:
k(T ) = A exp
((E0 + α∆H◦)
RT
)The correlation trends with reaction family; parameters A, α and E0 are fitted for a
given reaction family. While computationally efficient, the relationship is not accurate for
12
all classes of reactions.
1.2.2.2 Transition State Theory
With increases in computing power, more reaction rates can be calculated using
quantum chemistry and statistical mechanics concepts in a framework known as transi-
tion state theory [32]. Transition state theory says that a quasi-equilibrium exists between
reactant and activated complexes, or transition states. Classical transition state theory is
described by the following equation:
k(T ) =kBT
hexp
(−∆G‡
RT
)kB is the Boltzmann’s constant, T is temperature, h is Planck’s constant, R is the
ideal gas constant, and ∆G‡ is the difference in Gibbs free energy between transition state
and reactant. The equation can also be expressed in terms of partition functions, which can
be calculated using statistical mechanics and molecular parameters from quantum chem-
istry calculations. CanTherm, included as part of the RMG software but often used stand-
alone, is an automated kinetics calculator that calculates reaction rates via classical transi-
tion state theory (and also can calculate thermodynamic properties) from quantum chem-
istry output files [33]. Classical transition state theory is only appropriate for reaction types
with a clear reaction barrier; variational transition state theory should be used when the di-
viding surface between reactant and product is less clear. Other kinetics calculators include
POLYRATE [34], Variflex [35], and MultiWell [36].
13
1.2.3 Transition state geometries
To allow kinetics to be calculated via transition state theory, a transition state ge-
ometry is required. This is often the bottleneck of kinetics calculations, as a very close
estimate is needed for computational chemistry programs to correctly optimize the transi-
tion state geometry, which lies at a first order saddle point on the potential energy surface.
Double ended methods have been developed, which require the user to provide the reactant
and product geometries and use these to find the transition state. These methods fall under
the categories of interpolation, nudged elastic band, or string methods. Some of these dou-
ble ended methods have been automated, such as an algorithm developed by Zimmerman
[37], and KinBot [38], but the computational cost of these programs is prohibitive for use
in detailed kinetic models that may require thousands of rate parameter estimates.
Bhoorasingh and West developed a group contribution algorithm for creating the
transition state geometry estimate directly. It uses a machine learning approach to train key
distances in the transition state geometry, based on the molecular structure of the reactants
and products. Once these distances are known, correct transition states can be optimized
about 70% of the time, for three reaction families [39]. The algorithm was recently inte-
grated with computational chemistry packages, as well as CanTherm, for a fully automated
method to calculate reaction rates. This automated kinetics algorithm, known as AutoTST,
was used to generate some of the transition states and reaction rates used in this work [40].
1.3 Machine learning with decision trees
Thermodynamics, kinetics, and transition state geometry estimation using group
contribution is essentially a regression problem. Group values, based on molecular struc-
ture, are calculated by “regressing”, or minimizing the overall error (based on some cost
function) between estimated and true values from a training set. The way these regres-
sions are currently set up in RMG are in the format of decision tree regressors, which are
14
retrained when new reaction rates or transition state geometries are added to the training
set.
Decision trees can be applied to classification or regression problems. The class
label, or value in the case of regression, is chosen based on a set of questions about features
of data items. The tree is constructed by training on data items whose class label or value
is known, and then used on new data points. Decision trees are advantageous because they
are easier to understand than other, more complex machine learning models, such as neural
networks, but are more robust than linear or logistic regressions. The predictions can also
be improved by combining the results of multiple decision trees, known as ensembling
[41]. Depending on the application, the decision tree method is sometimes preferable over
simpler methods such as cluster analysis or data partitioning, which are used to develop
quantitative structure activity relationships (QSAR), because of its applicability to large
sets of diverse compounds which may also contain erroneous data [42]. Some examples of
using decision trees in chemistry and biology are described below.
1.3.1 Decision trees in chemistry and biology
There are many examples of bioinformatics and cheminformatics problems that
have been approached with decision trees. Han et al. describe a method for choosing
biologically interesting compounds for drug discovery from a high throughput screening
(HTS) data set. The decision tree was formulated based on the PubChem chemical struc-
ture fingerprint system, and the C4.5 algorithm was used to construct the decision trees
[43]. 10-fold cross validation was used to verify the validity of the decision tree model.
It was shown that the model can be used to determine commonalities in an HTS data set,
select compounds, and eliminate selections which arise from noisy data [42].
DNA sequencing is also a promising area for use of decision trees. Decision tree
regression was used by Thornley et al. for prediction of unknown bases in a sequence. The
decision tree was trained on all the peak heights near the base to be predicted, as well as
15
the bases in the neighborhood of those peaks. A neural network was further used to regress
the information that was most successful for the decision tree regressor. [44]
Decision tree regression has also been used for environmental applications. Hu and
Cheng used a decision tree method known as the conditional inference tree (CIT) to under-
stand which factors are important for predicting heavy metal distribution among the surface
soils of the Pearl River Delta in China. They used a random forest approach, a bootstrap-
ping algorithm where the CITs are subsampled without replacement. They combined the
CIT approach with a finite mixture distribution model (FMDM), which can be used to dis-
tinguish between natural and arthropogenic causes of heavy metal concentration in soils
[45].
Decision tree classification and regression have proved useful for several applica-
tions in chemistry and biology. Diverse features and data sets can be used with ease of
understanding and dealing with missing data. They are also more robust than other ma-
chine learning techniques. Furthermore, decision tree regressors are already implemented
for some estimation in RMG, and can be extended to applications in this dissertation. As it
stands, these trees are built by hand from chemical intuition, and the structure of these trees
also influence the accuracy of the predictive models. Automated decision tree generation
will be briefly discussed in Chapter 3.
1.4 Dissertation overview
The dissertation will apply automatic mechanism generation, quantum chemical
calculations and machine learning to two new domains: silicon hydride chemistry and
liquid-phase fuel chemistry. Chapter 2 of the dissertation discusses the implementation
of solvation thermodynamic corrections to the gas phase data and methods in the Python
version of RMG (RMG-Py). The implementation draws largely from the implementation
in RMG-Java, with several improvements and additional data. Chapter 3 continues on the
liquid-phase application, introducing a new method which uses machine learning to predict
16
a change in reaction rate between gas and liquid-phase. The method is used to modify the
rates of an existing fuel oxidation model. Chapter 4 introduces a different application:
the use of automatic mechanism generation to generate detailed kinetic models for silicon
hydrides. The process of calculating and adding thermodynamic and kinetic data for silicon
hydrides to RMG is outlined, and a new model for gas-phase thermal decomposition of
SiH4 is generated and discussed. Chapter 5 concludes by wrapping up the previous chapters
and recommending some future directions for the research.
17
2 AUTOMATIC CALCULATION OF SOLVATION THERMODYNAMICS
Several environmentally, medically, and industrially relevant chemical systems in-
volve liquid-phase reactions, including secondary organic aerosol formation, oxidation
of fuels in the condensed phase, and radical scavenging in the body [4, 46–49]. When
these systems are large and complex, containing thousands of radical-radical and radical-
molecule reactions, it is difficult to elucidate all reaction pathways by hand. It is much
easier and less error-prone to generate these mechanisms automatically. Thermodynamic
solvation corrections are one necessary component in automatic mechanism generation for
liquid-phase systems. It is important to have correct thermodynamics, because the equi-
librium constant, Keq, is calculated from ∆Grxn. The accuracy of these Keq and therefore
∆Grxn can change the reaction mechanism dramatically. Because of solute/solvent interac-
tions, the thermodynamics for individual chemical species, which are used to calculate the
overall reaction thermodynamics, are different in gas and liquid phase. Experimental in-
formation is not available for every species (and is especially sparse for radicals), so there
exists a need for some other way to estimate the thermodynamic parameters. Literature
data and estimation methods are already implemented into the Reaction Mechanism Gen-
erator for gas-phase thermodynamics [7, 8, 22]. Using these gas-phase data, we can make
so-called “corrections” to the thermodynamic data for the liquid-phase. Such methods exist
in the Java version of RMG [50]. I have implemented these methods in the Python version
of RMG, and extended the capabilities further as part of this dissertation.
18
2.1 Background
Below is a review of methods used by chemists to calculate the liquid-phase ther-
modynamics parameters that are necessary for mechanism generation. The parameters can
either be calculated explicitly using computational chemistry, or they can be estimated us-
ing empirical relationships.
2.1.1 Explicit thermodynamic calculations
Calculation of thermodynamic parameters in solution is performed using a variety
of methods, including discrete models such as quantum mechanical (QM), molecular me-
chanical (MM) and hybrid (QM/MM) models [51], and continuum models [52]. Discrete
solvation models treat each solvent molecule separately and can be computationally ex-
pensive, especially in the case of pure QM methods. These methods also cannot represent
long-range, bulk phenomena in solvent. Continuum methods, including the polarizable
continuum methods (PCM), multipole expansion (MPE) and Generalized Born (GB), are
less expensive when taking a QM approach. However, they have the disadvantage of ne-
glecting local interactions between solute and solvent. A more thorough description of
these methods can be found in Jalan et al. [53], and in Chapter 3.
2.1.2 Estimation techniques
Another approach to calculating solvation thermodynamics are estimation meth-
ods, such the linear solvation free energy relationship method (LSER). This approach is
based on the assumption that the solvation thermochemistry of a single species can be bro-
ken down into individual contributions from solute/solvent properties such as cavitation,
dispersion, hydrogen bonding and polarizability. Kamlet and Taft suggested that these con-
tributions could be quantified in terms of electronic transitions, such as π to π? and p to
p?, which occur when a solute is solvated. Kamlet and Taft’s observations along with other
19
contributions resulted in the solvatochromatic equation [54]:
SP = SP0 + sπ?1 + dδ + aα1 + bβ1 + h(δh)2
SP refers to any solvation parameter (with SP0 an intercept); for example, one solvation
property to be calculated is the logarithm of the partition coefficient between gas and sol-
vent (logK). The lowercase parameters are fitted parameters for a given solvent, while the
other parameters represent the following: π?1 is the electrostatic contribution due to dipo-
larity/polarizability, α1 is hydrogen bond donation, β1 is hydrogen bond acceptance, δh
represents cavity formation, and δ is a polarizability correction factor [55].
Abraham et al. [56] later refined this equation:
SP = c+ eE + sS + aA+ bB + lL
where the capital letters purely represent solute properties and again, the lowercase letters
are fitted solvent parameters. These solvent parameters have been previously tabulated and
published for many solvents; for example, those for water and 1-octanol can be found in
[57]. Of the solute descriptors, S represents the solute’s electrostatic interactions due to
dipolarity and polarizability and was derived from the π? parameter; L is a representation
of size based upon the solute’s gas-hexadecane partition coefficient; E is derived from the
Kamlet-Taft δ parameter and serves as a correction to S; A represents hydrogen bond do-
nation ability/acidity of the solute; and B is the hydrogen bond acceptance ability/basicity
[55].
Some correlations are used to find the solute (uppercase) Abraham parameters indi-
vidually (reviewed in [53]), but molecular structure group additivity methods can be used
to calculate all of the parameters at once. In particular, Platts et al. devised 81 molecular
structure fragments to be used in the calculation of S,B,E and L, and another 51 frag-
ments for A [58]. These fragments include atom-centered contributions, such as a carbon
20
(a) (b)
Figure 2.1: Examples of molecular structure fragments. (a) An atom-centered methylgroup, consisting of a carbon atom with three single bonds to hydrogen atoms. (b) Anon atom-centered ester group, specified by several atoms.
atom attached to four non-hydrogen atoms; group-based corrections, such as fused rings;
and intra-atomic interactions, such as ortho, meta and para interactions. Examples of some
of these groups are given in Figure 2.1. The contributions for each of these fragments are
added together to obtain a value for each solute descriptor. Then, the Gibbs free energy of
solvation, ∆Gsolv at 298 K can be found from the partition coefficient with the following:
∆Gsolv = −RT lnK
Mintz et al. found solvent descriptors to utilize the solvatochromatic approach with
the Abraham solute descriptors for the prediction of the enthalpy of solvation, ∆Hsolv [59]:
∆Hsolv = ch + ehE + shS + ahA+ bhB + lhL
where the subscripts h denote that these solvent descriptors refer to the fitted parameters
corresponding to the enthalpy of solvation relationship.
To obtain ∆Gsolv at other temperatures, a method of linear extrapolation is com-
monly used. ∆Gsolv and ∆Hsolv at 298 K are used to calculate the entropy of solvation,
∆Ssolv:
∆Ssolv =∆Hsolv −∆Gsolv
T
Assuming that ∆Hsolv and ∆Ssolv are independent of temperature, ∆Gsolv at a given tem-
perature can be found with:
21
∆Gsolv(T ) = ∆Hsolv(298K)− T∆Ssolv(298K)
Other methods for treating the temperature dependence of ∆Gsolv can be used, as
in [60]; however, the Mintz method is both fast and useful in that the solute properties, cal-
culated via group additivity, can be used for both the partition coefficient and the enthalpy
of solvation.
The errors in the outlined linear solvation method for molecules at 298 K are com-
parable to those of the discrete and continuous methods, about 1 kcal/mol typically, but
higher for large molecules and ions [61]. However, the computational time required is
much shorter as compared to the more expensive QM and MM methods [53].
A study comparing force field (MM) calculations of solvation free energy with
SM6, a continuum QM method [62], and LSER, for nitroaromatic compounds, found that
the LSER method performs as well as the best force field methods. The LSER method had a
mean unsigned error of 0.59 kcal/mol as compared with experimental data, with all energies
slightly too negative. The deviation increased with increasing number of nitro functional
groups on the molecules. The SM6 method, in comparison, yielded a mean unsigned error
of 0.50 kcal/mol, was unaffected by the number of nitro groups, and is faster than MM
calculations. However, the error was highly affected by the level of theory used for the
geometry optimizations [63].
2.2 Methods
As mentioned, while solvation thermodynamics had been previously implemented
in the Java version of RMG [50], in this dissertation these capabilities were adapted for use
in RMG-Py. Additionally, several improvements to the prior implementation of solvation
thermodynamics were made in this work.
22
2.2.1 Adaptation of solvation thermodynamics from RMG-Java
(a) (b)
Figure 2.2: (a) Example of an entry in the solute group database, and (b) the molecularstructure of the corresponding group.
The estimation methods of Abraham and Platts which are described in the Back-
ground, using linear solvation energy relationships along with group additivity, were newly
implemented in this work. Specifically, the RMG-database [64] project, which is coupled
with the Python version of RMG, was modified. The addition of solute molecular struc-
ture groups consisted of adding two group databases, one for atom-centered groups and
one for non atom-centered groups. The groups are specified such that each atom (besides
hydrogen) in a molecule must belong to exactly one atom-centered group, and may also
belong to one non atom-centered group. An example entry in the solute group database
and its corresponding structure is shown in Figure 2.2. Each group is defined by a label, an
adjacency list specifying the atom types and bonds included in the group, and the Abraham
solute parameters S,B,E, L and A. The starred atom is the one matched against atoms in
the solute molecule in RMG, and “Cs” indicates that it is a carbon atom with only single
bonds.
23
A solvent database also was added, where the solvent is defined by either an adja-
cency list or its SMILES string identifier. The parameters included in the solvent database
are the solvent coefficients for the logK and ∆Hsolv linear solvation energy relationships.
Each entry also contains the A-E coefficients of the viscosity-temperature correlation to be
used in diffusion corrections. Some entries also store the solvent’s dielectric constant and
its solute A and B parameters for potential use in kinetic solvent corrections. The diffusion
and other kinetic solvent corrections will be explained in the following chapter.
2.2.2 New additions in RMG-Py
Once the solute and solvent parameters are obtained or calculated based on the
Platts group fragments, the functions to calculate partition coefficient K, ∆Hsolv, and ulti-
mately ∆Gsolv(T ) were implemented using the equations in the Background. These meth-
ods were added to a new solvation module in RMG-Py, with modifications also made to
the source code in the thermo and molecule modules to calculate and apply the solvation
correction during an RMG simulation. Modifications also were made to the main RMG-Py
execution in order to load the user-specified solvent at the start of the so that the proper sol-
vent descriptors would be loaded. Furthermore, in this work, several important updates and
improvements were made to the implementation of solvation thermodynamics in RMG-Py,
which are not present in RMG-Java
2.2.2.1 Treatment of radicals and Abraham A value
No Abraham parameters for logK or ∆Hsolv exist for radical species. Therefore, we
must devise a method to calculate solvation thermodynamics of radicals using the available
data. By comparison, for gas phase thermodynamics, radical species (R∗) data are calcu-
lated using the saturated species (RH) data and hydrogen bond increment (HBI) theory,
explained in Chapter 1 [28]. One can similarly use the saturated species data to calculate
the solvation thermodynamics of radicals by calculating the solute descriptors, and then
24
correcting the Abraham A solute descriptor to account for the effect on hydrogen bonding
caused by removing a hydrogen atom. For example, the effect of creating a peroxyl radical
from a peroxide group by removing a hydrogen is a 0.345 decrease in A [58]. 14 Platts
groups for this A descriptor correction have been added to a “radical” database in the cur-
rent RMG-Py. These A descriptor corrections were obtained from Table 5 of [58], with
slight modifications made to adhere to the RMG format of adjacency lists and to ensure
that the correction is not double counted for groups where there are two hydrogen bonded
atoms. The sign of the values were flipped from those provided in the table since the cor-
rection is applied when a hydrogen is removed. With these changes, the implementation of
radical groups in RMG-Py is shown in Table 2.1.
2.2.2.2 Treatment of lone pairs in solvation thermodynamics
Currently, there exist no Platts groups to account for lone pairs on atoms above their
normal bonded configuration (i.e., 0 for carbon, 2 for oxygen). Similarly, there also exist
no gas-phase Benson groups for carbon centered groups with one or more lone pairs. Since
these groups would not match existing Platts or Benson groups, in both cases, would be
estimated by RMG incorrectly. For example, since 1CH2 is only singly bonded to 2 groups,
in this case hydrogen atoms, it would not match any Platts groups that are sp3 carbon atoms
bonded to 4 groups, or any sp2 or sp groups. Therefore, it would fall up RMG’s molecular
structure group tree to a generic sp3 carbon atom. This generic group is filled in with the
data for a carbon atom bonded to 4 other carbon atoms (as in neopentane), which would
have very different thermodynamic properties than 1CH2. When this situation occurs, rather
than using this erroneous data, I devised and implemented a new method to convert these
lone pairs to unpaired electrons, therefore converting 1CH2 to 3CH2. RMG would then
proceed with the algorithm in the prior section; that is, it would saturate these unpaired
electrons, creating CH4, and calculate the solute descriptors for that molecule. While not
relevant in this example, if removing these two hydrogen atoms would change the A solute
25
Radical fragment Hydrogen bonding correction to A
-0.345
-0.345
-0.243
-0.087
-0.371
-0.543
-0.247
-0.275
-0.281
0.091
0.0825*
0.119
-0.17
Table 2.1: Hydrogen bonding parameters from [58], adapted for corrections to the solvationthermodynamics of radical species when saturating the radical with hydrogens causes ahydrogen bonding effect. *the value was divided in half since there are two hydrogen-bonded nitrogen atoms in this molecule.
26
descriptor, it would also be updated as in the true radical electrons case. Similarly, the gas-
phase thermodynamics methods were analogously updated to convert lone pairs to radicals
when we do not have data for the specific lone pair group.
2.2.2.3 Additional Platts groups
Carbon, hydrogen and oxygen atom-containing Platts groups for calculating solute
descriptors were originally included in the Java version of RMG. To adapt to increasing
capabilities of RMG-Py, in this work, additional published Platts groups were added to the
database of RMG-Py. Specifically, these were groups including nitrogen and sulfur atoms
[58]. In the future, halogen groups calculated by Platts should be added to the database, as
capabilities for chlorine and fluorine are currently being added to RMG. Additionally, sili-
con atom-containing groups should be added, but must first be calculated, as these groups
were not previously published.
2.2.2.4 Solute database
Using group additivity to calculate the five solute parameters for every species in
every reaction increases the time needed for mechanism generation. For many molecules,
the solute descriptors have already been calculated and tabulated [65]. Thus, in this work a
solute database was created from this published data, which currently contains 152 values.
If RMG can find the molecule of interest in this solute database, it will use the values given.
If it cannot, then it will use group additivity to calculate the solute descriptors.
27
2.3 Results
The calculations of solvation thermodynamics were validated to ensure that both
the group additive scheme for estimating solute descriptors as well as the Abraham and
Mintz correlations for logK and ∆Hsolv are sufficiently accurate.
2.3.1 Estimation of solute descriptors
Figure 2.3: g-decalactone structure, with atom-centered groups circled in green and nonatom-centered groups in blue.
For verification that the algorithm to predict solute descriptors is implemented cor-
rectly into RMG, the Abraham solute parameters for g-decalactone (Figure 2.3), a molecule
consisting of several atom-centered and non-atom-centered groups, are compared to exper-
imental values from ACD/Labs [66]. Table 2.2 displays the results of the comparison. The
values predicted by RMG-Py and the experimental values closely agree, showing that RMG
can identify molecular structure fragments via their adjacency list and count them correctly
for groups containing carbon, hydrogen and oxygen atoms. The groups containing nitrogen
and sulfur have been implemented, but not yet tested.
Solute Parameter Experimental Predicted by RMG-PyS 1.26 1.26B 0.55 0.54E 0.32 0.39L 6.27 6.33A 0 0
Table 2.2: Abraham solute parameters for g-decalactone calculated by group additivity inRMG-Py, shown with experimental values from from ACD/Labs [66]
28
(a) (b)
Figure 2.4: Comparison of solvation thermodynamics values for 20 solutes in water calcu-lated by RMG-Py to those in databases. (a) ∆Hsolv, compared to values in Mintz et al. [59](b) ∆Gsolv, compared to values from the University of Minnesota solvation database [67].
2.3.2 Calculation of solvation thermodynamics
The logarithm of the partition coefficient, logK and ∆Hsolv are calculated with
linear solvation energy relationships. These relationships utilize the solute descriptors,
which we showed can be calculated via group additivity in the previous section, along with
the solvent descriptors. Then, ∆Gsolv was obtained from K. For 20 solutes in water at
298 K, the calculated values of ∆Hsolv were compared to those in Mintz et al. [59] and
those of ∆Gsolv were correlated with those from the University of Minnesota solvation
database [67]. The mean absolute deviation was 4.2 kJ/mol for ∆Hsolv and 2.9 kJ/mol for
∆Gsolv. The results of this comparison are illustrated in Figure 2.4. Because the Minnesota
solvation database was likely used to train the values for the Platts group contributions,
the similarity in values should be interpreted as a demonstration of reproducibility (also
acknowledged in Jalan et al. [50]).
29
2.4 Summary
In this chapter of the dissertation, solvation thermodynamics was enabled in the
Python version of the Reaction Mechanism Generator (RMG). The calculation methods,
involving linear solvation energy relationships and a group additivity method for determin-
ing solute descriptors, were adapted into RMG-Py, with the source code updated with these
changes. In addition, treatment for the solvation thermodynamics of radicals was improved
by using saturated species thermodynamics and the Abraham A value, more of which were
added to RMG’s database. Solvation thermodynamics for species with lone pairs were
treated by converting the lone pairs into unpaired electrons. More group additivity values
were added to RMG’s database for species containing nitrogen and sulfur, and estimation
was sped up and improved by incorporating a database of known solute descriptors. It
was ensured that group additivity was implemented properly by comparison of ∆Gsolv and
∆Hsolv to the values in University of Minnesota’s solvation database [67]. Solute descrip-
tors for a complex molecule were also successfully compared to experimental values from
ACD/Labs [66]. The progress made in these areas make it possible to correct gas-phase
thermodynamics to account for reactions in different solvents, and ultimately are the first
step to generating detailed kinetic models for liquid-phase systems using RMG-Py.
2.5 Recommendations
The recommendations outlined below will further improve the estimation of solva-
tion thermodynamics in RMG-Py. These involve improving the temperature dependence
of ∆Gsolv, refining treatment of the solvation chemistry of species with lone pairs, better
benchmarking, and expanded solvent list.
30
2.5.1 Temperature dependence of solvation thermodynamics
The linear extrapolation method used for calculating ∆Gsolv at temperatures other
than 298 K, i.e., by assuming that ∆Hsolv and ∆Ssolv are constant, is not always an ac-
curate assumption for temperatures far from 298 K. For example, it was shown to be very
inaccurate for O2 in water; while the RMG estimation method would have ∆Gsolv increase
monotonically with temperature, the experimental values only increase slightly, and begin
to decrease near 450 K [68, 69], (Chung, unpublished work). Other methods for treating
the temperature dependence have been previously suggested and implemented [53, 60].
Because they are slower than the current method, other methods should only be imple-
mented in RMG where the inaccuracy is expected to be high; for example, only above
certain temperatures or for certain chemical systems.
2.5.2 Calculation of solvation thermodynamics for lone pair species
A fix has been implemented to convert lone pairs to radical electrons in order to
compute solute descriptors; however, it would be preferable to directly calculate Platts
groups for lone pair species. Though there is a lack of experimental data for these molecules,
quantum chemistry calculations could be done to obtain solvation thermochemistry and
derive solute descriptors for these species. These descriptors could simply be put into the
solute library, or several calculations could be further processed to derive Platts group ad-
ditivity values.
2.5.3 Improved benchmarking of group additivity values
The solvation thermodynamic parameters calculated in this work were only com-
pared to values in the Minnesota solvation database, which were originally used to train
the molecular structure group values used to calculate the solute descriptors. While ex-
perimental ∆Hsolv and ∆Gsolv are not widely available, the calculated parameters could
be compared to high level quantum calculations to get a better feel for the accuracy of
31
the estimates. This type of comparison is often done to gauge the accuracy of gas-phase
thermodynamic values calculated by group additivity [28, 70].
2.5.4 Expansion and enhancement of solvents
The current list of 26 solvents available for calculating solvation thermodynam-
ics could be further expanded in the future. For example, there exist published Abraham
solvent descriptors for biological solvents such as blood in the brain [71], which would
facilitate kinetic modeling of even more diverse processes using RMG. Another useful
addition would be the ability to select mixtures of solvents for a RMG simulation, thus re-
quiring a method of interpolation between the solvent descriptors of two or more solvents.
Ben Amara et al. use dodecane as the single solvent in their RMG simulations, despite
investigating different mixtures of dodecane and methyl oleate as biodiesel surrogates, for
example [4]. Furthermore, if a solvent is also a reacting chemical species, the identity of
the solvent changes throughout the RMG simulation. Capability to make the solvent de-
scriptors similarly change as a simulation progresses would make the RMG simulations
more realistic.
32
3 IMPLEMENTING KINETIC SOLVENT EFFECTS IN AUTOMATIC
MECHANISM GENERATION
The previous chapter dealt with corrections to gas-phase thermodynamics to ac-
count for solvation, which is necessary in calculating the overall ∆Grxn and thus reverse
reaction rates. However, learning about complex liquid-phase systems also requires knowl-
edge of solvent effects on the forward rates of elementary chemical reactions. Many of
the reactions in liquid-phase mechanisms of interest, including fuel oxidation, are radical-
molecule and radical-radical reactions. Depending on the solvent, rates of reaction can vary
by orders of magnitude, thus changing likely pathways and product distributions. Further-
more, knowledge of kinetic solvent effects not only helps with generation of liquid-phase
reaction mechanisms, but can aid in the design of solvents to promote a desired reaction
pathway or product [72]. Two main effects must be understood: physical diffusion and
intrinsic kinetics effects.
To account for diffusion limitations on reaction kinetics, we calculate an effective
rate constant for each reaction, which depends on the intrinsic gas-phase reaction rate and
the diffusivities of reacting species. Diffusivities, diameters of reacting species, and solvent
viscosities are all calculated using empirical correlations. The specifics of these calcula-
tions will be outlined in this chapter.
Modifying reaction rates to account for a solvent’s intrinsic effect on the gas-phase
kinetics is more challenging. This intrinsic effect changes the chemical environment of
reactants and modifies the reaction barrier, as opposed to a solvent’s physical diffusion
limitation, which modifies the effective rate [73]. Solvent effects on the reaction rate de-
33
pend on both the nature of the solvent and the type of reaction occurring.
The intrinsic effect of the solvent on the reaction rate will be investigated using
quantum mechanical methods to find the energy difference between the reactant and transi-
tion states of chemical reactions, i.e. Figure 3.1. Several reactions within specific families
in RMG that are relevant to oxidation chemistry, such as hydrogen abstraction, will be
analyzed. Trends in solvent effect will be illustrated, based upon properties of the react-
ing species, for example, carbon chain length or presence of an alcohol group. Using these
trends, a scheme can be created to predict the change in rate with solvation based on molec-
ular structure and properties of the solvent.
Figure 3.1: Example of a potential energy surface for a reaction in gas-phase (black) and asolvent (blue). A solvent may have a different effect on the energy of reactants, transitionstates, and products in a chemical reaction. Reproduced from [73]
3.1 Background
Prior studies on intrinsic solvent effects employ a number of experimental and the-
oretical approaches, which will be outlined below. Previous solvent effect discoveries will
then be organized and discussed based on reaction family.
34
3.1.1 Experimental techniques for determining reaction rates in liquids
Experimentally determining reaction rates for radical reactions in solution can be
difficult due to the short-lived nature of some radicals; however, some methods have been
developed over the last century and are commonly used for measuring these kinetics.
In an early method pioneered by Briers and Chapman known as rotating sector, or the
intermittent-illumination method (IIM), a sample is exposed to a constant intensity of light
for intermittent periods of time, such that the amount of time spent in light and in the dark
remains constant [74–76]. The average reaction rate, WM, can be calculated by:
WM =kp√2kt
[M ]√φI(1 + r)−1
where kp and kt are the propagation and termination rates, respectively, [M ] is the concen-
tration of the compound under investigation, M , reacting with a radical, φ is the quantum
yield of photoinitiation, I is the light intensity, and r is the ratio of time in the dark to
time in the light [76]. This method has been applied to reactions in gas phase and solution,
including polymerization and radical recombination [77–79]. This method, however, can
only be used for some specific types of radical chain reactions, with one requirement being
that they can be photochemically initiated. [80]
A very common method for measuring the reaction rate of radical reactions in both
gas and liquid-phase is laser flash photolysis. In this method a sample is excited by a
pulse from a laser, and radical species are monitored by measurement of their spectral
absorption. The spectral absorption can be measured with electron spin resonance, in which
the unpaired electron of the radical interacts with the nuclei in the molecule leading to a
mapping of electron density [81].
An indirect way of measuring the rate constants for radical-molecule reactions is the
radical clock method, which uses a known unimolecular reaction rate and a measured prod-
uct distribution to determine an unknown radical-molecule reaction rate.[80] For example,
35
Roschek and co-workers developed radical clocks for peroxyl radical reactions using the
competition between a beta-fragmentation of a peroxyl radical and a bimolecular H-atom
transfer [82]. This concept is shown in Figure 3.2. Jha and Pratt point out some limitations
Figure 3.2: Concept of peroxyl radical clock from Roschek et al.[82]
to the type of molecule R1-H from which the hydrogen atom is abstracted [83]. If R1 is
either persistent or highly stabilized, it cannot carry the chain reaction, and a large con-
centration of substrate is required. They describe a modification the radical clock method
using peroxyesters, making it possible to study a wider range of reactions.
3.1.2 Computational chemistry
Another approach to computing intrinsic kinetic solvent effects is by comparing
reaction barriers calculated in both gas and solution using quantum chemistry. Differen-
tial solvation between reactants and transition states affects the reaction rate according to
transition state theory, which says that a quasi-equilibrium exists between reactant and ac-
tivated complexes, or transition states [32]. Several computational methods are commonly
utilized for obtaining geometries of reactants and transition states and their energies.
3.1.2.1 Density functional theory
One set of methods used to solve the Schrodinger equation numerically is based on
density functional theory (DFT), an approximation to the wave function with only three
variables used to obtain electronic structures of molecules, radicals and activated com-
plexes [84, 85]. The DFT method chosen has a significant impact on whether the species’
36
geometries and energies are accurate, and it was previously found that the accuracy of den-
sity functionals for predicting barrier height is correlated with their accuracy for transition
state geometries [86]. In addition, approximate functionals such as DFT predict the tran-
sition state energies too low because they incorrectly delocalize electrons [87]. However,
when comparing transition states in gas and liquid, this discrepancy matters less since it
will be present in both cases and some cancellation of error will occur. When DFT is used
to compute solvent effects, comparison to experimental rates shows that it provides a high
enough level of theory to capture the desired effects [88, 89].
3.1.2.2 Solvation models
Figure 3.3: Comparison between a continuum solvation model (left, figure from [90]) anda hybrid implicit/explicit model (right, figure from[91])
Computational methods for estimating solvation energies are reviewed briefly in
the previous chapter and in [53] and generally fall into two categories: those that represent
solute and solvent molecules explicitly, and those that represent only solute molecules ex-
plicitly and the solvent molecules somewhere in between explicit and continuous. Explicit
treatment is done either quantum mechanically (QM) or with molecular mechanics (MM),
or some combination of both, as in QM/MM.
One hybrid implicit/explicit solvation method is the shells theory proposed by Pliego
[92]. In this solvation treatment, the solvent shell closest to the solute (S1), representing
solute-solvent interaction, is treated either fully quantum mechanically or with molecular
dynamics based on classical force fields. The remaining solvent, S2, is treated with contin-
uum solvation. When the number of solvent molecules in S1 becomes infinite, this theory
37
converges to the full discrete solvent representation. This approach is also known as the
cluster-continuum model, mixed discrete-continuum model, or quasichemical theory [93].
Continuum solvation models represent solvation as a solute placed inside a cavity
within an implicit solvent, which is modeled as a continuum with a constant property such
as conductivity or dielectric constant. See Figure 3.3 for a pictoral comparison between
continuum and a more explicit method. The solute cavity can be shaped like a sphere or
ellipsoid, or as in more modern methods, based upon a superposition of atom-centered
spheres [93]. However, representing a solvent this way does not account for local solute-
solvent interactions, and the assumption that the dielectric constant near the solute surface
is equal to the bulk dielectric constant is inaccurate.
A recommended method for calculating liquid-phase energies, which takes into
account contributions from the first solvation shell, is SMD [94]. This model, like the
authors’ other SMx methods, include a term for non-electrostatic effects due to cavity for-
mation, dispersion interactions, and solvent structure. The contribution is dependent upon
the solvent-accessible surface area (SASA) of each solute atom. The main feature distin-
guishing SMD from the other SMx methods is that it utilizes a continuous charge density of
the solute, rather than a discrete representation. In the Gaussian computational chemistry
program [95], the method is combined with the polarizable continuum method (PCM) [96]
for single-point energy calculations on a solute in a solvent. However, it can be used with
other algorithms such as COSMO [97] and COSab [98].
A recent method developed by Pomogaeva and Chipman, known as the composite
method for implicit representation of solvent (CMIRS), uses six parameters to describe
interactions between solute and solvent including dispersion, exchange, hydrogen bonding,
and long range electrostatic interactions. Because of low level of parameterization, this
model is believed to capture a higher level of physical truth. For hydration energy, the
mean unsigned error may be as low as 0.8 kcal/mol for neutral solutes and 2.4 kcal/mol
for ionic solutes and has been parametrized for the B3LYP and Hartree-Fock quantum
38
chemistry methods [99]. With regards to solvation kinetics, Silva et al. parametrized the
CMIRS model for methanol in order to predict activation free energy barriers for SN2 and
SNAr reactions. Both CMIRS and SMD were compared to experimental data of solvation
free energies and while they perform similarly for neutral species, the MUE for CMIRS is
lower in the case of anion and cation solutes. For free energy barriers, CMIRS performs
similarly to COSMO-RS while SMD is slightly worse [100].
3.1.3 Kinetic solvent effects within reaction families
Kinetic solvent effects are usually deduced, and are most generalizable, within a
particular reaction type. However, as will be shown, the kinetic solvent effect can vary
within a reaction family and across solvents.
3.1.3.1 Hydrogen abstraction
Figure 3.4: The solvent effect on hydrogen abstraction from α-tocopherol is independent ofthe radical (DPPH (x-axis) or TOH (y-axis)) Each number from 1-13 represents a differentsolvent. Reproduced from [101]
The largest body of literature on solvent effects on reaction rates is in bimolecular
hydrogen abstraction reactions. Das et al. used laser flash photolysis to study the reaction
39
of tert-butoxyl radicals with phenols in six solvents [102]. The rate decreased in polar sol-
vents, explained by the capability of the phenolic OH group to hydrogen bond with solvent
molecules. Ingold and co-workers have attempted since to further deduce these solvent
effects in hydrogen abstraction reactions. Valgimigli et al. found that the solvent effect on
abstraction of the phenolic hydrogen from α-tocopherol by both tert-butoxyl radical and
2,2,-diphenyl-1-picrylhydrazyl (DPPH·) is independent of the radical in almost every sol-
vent they tested [101] (see Figure 3.4). This result is especially surprising, since for these
two radicals, the reaction rate in the same solvent differs by over 106. Any deviation from
this behavior, as in tert-butyl alcohol, is thought to be due to the reaction being partially
diffusion controlled. The reaction of α-tocopherol with tert-butoxyl was further investi-
gated in four solvents; the rate constant decreased with increasing βH2 value [103]. The
βH2 parameter represents the hydrogen bond acceptance ability/basicity of the solvent. Fi-
nally, discrepancies between the data obtained and another study on the reaction of Trolox
with Cl3COO· [104] are explained by a mechanistic shift from hydrogen atom abstraction
to electron transfer in solvents with high dielectric constants and basicities. This electron
transfer mechanism may also be accompanied by a solvent-assisted proton loss, known as
sequential proton loss electron transfer (SPLET):
Cl3COO ·+ArOH −→ Cl3COO− + ArO ·+H+(1)
Cl3COO ·+ArOH + S −→ Cl3COO− + ArO ·+SH+(2)
Thus, the electron transfer mechanism can account for rates which are higher than
the rate expected from simply using a correlation with a solvent property. Foti et al. also
discovered this fast electron-transfer reaction between phenols and DPPH [105]. While hy-
drogen atom transfer was dominant in nonpolar solvents most of the time, electron transfer
still occurred if the radical was strongly oxidizing, as with Cl3COO·. Reactions proceeding
via the electron-transfer mechanism should be faster in polar solvents. But surprisingly, the
40
rate constant was higher in ethanol than methanol despite methanol having a higher dielec-
tric constant; they attributed this inconsistency to solvent impurities.
The Snelgrove-Ingold correlation for hydrogen abstraction reactions relates the dif-
ference in rate constant between the reaction in gas and in solvent to the solvent’s αH2 and
βH2 (hydrogen bonding) parameters [106]:
log10
kgasksolvent
= 8.3αH2 β
H2
Later, it was found that this single empirical equation could not describe an entire
reaction family [107–109]. In solvents which support ionization, some hydrogen abstrac-
tion reactions, for example that of 2,2’-methylene-bis(4-methyl-6-tert-butylphenol) (BIS)
with DPPH·, proceeded by the SPLET mechanism [109]. This led to a reaction which is
zero-order in DPPH·. Because this is not true for the reactions of all phenols with DPPH·,
it was suggested that properties of the reacting phenol play a role. One such property is the
intramolecular H-bond in BIS, which may slow the reverse proton-transfer reaction, thus
leading to the unusual effect observed. Furthermore, Nielsen and Ingold found that the βH2
scale does not account for solvents’ anion solvating properties, and thus reactions involving
proton transfer do not quite follow the Snelgrove-Ingold correlation [110]. The Taft β scale
[54] gives better correlations for this type of reaction. An experimental study by Warren
and Mayer also note the failure of the generalized Snelgrove-Ingold correlation. They stud-
ied the effect of small amounts of solvent additives on the oxidation of ascorbate (vitamin
C) by TEMPO radical. Their results indicate that solvent effect on hydrogen abstraction
reactions is better explained by local solvent effects, as the effects are much greater than
can be explained by bulk solvent properties [111].
In some types of hydrogen abstraction reactions, the effect of changing the solvent
is low. The reaction between ascorbate and 2,2,6,6-tetramethylpiperidine-1-oxyl radical
was studied experimentally [112]. The mechanism was best explained by proton-coupled
electron transfer (PCET) (Figure 3.5), where an electron and proton are transferred simul-
41
Figure 3.5: Example of the PCET mechanism, reproduced from [112]
taneously but between different sets of orbitals. The solvent was varied between water
and mixtures of water and dioxane, decreasing the polarity. The quantity investigated was
the kinetic isotope effect (KIE), which is defined as the ratio of the rates of hydrogen ab-
straction in water and in D2O. KIE was found to only slightly increase with decreasing
solvent polarity. Interestingly, hydrogen tunneling was suspected to take place in all sol-
vents studied, because the experimental KIEs were larger than expected by semiclassical
theory.
From these studies it has generally been understood that solvent effects on hydro-
gen abstraction reactions are significant when looking at O-H bond abstraction, primarily
because of the O-H bond’s ability to participate in hydrogen bonding networks, but that
the effect on C-H bond abstraction is negligible. Despite this assumption, Koner and co-
workers generalized these effects to C-H and Sn-H bonds [113]. They explain the results as
a stabilization of the abstracting species in polar solvents, rather than the hydrogen donor
(which is the case for abstraction from the O-H bond). Thus, the nature of the abstracting
species has a large effect on the reaction rate in solvents. They still hypothesize that the
reaction of a non-polar hydrogen donor and non-polar abstracting radical will have little
solvent effect, but acknowledge this is hard to test due to the high activation barrier of these
reactions.
42
3.1.3.2 Radical addition to multiple bonds
Kinetic solvent effects on the addition of radicals to multiple bonds has mainly been inves-
tigated theoretically and has been extensively studied by the groups of Fouassier [114–116]
and Radom [117–120]. When various radicals were added to methyl acrylate using DFT
calculations, the rate of reaction in various solvents correlated well with the dipole mo-
ment of the solvents; however, it was argued that a multipole approach was still needed
[115]. Polar solvents still had a small effect on the rate if the charge transfer from the
reactants to transition states, calculated using a Mulliken charge analysis [121], was low.
Wong and Radom performed calculations using the self-consistent isodensity polarizable
continuum model (SCIPCM) [52, 117]. In the case of radicals with saturated substituents
adding to alkenes with saturated substituents, addition of any solvent increases the barrier.
Solvent decreases the barrier in the unsaturated case. The higher the dielectric constant of
the solvent, the greater this effect. Finally, Garcia et al. investigated the addition of sev-
eral radicals to methyl aminoacrylate with DFT and found that for an unspecified solvent,
barriers increase with solvation for electrophilic radicals (phenyl and trifluoromethyl) and
decrease for nucleophilic radicals (methoxymethyl and methyl) [122]. From these studies,
it can be concluded that the chemical nature of the radical has a larger effect on the rates
than the polarity of the solvents in this reaction family.
3.1.3.3 Beta scission
β-scission of tert-butoxyl radicals using electron spin resonance to measure rates was in-
vestigated by Weber and Fischer [124]. They found that at 300 K, the solvation rates were
at least ten times larger than the gas phase rates. This result was explained by a transition
state effect, where the transition state is more polar than the radical and is thus more sta-
bilized by interactions with polar and polarizable solvents. Furthermore, the β-scission of
cumyloxyl radicals using laser flash photolysis also showed a rate increase with increas-
ing solvent polarity [125]. The same effect was found for alkyloxyl radicals [126]. Bietti
43
Figure 3.6: β-scission rates correlate with Dimroth-Reichardt parameter ET . Reproducedfrom [123]
et al. also confirmed this effect and found that the solvent Dimroth-Reichardt parameter
(ET ) correlated well with the increase in rate [127], confirming earlier results by the Ingold
group [123], see Figure 3.6. The ET parameter represents the charge-transfer absorption of
the solvent in pyridinium N-phenolbetaine and serves as a different measure of solvent po-
larity than the dielectric constant [128]. For example, methyl formamide has an extremely
high dielectric constant but is of similar polarity to methanol as characterized by its ET
value. It appears from these studies that the rates of β-scission reactions can be general-
ized to increase with some measure of solvent polarity such as ET . However, since only
reactions of cumyloxyl and alkyloxyl radicals have been investigated, and mostly in water,
acetonitrile or mixtures of the two, it is difficult to infer kinetic solvent effects for the entire
reaction family.
3.1.3.4 Diels-Alder
Like the other reaction families discussed, the rates of Diels-Alder reactions in so-
lution, in general, depend on both solute and solvent properties [130]. Breslow et al. [131]
described a large increase in rate of Diels-Alder reactions in water and in stereoselectivity
between endo and exo products. They explain the acceleration in terms of the reactant struc-
tures; the diene and dienophile engage in hydrophobic stacking. Experimental studies with
44
Figure 3.7: Example of Diels-Alder reaction of cyclopentadiene and (-)-menthyl acrylateand possible product isomers. Figure reproduced from [129]
methanol show that it is indeed this hydrophobic effect rather than a polarity effect which
increases the rate in water, as the rates of some reactions actually decrease in methanol,
which is unexpected.
Later, Ruiz-Lopez et al. [129] studied Diels-Alder reactions of cyclopentadiene and
methyl acrylate using ab initio calculations. They found that because the solvent’s electric
field changes the shape of the potential energy surface, a direct change in the overall reac-
tion mechanism is seen with solvation. It is argued that only adding the solvation energy to
the gas-phase energy is not sufficient to determine the reaction path in solution. However,
they also maintain that continuum theory models are sufficient for capturing some specific
interactions with solvents, such as hydrogen-bonding, since overall electrostatic effects
will implicitly include these properties. Another simulation study on the reaction of methyl
acrylate with cyclopentadiene was done by Sheehan and Sharratt using molecular dynam-
ics [132]. The reaction was studied in both methanol and n-hexane. The result showed that
the endo/exo selectivity of the reaction products is related to the difference in the solvated
transition states’ free energies. Further, the rates and selectivity were affected by properties
of the solvent such as their polarity and H-bonding ability. In methanol, the product was
energetically favored as compared to the energy in n-hexane; this was explained by the
similarity in polarity between the product and methanol.
Soto-Delgado et al. studied the reaction of cyclopentadiene and methyl vinyl ke-
tone in both water and methanol, using a combined QM/MM-MD approach [133]. The
45
activation free-energy barrier in methanol is 2.1 kcal/mol higher than that in water, which
is within 0.1 kcal/mol from the experimental difference (a rate deceleration of 2 orders
of magnitude. Again, the ability of hydrogen bonding between solvent and transition
state, which is stronger and longer-lived in water than in methanol, contributes to this
effect. Kiselev et al. compared reaction rates of 9-(hydroxymethyl)anthracene and 9,10-
bis(hydroxymethyl)anthracene with maleic anhydride, N-ethylmaleimide and N- phenyl-
maleimide, in organic and water-1,4-dioxane cosolvents [134]. It was found that even
those reactants which do not hydrogen-bond with water experienced acceleration of rate in
water, depending on the structure of the diene. The organic cosolvents reduced the reaction
rate, also depending on the polarity of the reactants.
These studies illustrate the importance of both polarity of reactants and hydrogen
bonding ability in kinetic solvent effects of Diels-Alder reactions, and verify that contin-
uum models and other computational methods can capture both of these effects reasonably.
Despite the success of these models, experimental studies were necessary to show the hy-
drophobic effect that is crucial in some cases. Additionally, merely adding solvation en-
ergies to those obtained in the gas-phase is not sufficient enough when the transition state
geometries or shape of the potential energy surface change significantly in a solvent.
3.1.3.5 Acetylation
Figure 3.8: Acetylation mechanism, reproduced from [135]
Xu et al. [136] theoretically investigated the catalyst-assisted acetylation of tert-butanol
with acetic anhydride in the gas phase and in three solvents. The geometry optimization
46
and reaction energies were found with B3LYP/6-311++G(d,p) and B3LYP/6-31G(d), while
solvation energies were found using PCM. They found that the reaction proceeds via a
mechanism characterized by nucleophilic attack of the catalysis, and that this mechanism
does not change, nor does the rate-limiting step, with solvent. Because polar solvents
solvate the reactants better than the transition states or reaction intermediates, the reaction
is less favorable in these solvents. Consequently, the reaction proceeds less favorably in
going from gas, to carbon tetrachloride, to chloroform, and finally to dichloromethane, the
most polar solvent of the group. These results are also supported by early experimental
studies [137].
3.1.3.6 Epoxidation
The epoxidation of olefins by hydrogen peroxide was studied both experimentally and with
DFT by Berksessel and co-workers [89, 138, 139]. They find that for these reactions,
using fluorinated alcohols such as HFIP accelerates the rate in relation to 1,4-dioxane [138,
139]. From 0-4 molecules of HFIP were studied with the reactants quantum mechanically
in the gas-phase, and then the whole system was treated with PCM [89]. Acetone was
chosen as a model solvent for HFIP, because of its similar dielectric constant, with some
additional considerations for cavitation. The activation enthalpies were shown to decrease
with increasing number of HFIP molecules, while increasing contribution of entropy lead to
the Gibbs free energy of activation reaching saturation with three or four HFIP molecules.
The same theoretical study was done using methanol as a solvent, but increasing the number
of methanol molecules had no influence. However, the activation barrier was reduced more
with methanol than with HFIP. This result shows that methanol acts only as a polar solvent
for the reaction, and explicit hydrogen-bonding with methanol does not affect the reaction
rate, as it does with HFIP. While prior theoretical study by the Shaik group showed that
fluorinated alcohols increased the epoxidation reaction rate [140], the studies by Berkessel
explicitly showed the effect of multiple aggregates of the solvent molecules.
47
(a) (b)
Figure 3.9: Plots for epoxidation rate of β-caryophyllene versus differing solvent parame-ters, reproduced from [141]
Later, Steenackers et al. experimentally studied the epoxidation of β -caryophyllene
to caryopyllene oxide in aqueous H2O2, alcohols, nitrogen containing solvents, and furans,
11 in total [141]. In all cases, the rate correlated extremely well with Abraham’s hydro-
gen bonding parameters (R2 = 0.97), and interestingly, not well at all with the dielectric
constant (R2 = 0.17), as shown in Figure 3.9 reproduced from the paper. They further char-
acterized solvent effect using ωB97XD/g-311++G(df,pd) and IEPCM. This computational
study confirmed previous studies that the solvent stabilizes the O-O bond in H2O2 in the
transition state structure via hydrogen bonding.
3.1.3.7 Hydrolysis
Almerindo and Pliego studied the hydroylsis of formamide with ab initio calculations and
PCM [142]. 1-4 explicit water molecules were considered. Two mechanisms, stepwise and
concerted, were investigated; see Figure 3.10. For the stepwise mechanism, the activation
barrier increased by 4.6 kcal/mol with one water molecule and 11.0 kcal/mol with two water
molecules. Adding water molecules beyond two made the system entropically unfavorable.
For the concerted mechanism, the solvation only increased the barrier by 6.4 kcal/mol with
two water molecules, indicating that the transition state is more stabilized by the solvent
48
Figure 3.10: The possible mechanisms in the hydrolysis of formamide. Figure reproducedfrom [142]
than in the stepwise mechanism. Additionally, the level of theory used for the calculations
had a large effect on the barriers. The discrepancy caused by the level of theory was larger
than the difference between using geometries optimized in the liquid-phase, rather than
geometries optimized in the gas phase and then modeled with PCM.
3.1.3.8 O-neophyl rearrangement
The O-neophyl rearrangement of two 1,1, diphenylethoxyl radicals was investigated with
laser flash photolysis in five solvents by Bietti and Salamone [143]. For both radicals, the
rate constant decreased with increasing solvent polarity. There was a linear correlation
found between the logarithm of the rearrangement rate constant and the Dimroth-Reichardt
parameter E NT . Because E N
T represents the solvent anion’s solvating ability, the trend was
explained in terms of the “decrease in the extent of negative charge on the oxygen atom on
going from the starting radical to the transition state.” [143]
49
3.2 Methods
As the numerous studies in the Background showed, chemical reactivity can change
drastically in different solvents. While some rates change systematically with solvent po-
larity, it is not always clear what properties of the solvent and reactant structure will have
an effect on the rates. Experimental data are only available for certain radical reactions
in some reaction families, most extensively hydrogen-abstraction. Again, because RMG
contains information and estimation methods for gas-phase kinetics, it is desirable to de-
vise a method to correct these rates in the liquid-phase, which is done in this dissertation.
For diffusion effects, simple correlations are available for correcting reaction rates, and are
utilized here; however, correcting intrinsic kinetics is more complicated. Using quantum
chemistry provides a powerful tool for determining reactant and transition state energies.
These quantum calculations are expensive, so in this work, we take the approach of study-
ing a subset of reactions and generalizing the results.
3.2.1 Diffusion
Diffusion corrections to reaction rate are needed because reactions are physically
impeded by the presence of solvent. To implement this phenomenon into RMG-Py, simple
correlations based on the hard-sphere approximation were used to minimize computational
cost and complexity. The correction is only made in the bimolecular direction; no cor-
rection is needed for unimolecular reactions. For reactions which are bimolecular in both
directions, the diffusion correction is calculated based upon the direction where it will have
the greater effect. In all cases, since the equilibrium constant does not change as a re-
sult of diffusion, it is used to modify the reverse rate of reaction proportionally [50]. This
modification is implemented by calculating an effective rate constant [144]:
keff =4π(r1 + r2)(D1 +D2)kint
4π(r1 + r2)(D1 +D2) + kint
50
where r1 and r2 are the radii of the reacting species, D1 and D2 are their diffusivities, and
kint is the intrinsic reaction rate in the gas phase. The radii are calculated by assuming the
molecules are spherical and using the McGowan volume, which gives a contribution to the
volume for each atom and a subtraction for each bond [145]. The Stokes-Einstein equation
is used to calculate the diffusivities:
D =kBT
6πηr
and the viscosity η, dependent on temperature, can be calculated with the following:
η = A+B
T+ C log T +DTE
where A,B,C,D and E are fitted parameters tabulated for many solvents [146]. This
temperature dependence is an improvement over the implementation in RMG-Java, where
solvent viscosity is assumed to be its value at 298 K. The solvent database, which contains
the solvent descriptors described in Chapter 2, also include the A−E parameters for each
solvent.
All changes to the source code were made in RMG-Py’s solvation module, used to
store the solvent properties when loaded, and a newly created diffusionlimited module that
has functions based on the empirical correlations. Additionally, the reaction module was
modified in order to correct the gas-phase rate when solvation is turned on as specified by
the user in the RMG input file.
3.2.2 Intrinsic kinetics
Group-based estimation techniques have proved effective for predicting parame-
ters for mechanism generation, including thermodynamic parameters [27], kinetics [30],
and most recently, transition state geometries [39]. In this dissertation, I show that such
an approach is also possible for predicting the difference in a reaction’s activation energy
51
between the gas-phase and liquid-phase (∆EA). First, the effect on ∆EA of changing re-
actant functional groups was tested in small sets of hydrogen abstraction reactions. The
required reactant and transition state geometries and energies were calculated with density
functional theory (DFT) and a continuum solvation method, which have previously been
used to determine kinetic solvent effects [89, 136, 141]. All calculations were completed
using the Gaussian ‘09 computational chemistry package [95] carried out on Northeast-
ern University’s Discovery cluster, a high performance computing cluster. Based on the
observations from these calculations, a group estimation method was developed to predict
∆EA for different classes of solvents, described in the following sections. The method was
applied to a published n-dodecane/methyl oleate oxidation model previously built using
RMG, serving to represent a middle-distillate fuel with a fatty acid methyl ester (FAME)
additive, both which are components of biofuels [4]. I then observed the effect of including
kinetic solvent effects on predicted fuel induction period (IP).
3.2.2.1 Quantum chemistry protocol
For the quantum chemistry calculations, 53 hydrogen abstraction reactions were
chosen to best observe effects of functional groups on reaction barriers (Table 3.1, rep-
resented by SMILES). These reactions included carbon, hydrogen, oxygen and nitrogen
atoms, and probed the effect of alcohol groups versus alkanes, different abstraction sites
within the same molecule, unsaturated versus saturated rings, and carbon chain length. For
this set of reactions, gas-phase reactant geometries and transition state geometries were
optimized using M06-2X [147, 148]/MG3S [149–154]. The transition states were verified
with an intrinsic reaction coordinate calculation. Once a correct transition state was found,
single point energies were calculated in eight different solvents, for both the reactants and
transition states, using SMD [94]. These eight solvents, given in Table 3.2, cover a wide
range of dielectric constants and at least one falls under each of six categories of solvent
as defined by Schmid [155]: 1) nonpolar, 2) aliphatic, 3) protic/protogenetic in which one
52
Reaction # heavy atoms1 [CH3] + C←→ C + [CH3] 42 [OH] + C←→ O + [CH3] 43 O[O] + C←→ OO + [CH3] 64 [CH3] + CC←→ C + C[CH2] 65 [CH3] + CO←→ C + C[O] 66 [CH3] + CO←→ C + [CH2]O 67 [OH] + CC←→ O + C[CH2] 68 [OH] + CO←→ O + C[O] 69 [OH] + CO←→ O + [CH2]O 610 O[O] + CC←→ OO + C[CH2] 811 O[O] + CO←→ OO + C[O] 812 O[O] + CO←→ OO + [CH2]O 813 [CH3] + CCC←→ C + CC[CH2] 814 [CH3] + CCC←→ C + C[CH]C 815 [CH3] + CCO←→ C + CC[O] 816 [CH3] + CCO←→ C + C[CH]O 817 [CH3] + CCO←→ C + [CH2]CO 818 [OH] + CCC←→ O + CC[CH2] 819 [OH] + CCO←→ O + CC[O] 820 [OH] + CCO←→ O + C[CH]O 821 [OH] + CCO←→ O + [CH2]CO 822 O[O] + CCO←→ OO + CC[O] 1023 O[O] + CCO←→ OO + C[CH]O 1024 O[O] + CCO←→ OO + [CH2]CO 1025 [CH3] + CCCO←→ C + CCC[O] 1026 [CH3] + CCCO←→ C + CC[CH]O 1027 [CH3] + CCCO←→ C + C[CH]CO 1028 [CH3] + CCCO←→ C + [CH2]CCO 1029 [OH] + CCCO←→ O + CCC[O] 1030 [OH] + CCCO←→ O + CC[CH]O 1031 [OH] + CCCO←→ O + C[CH]CO 1032 [OH] + CCCO←→ O + [CH2]CCO 1033 O[O] + CCCO←→ OO + CCC[O] 1234 O[O] + CCCO←→ OO + CC[CH]O 1235 O[O] + CCCO←→ OO + C[CH]CO 1236 O[O] + CCCO←→ OO + [CH2]CCO 1237 [CH3] + O1C=CC=C1←→ C + O1C=[C]C=C1 1238 [CH3] + O1C=CC=C1←→ C + O1[C]=CC=C1 1239 [CH3] + [NH]1C=CC=C1←→ C + [NH]1C=[C]C=C1 1240 [CH3] + [NH]1C=CC=C1←→ C + [NH]1[C]=CC=C1 1241 [CH3] + [NH]1C=CC=C1←→ C + [N]1C=CC=C1 1242 [OH] + C1C=COC=1←→ O + C1C=[C]OC=1 1243 [OH] + C1C=COC=1←→ O + C1[C]=COC=1 1244 [OH] + [NH]1C=CC=C1←→ O + [NH]1C=[C]C=C1 1245 [OH] + [NH]1C=CC=C1←→ O + [NH]1[C]=CC=C1 1246 [CH3] + C1=CC=CC=C1←→ C + C1=[C]C=CC=C1 1447 [CH3] + C1=CC=NC=C1←→ C + C1=C[C]=NC=C1 1448 [CH3] + C1=CC=NC=C1←→ C + C1=[C]C=NC=C1 1449 [CH3] + C1=CC=NC=C1←→ C + [C]1=CC=NC=C1 1450 [CH3] + C1CCCCC1←→ C + C1[CH]CCCC1 1451 [OH] + C1=CC=NC=C1←→ O + C1=C[C]=NC=C1 1452 [OH] + C1=CC=NC=C1←→ O + C1=[C]C=NC=C1 1453 [OH] + C1=CC=NC=C1←→ O + [C]1=CC=NC=C1 14
Table 3.1: Training reactions used to deduce kinetic solvent effects
53
hydrogen atom is bonded to oxygen, 4) halogenated, 5) amines, and 6) select “normal”
solvents which are non-protonic, do not contain chlorine, and are aliphatic with a single,
dominant bond dipole. The SMD calculations and gas-phase energy calculations then made
it possible to calculate ∆EA for each reaction in each solvent.
Solvent εOctane 1.9Benzene 2.3Tetrahydofuran 7.4Dichloromethane 8.9Pyridine 13.0Acetonitrile 35.7Dimethylsulfoxide 46.8Water 78.4
Table 3.2: Solvents used for single-point energy calculations on training reactions
3.2.2.2 Molecular structure group training
Based upon the observations from the training set of reactions, two hierarchical,
molecular group trees (one for each reactant) were manually constructed based on the fea-
tures considered most important to predicting ∆EA. I chose these features based on both
chemical intuition and the data, and a similar procedure for constructing similar group trees
has been described [39]. Using regression, group contributions to ∆EA were calculated for
each leaf in the trees to best fit the data for the ∆EA of the 53 training reactions. The
top level of the tree contains a base value for ∆EA, and the total ∆EA is calculated by
adding this base value to the values for the leaf in each tree which which best matches the
reacting groups in the reaction of interest. This calculation method is similar to the current
procedure for gas-phase thermodynamics, solvation thermodynamics, and transition state
geometry data estimation in RMG (and hierarchical trees are also used for the rule-based
estimation of gas-phase kinetics). Methods for calculating ∆EA based on this group con-
tribution method were added to the solvation module of RMG, drawing from the newly
added databases of solvation kinetics group values in RMG-database.
54
Though the trees for each solvent have the same structure, a different contribution to
∆EA is used according to the calculations in that solvent. Twenty-six solvents are currently
available to use for thermodynamic corrections in RMG, and these may be expanded in the
future. These solvents can be generalized to different categories, as mentioned above. The
eight solvents for which SMD calculations have been done will be used to construct these
categories; ∆EA for solvents outside of these eight will be calculated using the appropriate
category.
3.2.3 Fuel oxidation model modification
The published n-dodecane/methyl oleate model, generated with RMG by Ben Amara
et al., includes a Chemkin file and a dictionary which gives the SMILES strings for each
named species in the mechanism [4]. Using this information, I parsed the Chemkin file
using methods in RMG to generate a reaction in RMG format, corresponding to each hy-
drogen abstraction reaction in the mechanism. Once the reacting groups were determined
by this script, the tree was traversed to find the contributions to ∆EA which are most ap-
plicable to the reaction. The contributions from the n-octane solvent tree were used, as
it is the solvent which is most chemically similar to n-dodecane. The script then rewrote
the Chemkin file, updating the EA of each hydrogen abstraction reaction. The updated
Chemkin file is included in Appendix A.3, and the script used to generate it can be found
in Appendix A.2.
3.2.4 Reactor simulations
To examine the effect of changing the reaction barriers in the fuel oxidation method,
I simulated a reactor using both the previous (gas-phase kinetics) and updated (liquid-phase
kinetics) models. An ideal gas reactor model was used, with the density set to the liquid fuel
density of n-dodecane, to approximate a liquid-phase reactor. Oxygen was continuously
replenished throughout the course of the simulation to emulate a constant partial pressure
55
of oxygen. Other conditions were set to be the same as the conditions set in the modeling
study conducted by Ben Amara et al. [4]. Different methyl oleate concentrations (0%-30%
by volume) were used. The induction period, which is defined as the time at which the fuel
is 5% converted, was compared for both models as well as to prior experimental data. The
script used to simulate the reactor is included in Appendix A.4.
3.3 Results
Some of the trends observed for change in ∆EA with molecular structure groups
are outlined below. The trends were used to generate the group tree for correcting the gas
phase kinetics in different solvents. Finally, the data for n-octane was used to modify the
EA in a fuel oxidation model; the updated reactor simulations are shown below.
3.3.1 Solvation kinetics trends
Several solvation kinetics trends, based on solvent and molecular structure, were
deduced from the training data. One such relationship is displayed in Figure 3.11. Here,
when ·OH abstracts a hydrogen from different sites of propanol, the kinetic solvent effect
increases as the abstraction site gets closer to the alcohol group.
Examples of other such trends will be briefly listed. Solvent effect on hydrogen
abstraction from alkanes is the same with increasing carbon chain length, with the only
difference being for methane; hydrogen abstraction from saturated rings and unsaturated
rings have a slight difference, with larger differences occurring for high dielectric solvents;
and hydrogen abstraction by ·OOH trends differently than abstraction ·OH and ·CH3, with
the latter two behaving similarly [156]. The intuition gathered from these trends was used
to construct the molecular structure group tree for ∆EA.
56
Figure 3.11: Difference in energy between gas-phase and liquid-phase, between reac-tants and transition state (∆EA), for the reaction XH + ·OH←→ ·X + H2O, where XH ispropanol and each symbol represents a different abstraction site (indicated by moleculeson the right with the abstraction site circled in purple). Kinetic solvent effect increases asdistance of abstraction site to alcohol group decreases.
3.3.1.1 Group tree
From the intuition gained from the trends discovered in the section above, along
with general principles included in the existing thermodynamics and kinetics databases in
RMG, a molecular structure group tree was constructed for the determination of solvation
kinetic corrections for hydrogen abstraction reactions. Importantly, the structure of this tree
determines what estimates will be used for ∆EA when an exact value is unavailable. The
first few levels of the tree are displayed in Figure 3.12; the full tree has 1-2 more layers of
complexity.
Similar studies were conducted for intra-hydrogen migration reactions, although the
solvation corrections were not applied to the n-dodecane/methyl oleate oxidation model.
57
(a) Molecular structure group tree for the molecule being abstracted from (XH)
(b) Molecular structure group tree for abstractingspecies (Y·)
Figure 3.12: Illustration of the first few levels of group trees for hydrogen abstraction. Thefull tree contains 2 more levels of complexity. Reacting atoms are colored in blue. R’ refersto a functional group that is not a hydrogen.
58
Both hydrogen abstraction and intra-hydrogen migration group trees were also further
trained with a larger set of reactions (see Recommendations section). The full tree along
with group values for ∆EA, for hydrogen abstraction in n-octane, is included in Appendix
A.1; this tree was trained using the large set of reactions.
3.3.2 New reactor simulations
Figures 3.13, 3.14, 3.15, and 3.16 compare the results of reactor simulations us-
ing the original, gas-phase kinetic model for n-dodecane/methyl oleate oxidation, the up-
dated model with kinetic solvent effects, and experiments. For the simulation with pure
n-dodecane (0% methyl oleate), addition of solvation kinetic effects appears to increase the
induction period of the fuel. This increase is in the direction of most, though not all, of the
experimental data points. The amount that the modified model increases the induction pe-
riod, on a logarithmic basis, is dependent on the amount of methyl oleate in the fuel blend.
Additionally, the model which is least consistent with the experimental data is the one built
for 30% methyl oleate content. These results may suggest more training data are needed
for the solvation kinetic groups which contain or are proximal to oxygen atoms. While not
relevant in this example, as n-dodecane is a non-hydrogen bonding solvent, oxygen as a re-
acting atom will matter even more in such cases as the transition state would be stabilized
by a hydrogen bonding solvent relative to the reactants.
3.4 Summary
In this chapter, modifications to the RMG-Py software were implemented for liquid-
phase mechanism generation, specifically for reaction kinetics. Based on well known corre-
lations, reaction rates were modified to account for diffusion effects, with an improvement
over the implementation in RMG-Java: the addition of temperature-dependence in the sol-
vent viscosity. Additionally, a large part of this chapter was devoted to demonstrating the
applicability of a group contribution method for estimating kinetic solvent effects. Several
59
2.20 2.25 2.30 2.35 2.40 2.45 2.501000T
(K−1 )
10-1
100
101
102
Indu
ctio
n P
erio
d (h
ours
)
ExperimentsBen Amara 2013 modelModified model
Figure 3.13: Comparison of experiments, original model and updated model with kineticsolvent effects for 0% methyl oleate.
2.2 2.3 2.4 2.51000
T (K 1)
10 1
100
101
102
Indu
ctio
n P
erio
d (h
ours
)
ExperimentsBen Amara 2013 modelModified model
Figure 3.14: Comparison of experiments, original model and updated model with kineticsolvent effects for 5% methyl oleate.
experimental and theoretical methods for measuring liquid phase rate constants were re-
viewed; however, given RMG’s extensive database of gas-phase rates, it was preferable to
design a method that modified these rates systematically to account for a solvent. A train-
ing set of hydrogen abstraction reactions, and later intra-hydrogen migration reactions, was
used to deduce trends based upon molecular structure on the change in energy between gas-
phase and different solvents, for both reactants and transition states. The calculations were
completed using M06-2X/MG3S. Based upon the trends, a group contribution method was
devised, and used to modify rates of a fuel oxidation model built using RMG-Java. Reac-
tor simulations using the updated model showed that correcting rates changes the induction
60
2.2 2.3 2.4 2.51000
T (K 1)
10 1
100
101
102
Indu
ctio
n P
erio
d (h
ours
)
ExperimentsBen Amara 2013 modelModified model
Figure 3.15: Comparison of experiments, original model and updated model with kineticsolvent effects for 10% methyl oleate.
2.2 2.3 2.4 2.51000
T (K 1)
10 1
100
101
102
Indu
ctio
n P
erio
d (h
ours
)
ExperimentsBen Amara 2013 modelModified model
Figure 3.16: Comparison of experiments, original model and updated model with kineticsolvent effects for 30% methyl oleate.
period measurement, generally towards the direction of experimental induction period. The
more methyl oleate in the fuel blend, the worse the agreement of the updated model with
the experimental values. This reflects a need for more training reactions including methyl
esters, and oxygen-containing species in general.
61
3.5 Recommendations
Several recommendations to improve solvation kinetics in RMG-Py are outlined
below. Among these are better automation, benchmarking, and integration with estimation
methods to ensure thermodynamic consistency.
3.5.1 On-the-fly estimation of solvation kinetics
In the future, the algorithm described in this chapter can be integrated into the RMG
software such that the mechanism including the kinetic solvent effect can be generated on
the fly, instead of as a post-processing step to modify a Chemkin file. On-the-fly genera-
tion of solvation kinetics ensures that important reactions are included in the model, since
a post-processing step is done on reactions whose inclusion in the mechanism was based
on their gas-phase rates. This framework has already been partially set up, as the post-
processing step is completed using methods I added to RMG. When the solvation database
is loaded, the solvation kinetics correction database is loaded in addition to the thermody-
namic database files. A method to calculate the barrier correction using group contributions
was added to the solvation module. The missing step is correcting the intrinsic reaction rate
during model generation, as we similarly do to account for diffusion. Because these steps
are similar, it may make sense to do them in the same place in the source code. Some
placeholder code is included in the diffusionLimited module for this purpose, in a new
class called LiquidKinetics. Complete interfacing with the kinetics and solvation databases
is currently incomplete. Importantly, this on-the-fly step should not be integrated into RMG
until further benchmarking and consistency checks are done, explained below.
62
3.5.2 Benchmarking the estimates
To minimize computational cost, the training reactions used to train group additivity
values for ∆EA involved small molecules, but the reactions in real detailed kinetic mod-
els for fuel oxidation will contain many more atoms. Thus, it is important to understand
whether our training reaction set can accurately capture the kinetic solvent effect on the
actual reactions. It is hypothesized that the chemistry of the reacting site is most important
to ∆EA, and that effects beyond the next-nearest neighbor are negligible. To test this, the
∆EA for some hydrogen abstraction and intra-hydrogen migration reactions from the n-
dodecane/methyl oleate model should be calculated using M062X/MG3S and SMD using
the same protocol as above. These ∆EA can then be compared to the ∆EA that would have
been estimated for the same reaction using the automated method. This is work currently
in progress at the time of writing.
3.5.3 Check thermodynamic consistency with LSERs
One way of checking the consistency of both the SMD calculations, and the linear
solvation energy relationships (LSERs) discussed in Chapter 2, is to compare the equilib-
rium constant obtained by using both of these methods. Using LSERs, the equilibrium
constant of reaction, Keq, can be obtained with:
Keq = exp−∆G
RT
∆G = ∆Ggas + ∆Gsolv
where ∆Gsolv was calculated via the partition coefficient obtained from the Abraham corre-
lations. Another Keq can be calculated using ∆G for the reaction’s reactants and transition
state, which are calculated using the DFT and SMD method outlined in this chapter. Since
data from both of these methods would be used alongside each other in mechanism gener-
63
ation codes such as RMG, it is important the values obtained from each method reasonably
match.
3.5.4 Data-driven approaches
The algorithm for predicting intrinsic kinetic solvent effects has two main draw-
backs: small amounts of training data, and a human-constructed tree structure. Manually
setting up transition state calculations is tedious, thus the first iteration of this algorithm
only contained 53 hydrogen abstraction reactions as training data, with the starting transi-
tion state geometry guess and input files all set up by hand. Using larger amounts of data
which had been previously calculated automatically, and the effect of more data points on
the estimation of solvation kinetics, can be explored.
Furthermore, the molecular structure trees in RMG are all constructed with the fea-
tures ordered in the way the scientist believes to be the most important. For example, many
of the trees in RMG for calculating thermodynamic or kinetic parameters consider the ele-
ment of the central reacting atom to be more important than its bonding configuration, and
the number of its radical electrons to be more important than its element. While a chemist’s
intuition should not be ignored, a data driven approach may improve the accuracy of the
estimation method. To explore this concept, automated decision trees can be constructed
for solvation kinetics using the Python package, scikit-learn [157].
3.5.4.1 Automated transition state theory to generate large training sets
To generate the large solvation kinetics training set, we needed a large set of gas-
phase transition states, as this calculation set-up is the bottleneck of calculating ∆EA. The
set of transition states was created using an automated algorithm, based on a group contri-
bution approach to predicting transition state geometries [39]. Outside of this algorithm,
Python scripts were written to streamline the process of 1) obtaining the gas-phase tran-
sition state geometry and energy from the database of transition states, 2) calculating the
64
Figure 3.17: Algorithm for obtaining solvation energy estimation values from a large tran-sition state dataset. Utilizes software (from top to bottom): AutoTST [39], Gaussian 09[95], cclib [158], and RMG-Py [7, 8], with helper scripts written as part of this work.
single-point energy of the transition state and reactant(s) in the solvent of interest, 3) cal-
culating ∆EA for this reaction, and 4) using the values to train group values for the ∆EA
estimation algorithm. A schematic for this process is shown in Figure 3.17.
3.5.4.2 Automatic generation of decision trees with scikit-learn
Scikit-learn is an open source Python package used for machine learning [157]. It
has been used for automatically constructing decision trees in several chemical and bio-
logical applications [159–161]. Rather than using a chemist’s intuition about how to order
the decision tree, scikit-learn will utilize a user-defined metric (such as RMS error on a
training set, for example), for determining how the tree should be constructed. Using a
package such as scikit-learn opens up several other possibilities. The decision tree model
can be robustly cross-validated on the training set and model hyperparameters can be tuned
using built in modules. Also, ensemble methods such as random forests, which use several
decision trees to estimate the quantity of interest, can improve the accuracy of the model.
Several feature selection, parameter tuning and model combinations can be tested using
pipelines in scikit-learn, with much less code than building these trees manually. Some ex-
ample experimental code for building decision trees for estimating solvation kinetic effect
65
is included in Appendix A.5.
66
4 AUTOMATED SILICON HYDRIDE MECHANISM GENERATION
Chemical vapor deposition (CVD) is a candidate domain for studying detailed
chemical kinetics. The gas-phase chemistry of the precursors to CVD directly affects
the yield and quality of the solid materials produced, and is thus an important facet of
the process for industrial scientists, such as those fabricating microelectronics, to under-
stand. Complete, detailed models of the reactions of these precursor gases, mainly silicon
hydrides such as silane (SiH4), can provide necessary insights to the semiconductor indus-
try. Detailed kinetic models also allow performance predictions to be made at different
operating conditions than what has been studied experimentally. However, building these
large mechanisms by hand is tedious, and automatic mechanism generators can be used to
build the models faster and minimize errors. Using automatic mechanism generation en-
sures that all important reaction pathways, including those involving radical species, will
be considered in modeling and prediction of the CVD process. Furthermore, a framework
for studying gas-phase silicon hydrides provides the first step in studying surface reactions
involved in CVD.
In this section of the dissertation, Reaction Mechanism Generator (RMG), an au-
tomatic mechanism generator discussed in the prior sections, was used to build detailed
kinetic models for silicon hydrides used for CVD. A new element, silicon, was added to
the RMG framework. Specifically, data for silicon hydrides were added to RMG’s database
of thermodynamic and kinetic parameters. Radical reaction types, which already existed in
RMG but lacked data for silicon, and new reaction types specifically important for silicon
hydride chemistry, can both be proposed with the updated RMG.
67
Using the new data in RMG, a model for SiH4 thermal decomposition was built.
The resulting model was used to simulate a flow reactor, and these simulations were com-
pared to SiH4 flow tube experimental data obtained from Onischuk et al. [162], including
data for SiH4 and Si2H6 concentration profiles with time, and at different temperatures,
residence times, and initial concentrations of SiH4. Studying SiH4 can serve as a proof-of-
concept that can be extended to other novel gas precursors.
4.1 Background
SiH4 CVD has been well studied experimentally and theoretically [163–170], among
others, for the past 5 decades. Several representative studies will be discussed in the fol-
lowing sections.
4.1.1 Experimental work on SiH4 chemistry
Several decades ago, Purnell and Walsh experimentally studied SiH4 pyrolysis be-
tween 375 and 430◦ C. They found the overall rate of SiH4 −→ SiH2 + H2 to be of order
1.5, and propose different chemical mechanisms to account for this order. The authors
rule out heterogenous steps under these conditions, because the vessel’s A/V ratio did not
strongly change the reaction rate. Thus, they conclude that the mechanism must include
a unimolecular, first order decomposition which is pressure dependent along with chain
reactions. However, the choice of whether SiH4 first decomposes to SiH3 or SiH2 required
further information of bond dissociation energies [163].
Newman et al. studied silane decomposition in a shock tube, supplemented by
RRKM rate calculations. They found that rate of silane decomposition is independent of
initial silane concentration, and also that hydrogen atom production is not important in the
process, providing evidence that SiH4 decomposition to SiH2 is the most important initial
step. However, because they did not see disilane, Si2H6, as a product, they hypothesize the
SiH2 must be consumed by some other means rather than producing hydrogen atoms [171].
68
Michael Coltrin and co-workers at Sandia National Laboratories have done exten-
sive work on silane CVD in a rotating disk reactor, which eliminates temperature and con-
centration gradients allowing CVD to be more uniform [165, 167]. Based on laser fluo-
rescence measurements, two reactions are proposed that produce Si atoms, which involve
disilylene (H3SiSiH). These reactions are in contrast to what was previously thought to be
the route to Si atoms, SiH2 −→ Si + H2 [167].
Frenklach et al. studied silane and disilane pyrolysis in a shock tube, diluted with
both argon and hydrogen, at high temperatures (900-2000 K). The results of these ex-
periments were used to build a detailed kinetic model for both gas-phase and gas-surface
reactions. Comparison to the experiment showed that the refractive index of the silicon
material used greatly affects model predictions [168].
Onischuk et al. investigated silane pyrolysis in a flow reactor between 800-1000
K. For the gas phase, the effects of initial silane concentration on silane decomposition
and disilane evolution were elucidated; specifically, these rates increased with increasing
silane concentration. At different residence times, the effect of temperature on final silane
concentration was probed. The solid phase was also investigated; the concentration of
particles as well as hydrogen content in the solid product was measured. Based on the
experimental results, the chemical mechanism was described as an initial decomposition of
silane into SiH2 and H2, followed by subsequent silylene insertions to create higher silanes
and substituted silylenes [162]. The Onischuk et al. experiments are used as a basis of
comparison in this chapter of the dissertation.
4.1.2 Detailed mechanisms for SiH4 CVD
Mechanisms for SiH4 thermal decomposition have been developed using the above
experimental data as well as theoretical calculations, by hand and automatically. Yuuki et
al. developed a model for SiH4 and Si2H6 from the experimental observations of Purnell
and Walsh and Newman et al. [163, 164, 171]. The model includes 10 species and 11
69
reactions, and it achieves good agreement with two experiments [163, 172].
Coltrin et al. developed a mathematic model for their rotating disk reactor, which
includes a 26 reaction gas-phase chemical mechanism [165]. Several rate constants were
taken from the RRKM calculations of Becerra and Walsh [173].
Giunta et al. built a gas-phase silane decomposition mechanism from prior exper-
imental works on SiH4/NO CVD[174] and CVD from Mg2Si [175], which includes 18
gas-phase reactions. They hypothesize that film growth is mainly due to disilene species.
The model generally modeled CVD from silane well, compared to experiment, while ex-
perimental comparison to disilane CVD only agrees qualitatively at best. Authors cite low
reliability of rate constants as well as the absence of heterogeneous chemistry from the
model as reasons why Si2H6 experiments do not match well [166].
The largest manual reaction mechanism for the pyrolysis of silane to form silicon
nanoparticles, which was built by Swihart and Girshick, contains 220 chemical species and
2600 chemical reactions. To build the mechanism, a group additivity method was devel-
oped to estimate the thermochemistry of several silicon hydrides in the model. Reactivity
rules were also developed to generate reactions in the mechanism, including templates for
silylene (SiH2) insertion into Si-H and H-H bonds (and its reverse, SiH2 elimination); sily-
lene to silene isomerization; and ring opening and closing isomerizations [5]. Later, this
work was continued in collaboration with the the Broadbelt group at Northwestern Univer-
sity, which has extensively modeled the pyrolysis of silane, particularly for the application
of silicon nanoparticle formation. The group additivity thermochemistry method was im-
proved, using the G3//B3LYP level of theory [70, 176]. Wong et al. [6] used automatic
mechanism generation to build a model for silicon nanoparticle formation from SiH4 de-
composition. In that study, automatic mechanism generation software developed by the
Broadbelt group, which uses a rate-based termination criteria to limit model size, was em-
ployed. Reaction types were the same as in the previous study by Swihart and Girshick
[5]. The study included a comprehensive analysis of the different factors contributing to
70
silicon particle clustering. However, radical pathways were not included in the built mod-
els. Adamczyk and co-workers further developed group additive rate rules for the reaction
families involved, derived from quantum chemistry calculations at G3//B3LYP [177–180].
4.1.3 Importance of radical chemistry in silicon hydride thermal decomposition
In their early study, Purnell and Walsh suggested that decomposition of SiH4 to
SiH3 and H plays a role in silane pyrolysis despite not being the most dominant path-
way [163]. An earlier study by Emeleus and Reid [181] suggested the role of SiH3 be-
comes more important when Si2H6 and Si3H8 are used as the silicon hydride precursor to
CVD. The involvement of radicals may also be more important at certain CVD conditions
[182, 183], for example, in low temperature CVD in silane plasma [184]. SiH3 was found
to be the dominant radical species at steady state in silane plasmas in a study by Robertson
et al. [185]. Watanabe et al. later demonstrated that the concentration profiles of Si, SiH
and SiH2 radicals greatly affect particle growth in these systems [186]. Given that these
studies all theorize the role of radical reaction pathways in silicon hydride thermal decom-
position, methods for building detailed kinetic models should have the ability to include
these pathways if they are important.
4.2 Methods
To enable RMG to generate detailed kinetic models for silicon hydrides, small up-
dates to the RMG source code were made, and its database was updated. Once these
updates were performed, RMG was used to build detailed kinetic models for silane thermal
decomposition. The models output by RMG were used to simulate a reactor for comparison
to the experiments of Onischuk et al. [162].
71
4.2.1 RMG source code
Although silicon atom types already existed in RMG, the functionalities for in-
crementing and decrementing bond orders, breaking and forming bonds, and adding and
removing lone pairs and radical electrons were fully implemented in this work. These
updates were made in the molecule module of RMG.
Additionally, several thermodynamic and kinetics calculations were completed us-
ing quantum chemistry and CanTherm, which parses quantum chemistry output files to
perform thermodynamic and kinetics calculations, and is a subprogram of RMG [33]. In
this work, CanTherm uses the rigid rotor harmonic oscillator approximation to calculate
thermodynamic properties and canonical transition state theory with Eckart tunneling to
calculate kinetic parameters. In order to perform these calculations, CanTherm must apply
atom, bond and spin-orbit coupling energy corrections to adjust the energies calculated for
a particular level of theory and quantum chemistry program. These corrections must be
manually entered into the RMG source code for silicon atom types and silicon containing
bonds. Thus, the necessary parameters were entered for the levels of theory used in the
calculations, either from published data [70, 187, 188] or additional calculations of atomic
energies using Gaussian 09 [95].
4.2.2 Updating RMG’s database
RMG’s database was updated with both kinetics and thermodynamics data for sil-
icon hydrides. Published reaction rates have been added to reaction libraries, which are
preferentially used during RMG simulations if they match a generated reaction. Additional
rates were calculated using transition state theory. Some published and calculated rates
have been added to the training data for four reaction families, to be described below, and
will be used to estimate reaction rates in these families where the exact rates are unavail-
able. Published, calculated, and group additive thermodynamic data for silicon hydrides
have also been added to RMG’s database.
72
4.2.2.1 Kinetics data
In this work, two reaction rate libraries were added to RMG’s database. The li-
braries are based on one experimental study of SiH4 CVD [166] and one theoretical study
involving pressure-dependent rate calculations for mainly Si2 species [170]. If specified
in the input file of an RMG job, the rates from these libraries will be used if an RMG-
generated reaction matches a reaction in the library.
Additionally, two new reaction families were added to RMG’s database: 1. sily-
lene insertion, in which a silicon atom with a lone pair can insert into a silicon-hydrogen
or hydrogen-hydrogen bond; and 2. silylene-to-silene isomerization, in which a hydrogen
atom migrates from an sp3 bonded silicon atom to a silicon atom with a lone pair, and a
double bond is formed. These families are based on the reaction types used in a previous
work on model generation [6]. The kinetics data in these reaction families come from rates
calculated by Adamczyk et al. [177–179] using G3//B3LYP.
The third reaction family considered in previous mechanism generation for silicon
hydrides were ring closing reactions (and its reverse, ring opening), in which a silylene
molecule with at least three silicon atoms isomerizes to form a ring structure. Two of these
reactions (for Si3H6 and Si4H8) were added into a reaction library. Because the current work
does not consider larger molecules, an RMG reaction family for arbitrarily large rings was
not created.
In addition, the training reaction databases of two existing reaction families in
RMG, hydrogen abstraction and radical recombination, were updated with silicon hydride-
containing reactions and kinetic parameters [183, 189], found in the NIST kinetics database
[190]. For hydrogen abstraction reactions, ten additional rates were calculated. The reac-
tant and transition state geometries were optimized using M06-2X [147, 148] / 6-311+G(3d2f)
[150–152, 191, 192] , with a clear saddle point found for each reaction. The CanTherm
package, which applies conventional transition state theory, was used to determine the
Arrhenius kinetic parameters [33]. More detailed information about CanTherm can be
73
found in Chapter 1. Reliability of these new, calculated rates were tested by comparing to
available published experimental and theoretical rate calculations for hydrogen abstraction
reactions of silicon hydrides [182, 189, 193–197].
Reaction family Template
Hydrogen abstraction R1 H + R2 R1 R2 H+
Radical recombination R1 + R2 R1 R2
Silylene insertion R1 Si H R2 H+ SiH2R1R2
Silylene-to-silene isomerization Si Si HHR1
R2Si SiH2
R1R2
Table 4.1: Reaction families used to generate mechanisms for silicon hydrides in RMG
The templates for these four reaction families, which are used to generate the SiH4
thermal decomposition models, are given in Table 4.1. Since these general reactions are
reversible, reactions based on the reverse templates (i.e. silylene elimination) are also in-
cluded in the model.
Further information about how RMG prioritizes reaction rates from libraries and
families can be found in Chapter 1.
4.2.2.2 Thermodynamics data
Group additivity values for the thermodynamics of silicon hydride species (silanes,
silenes, and silylenes) were previously calculated by Swihart and Girshick [5]. The group
values were later improved upon by Wong et al. by fitting to G3//B3LYP calculations [70].
In this work, the Wong et al. values were added to RMG’s database for use in the existing
group additivity scheme for calculating thermodynamic parameters in RMG.
For radical species, group additivity values have not previously been determined.
We used G3//B3LYP quantum chemistry calculations to generate hydrogen bond increment
(HBI) values for silicon hydride radicals. HBI is a method, described in more detail in
Chapter 1, used to calculate the thermodynamic parameters of a radical species R∗ from
74
its parent molecule RH, which is the chemical species created by saturating the radical
with hydrogen atoms [28]. Both the G3//B3LYP calculations for the closed-shell species
and group additivity values using HBI for radical species were benchmarked against higher
level calculations by Katzer et al. [198]
4.2.3 RMG model generation
The conditions for the RMG simulation were chosen to closely mirror those of the
experimental conditions in the flow tube experiment [162], with the base case conditions of
T = 913 K, P = 39 kPa, and y0(SiH4) = 0.00016 in an argon bath gas. Temperature, pressure,
and initial SiH4 mole fraction were varied around these values to generate a comprehensive
mechanism that could be used for reactor simulations at a variety of conditions. Pressure
dependence was included in some of the built models to test its effects on the model ac-
curacy. The pressure dependence scheme in RMG has been described in Chapter 1 and
previously [25].
4.2.4 Reactor modeling
Once the models were built in RMG, they were tested for validity by comparison
to experimental results using a constant pressure reaction model in Cantera, which inte-
grates with Python [199]. The Python script used to simulate the reactor and generate the
comparison plots are provided in Appendix B.4. We simulated the same conditions for
temperature, pressure, initial SiH4 mole fraction, and residence time given in the experi-
ment by Onischuk et al. [162] The rate constants and thermodynamic parameters used in
the simulation come from the RMG-generated mechanism; the RMG simulation provides a
Chemkin file as output, which can be easily converted to a Cantera input file with a script.
Different simulations as well as sensitivity analyses were performed to understand the ef-
fects of temperature, pressure, initial SiH4 mole fraction, residence time, and model size
on the SiH4 concentration profile.
75
4.3 Results
The thermodynamic and kinetic data calculated were compared to published data
in order to validate their use in RMG’s database. After updating the database, the RMG
generated models were used in simulations and compared with experimental results. The
effects of changing various process conditions were considered.
4.3.1 Kinetics of hydrogen abstraction reactions
# Reaction log A Ea log k300K log k1000K Source
1 SiH4 + H −→ SiH3 + H2 14.8 10.7 13.1 14.2 This work13.9 11.7 11.9 Ref. 196
11.2 13.6 Ref. 18913.2 Ref. 197 (QI)13.1 Ref. 197 (VTST)13.2 Ref. 195
11.3 Ref. 194
2 SiH3 + H2 −→ SiH4 + H 13.4 72.3 1.03 9.51 This work12.4 61.6 1.65 Ref. 196
3 Si2H6 + H −→ Si2H5 + H2 15.9 3.55 15.5 15.7 This work13.8 11.1 11.9 Ref. 193
11.9 14.0 Ref. 189
4 Si2H5 + H2 −→ Si2H6 + H 14.2 77.0 1.01 10.1 This work
5 Si2H6 + SiH3 −→ Si2H5 + SiH4 15.1 12.7 13.2 14.3 This work9.6 Ref. 182
6 Si2H5 + SiH4 −→ Si2H6 + SiH3 14.8 24.6 10.9 13.4 This work
7 Si2H5 + H −→ SiH3SiH + H2 14.5 4.55 13.9 14.2 This work
8 SiH3SiH + H2 −→ Si2H5 + H 14.1 88.1 -0.97 9.45 This work
9 Si2H4 + H −→ Si2H3 + H2 14.7 7.95 13.5 14.3 This work
10 Si2H3 + H2 −→ Si2H4 + H 13.6 126 -8.11 6.93 This work
Table 4.2: Hydrogen abstraction rates calculated from M062X/6-311+(3d2f) and transitionstate theory using Cantherm. Rates were compared with those obtained from literature,where available. Units in kJ, mol, cm3, s. Logarithms are base 10.
76
We calculated ten hydrogen abstraction reaction rates for use in RMG’s database,
but only four prior published rates were available for comparison. The rate coefficients
at 300 K and 1000 K and Arrhenius parameters are displayed in Table 4.2, along with
published data where available (Reactions 1, 2, 3 and 5). The calculated geometries of the
transition states of these ten reactions are included in Appendix B.1. There are only five
transition state geometries given, since the reverse reactions will have the same transition
states.
In all cases, the rate coefficients at 1000 K are within 1–2 orders of magnitude
of the published data. However, at 300 K, the discrepancy is much greater for Reactions
3 and 5. Because we are considering temperatures between 800 K and 1000 K for this
study, the differences at low temperatures are not particularly concerning. For reactions
such as Reaction 1, where many experimental and theoretical data are available and are in
agreement with one another, it is practical to use this agreed-upon rate in RMG’s database
instead of the less accurate rate calculated here. However, the comparison of these four
calculated rates to the literature data shows that for reactions where data are unavailable or
scarce, such as Reaction 4, these DFT estimates are reasonable enough for the purposes of
automatic mechanism generation, particularly at CVD relevant temperatures. If the RMG
generated model were to be simulated at process conditions at which radical chemistry
becomes more important (see further results), or different silicon precursors are used, more
accurate calculations should be done.
4.3.2 Calculated thermodynamic data
All geometry optimization results from G3//B3LYP calculations on silicon hydrides
species are provided in Appendix B.2. The parity plot shown in Figure 4.1 reveals that cal-
culations of ∆fH◦298 using G3//B3LYP for both closed-shell and radical species compare
well with the high level calculations of Katzer et al. [198]. Most discrepancies are less
than 5 kcal/mol. Larger differences between the two methods are seen for some Si3 and
77
Figure 4.1: G3//B3LYP calculations of ∆fH◦298 (this work) compared to high level calcu-
lations by Katzer et al. [198], for silicon hydride species with up to three silicon atoms.Structures shown for species with more than 5 kcal/mol discrepancy.
multiradical species. Katzer et al. report that for proper treatment of these species, which
have both multiple radicals and divalent silicon atoms, a multiconfiguration reference wave
function with a post-self-consistent field calculation method must be used [198]. Therefore,
we know that our treatment with G3//B3LYP will not be exact. However, G3//B3LYP pro-
vides a reasonable and computationally less expensive estimation of the thermochemistry
for most species, which justifies our use of it for other calculations in this work.
Hydrogen bond increment (HBI) values derived from the G3//B3LYP calculations
of radical species and their parent molecules are given in Table 4.3. ∆fH◦298 for 16 silicon
hydride radicals were then calculated via group additivity and compared to the Katzer et
al. values [198], illustrated in Figure 4.2. In this particular comparison, the group addi-
tivity values of Wong et al. [70] were used to calculate the thermodynamics for the parent
molecules, for the sake of consistency across species, but during an actual RMG simulation,
exact species thermodynamics would be used for the parent molecules if available in the
database. Because of this, Figure 4.2 is a reflection of both the accuracy of Wong’s group
78
Radical species Parent moleculeHBI(∆fH
◦298)
(kcal/mol)HBI(S◦298)
(cal/mol/K)
SiH3Si HSi SiH3 74.0 -3.47SiH SiH2 75.3 -4.92
HSi SiH2 HSi SiH3 75.8 -0.571H2Si SiH H2Si SiH2 83.1 3.01
SiH3
HSi
H3Si SiH3
H2Si
H3Si86.0 1.76
H2Si SiH3 H3Si SiH3 88.5 0.812SiH3 SiH4 91.3 0.192
Si SiH2 146 -12.9
H2Si Si H2Si SiH2 154 -0.446
SiH3
SiH3Si SiH3
H2Si
H3Si168 3.00
HSi SiH3 H3Si SiH3 174 0.443
SiH2 SiH4 181 -1.03
Table 4.3: Hydrogen bond increment (HBI) corrections calculated with G3//B3LYP. Thesecorrections account for the effect of losing 1 or 2 hydrogen atoms on the enthalpy andentropy.
79
Figure 4.2: Group additivity calculations of ∆fH◦298 from RMG-Py, derived from Wong
et al. GAV [70] and HBI corrections (this work), compared to high level calculations byKatzer et al. [198], for 16 silicon hydride radical species. Structures shown for specieswith more than 5 kcal/mol discrepancy.
additivity values and our calculated HBI values. While most values compare reasonably
well, there are again significant differences for three multiradical species which contain
either double bonds or divalent silicons, for the reasons discussed above. It may be prefer-
able, therefore, to calculate the thermodynamics of these multiradical species at a higher
level and to put them in a thermodynamic library in RMG, which are used preferentially
over group additivity estimates.
4.3.3 RMG generated mechanisms
Two RMG mechanisms were used for most of the reactor simulations, the difference
being inclusion of pressure dependent rate parameters. Tolerance was adjusted to make
the size of the models roughly equal and to ensure that radical chemistry was included in
both models. The mechanism including pressure dependence contains 63 silicon hydride
species and 1298 reactions and the mechanism without pressure dependence contains 57
80
species and 578 reactions. A third (pressure-dependent) mechanism was also generated to
incorporate important species and reactions at a variety of initial SiH4 concentrations for
comparison to experiment. This mechanism was larger, with 83 species and 2708 reactions.
The largest RMG-generated mechanism (Chemkin file, species dictionary, and converted
Cantera input file) is provided in Appendix B.3.
4.3.3.1 Effect of pressure dependence
A constant-pressure reactor model was simulated with both the pressure dependent
and non-pressure dependent RMG mechanisms. The result, shown in comparison to the
prior experimental results, is displayed in Figure 4.3(a). Since the RMG model containing
pressure dependence is shown to replicate more accurately the experimental result, pressure
dependence was used for the remaining analysis. This result is expected, as including
pressure dependent networks in the model is important at the low pressures typically used
for SiH4 CVD.
4.3.3.2 Effect of temperature
The full pressure dependent model was further used for reactor simulations at dif-
ferent temperatures. Figure 4.3(b) displays the plot from experiment with the pressure
dependent model simulated at 863 K, 893 K, 913 K (the experimental temperature), and
963 K.
The simulations at 863 K and 963 K show that temperatures within a ± 50 K range
can affect the SiH4 concentration profile enormously. Simulating the reactor at 893 K
provides a close comparison to the experimental data reported at 913 K, illustrating that an
uncertainty of 20 K can fully explain the difference between the model and the experiments.
The main initial decomposition reaction, SiH4 −−⇀↽−− SiH2 + H2, has an activation energy of
about 55 kcal/mol, with many reported theoretical and experimental determinations cov-
ering a range of about ±5 kcal/mol. A 20 K change from 913 K to 893 K corresponds
81
0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8Time (s)
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1.0
ySiH
4
y0,
SiH
4
ExperimentPDepNo PDep
(a)
0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9Time (s)
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1.0
ySiH
4
y0,
SiH
4
Experiment863 K893 K913 K963 K
(b)
Figure 4.3: Simulation results (this work) compared with SiH4 thermal decomposition ex-periment (data points) [162]. (a) Comparison of RMG models with and without pressuredependence to experimental results. Conditions are T= 913 K, P = 39 kPa, and y0,SiH4
=1.6× 10−4 (b) Comparison of RMG model, including pressure dependence, at differenttemperatures, compared to the experimental results at 913 K.
to only a −1.20 kcal/mol change in activation energy for a global reaction rate following
Arrhenius kinetics with a 55 kcal/mol activation energy. In other words, the discrepancy
lies well within the expected uncertainty in the activation energies.
4.3.3.3 Residence time variation with temperature
In the SiH4 experiment, the reactor residence time was varied by changing the volu-
metric flow rate of the feed gas [162]. Our simulations similarly varied the volumetric flow
rate into the reactor. Figure 4.4 compares the SiH4 concentration profile versus temperature
at four different residence times. In the compared experiment, reported inlet volumetric
flow rates and residence times at 843 K were not consistent given a constant volume re-
actor, leading to slight errors in the residence time used in the simulation. The error bars
in the figure represent this discrepancy only, with more information included in Appendix
B.5.
The reactor simulations compare qualitatively well with the experimental data. In
the previous section, it was noted that a 20 K difference in temperature, which is well within
the uncertainty of the reaction rates, would better capture the SiH4 profile. One can imagine
82
800 850 900 950 1000Temperature (K)
0.0
0.2
0.4
0.6
0.8
1.0
ySiH
4
y0,S
iH4 Exp. 0.8 s
Exp. 0.5 sExp. 0.35 sExp. 0.2 sMod. 0.8 sMod. 0.5 sMod. 0.35 sMod. 0.2 s
Figure 4.4: SiH4 concentration vs. temperature at different residence times, from Onischuket al. [162] (points) and from reactor simulations in this work.
that the simulation data in Figure 4.4 would, therefore, become closer to the experimental
results if the curves were shifted by +20 K; the largest discrepancy is about 40 K.
4.3.3.4 Effect of radical reaction families
To investigate the effect of excluding radical reaction families, a new mechanism
was generated with the radical reaction families (hydrogen abstraction, radical recombina-
tion) disabled during the mechanism generation. The results for the mechanisms simulated
at 913 K are displayed in Figure 4.5(a), along with the experimental data. At these condi-
tions, the removal of radical reactions has no noticeable effect on overall SiH4 decompo-
sition rate. Because radical pathways are thought to be relatively more important at lower
temperatures of CVD conditions, RMG mechanisms were generated at 613 K, both with
all reactions included and with radical reaction families removed, and used in simulations.
At this lower temperature and far slower decomposition, as shown in Figure 4.5(b), there is
still hardly a difference between the full mechanism and the mechanisms with radical reac-
tions removed. Comparing the Si fluxes at 6× 104 seconds, which was chosen due to the
slight acceleration of the non-radical pathway at that time, we don’t see a difference in the
significant pathways to SiH4 decomposition, and only negligible participation by radicals.
The flux diagram for the full mechanism is shown in Figure 4.6.
83
0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8Time (s)
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1.0
ySiH
4
y0,
SiH
4
ExperimentFull mechanismNo radical reactions
(a)
x105
(b)
Figure 4.5: Simulation results for full pressure dependent mechanisms generated by RMG,compared with a mechanism generated without radical reaction families allowed. P = 39kPa, y0,SiH4
= 1.6× 10−4 (a) T = 913 K (b) T = 613 K.
Scale = 1e-10 time = 60158
Si₂H₆
SiH₃SiH
H₄Si₂
SiH₂Si
Si₂
SiH₄
SiH₂
Figure 4.6: Flux diagram for Si at 6× 104 seconds for full, pressure dependent mechanismgenerated by RMG and simulated at 613 K
84
4.3.3.5 SiH4 and Si2H6 concentration profiles
0.00 0.05 0.10 0.15 0.20 0.25Time (s)
0.70
0.75
0.80
0.85
0.90
0.95
1.00y
SiH
4
y0,
SiH
4
y0 = 0.00016y0 = 0.00088y0 = 0.0025y0 = 0.05
0.00 0.05 0.10 0.15 0.20 0.25Time (s)
0.0000.0050.0100.0150.0200.0250.0300.0350.040
ySi2
H6
y0,
SiH
4
y0 = 0.00088y0 = 0.0025y0 = 0.01y0 = 0.05
(a) Experimental results from Onischuk et al. [162]
0.00 0.05 0.10 0.15 0.20 0.25Time (s)
0.70
0.75
0.80
0.85
0.90
0.95
1.00
ySiH
4
y0,
SiH
4
y0 = 0.00016y0 = 0.00088y0 = 0.0025y0 = 0.01y0 = 0.05
0.00 0.05 0.10 0.15 0.20 0.25Time (s)
0.0000.0050.0100.0150.0200.0250.0300.0350.040
ySi2
H6
y0,
SiH
4
(b) Results of reactor simulations using RMG-generated model
0.00 0.05 0.10 0.15 0.20 0.25Time (s)
0.70
0.75
0.80
0.85
0.90
0.95
1.00
ySiH
4
y0,
SiH
4
y0 = 0.00016y0 = 0.00088y0 = 0.0025y0 = 0.01y0 = 0.05
0.00 0.05 0.10 0.15 0.20 0.25Time (s)
0.0000.0050.0100.0150.0200.0250.0300.0350.040
ySi2
H6
y0,
SiH
4
(c) Simulation results after updating three reaction rates.
Figure 4.7: Variation in concentration profiles of SiH4 and Si2H6 with initial SiH4 concen-tration at 873 K.
Figure 4.7 displays the concentration of SiH4 and Si2H6 with time at different ini-
tial SiH4 concentrations at 873 K. These profiles appear not to vary with the initial SiH4
concentration (4.7(b)), although the experimental results show a clear difference (4.7(a)).
In contrast, an early shock tube study reveals that rate of SiH4 decomposition is unaffected
85
by initial SiH4 concentrations [171], although the conditions are different from both this
modeling study and from the Onischuk et al. experiment.
A sensitivity analysis was performed to assess which reactions qualitatively affected
the concentration profiles of SiH4 and Si2H6. Three reactions were identified which, when
the rate constants were modified by no more than two orders of magnitude (102), produced
noticeable changes in these concentration profiles. These reactions are as follows:
SiH2 + SiH2−−⇀↽−− Si2H4
H2 + SiH3SiH −−⇀↽−− Si2H6
H2 + SiH3SiH −−⇀↽−− SiH2 + SiH4
Changing these few rates at the same time can produce concentration profiles that
are close to the experimental results, as shown in Figure 4.7(c). While the authors do not
recommend changing the rates on the basis of the current evidence, the result is shown to
establish that this initial built mechanism is plausible given the uncertainties. For example,
the first altered reaction rate, for SiH2 + SiH2 ←→ Si2H4, was estimated by Giunta et
al. to have the same rate as SiH2 + Si2H6 ←→ Si3H8, which was measured by Jasinski and
Chu[166, 200] (this is the rate used in the RMG mechanism). However, Dollet and de Persis
reported their calculated rate for this reaction to be an order of magnitude different[170].
Therefore, it reasonable to assume considerable uncertainty in this rate. For a final model,
however, modifying rates should be done carefully, with additional evidence and more
knowledge about the fidelity of all rates.
86
4.4 Discussion
Thermodynamic and kinetic data for silicon hydrides were generated to update
RMG’s database, and the computational methods used were benchmarked with available
experimental and high level theoretical data. From both of these studies, it was observed
that where high accuracy data are available, they should be put into a thermodynamic or
kinetic library in RMG for greater model accuracy. However, if data are unavailable, the
calculations provide reasonable enough estimates for initial construction of a model. These
calculations were used in RMG’s database for the hydrogen abstraction reaction family
and to generate hydrogen bond increment values for calculating thermodynamics of radical
species. The additions to RMG’s database allows users to build kinetic models for SiH4
thermal decomposition.
The inclusion of radical reaction families was found to be unimportant to overall
SiH4 decomposition rate at both 613 K and 913 K. Thus, inclusion of these pathways does
not mechanistically change the understanding of thermal decomposition of SiH4, instead
corroborating the prior studies that conclude radical reaction pathways are much slower
compared to the main decomposition pathway to SiH2 and H2 and the subsequent insertion
and elimination reactions that occur. However, the new ability to automatically build more
complex mechanisms for silicon hydrides in general can provide understanding which helps
in selection of process conditions and/or different precursors for CVD which reduce the
amount of radicals formed, thus reducing the risk for unwanted particle formation and
growth in the CVD reactor.
Generation of two detailed kinetic models in RMG, one with pressure dependent
reactions and the other without, showed that the pressure dependent model more accu-
rately compared to experimental data. Furthermore, using the pressure dependent model,
temperature had a large effect on the SiH4 concentration profile with time. A 20 K change
in temperature, which is comparable to a change of only about 1.2 kcal/mol in activation
87
energy at these temperatures, brought the simulation results closer to the experimental SiH4
concentration profile.
Simulations using the RMG generated model were able to reasonably replicate the
effect of residence time on the SiH4 concentration profile at different temperatures. If
uncertainty in temperature by about 20 K is taken into account, the simulations using the
RMG generated model more closely match the experiment.
Comparing the SiH4 and Si2H6 profiles obtained by varying initial SiH4 concen-
tration, the model does not match well with the experimental data. Specifically, the sim-
ulations show no change in these profiles at different initial concentrations, whereas the
experiment shows a clear dependence.
The qualitative trends seen in the experimental results can be obtained by chang-
ing three reaction rates by amounts likely within the uncertainty of these rates; however,
this approach to modifying rates is not advised, as the causes of the discrepancy may lie
elsewhere. Instead, it’s recommended to calculate carefully those reaction rates which can
feasibly be calculated. A global sensitivity analysis could provide additional information
about which rates significantly affect the SiH4 and Si2H6 concentration profiles. Addi-
tionally, given that these results in this dissertation are more in line with the Newman et
al. experiment at different conditions, the effect of SiH4 initial concentration still remains
unclear. Further investigation by detailed measurements and calculations of these important
rates could improve the accuracy of RMG’s database for silicon hydrides.
4.5 Summary
A framework for extending the Reaction Mechanism Generator software package
(RMG) to a new class of elements and chemical reactions has been demonstrated, which
allows generation of detailed kinetic models for silicon hydride decomposition. This work
builds on previous efforts to generate these mechanisms automatically, making use of pub-
lished thermodynamic and kinetic data. Additional data were calculated to allow radical
88
reaction pathways to be enabled, including calculations of rates for hydrogen abstraction
reactions of silicon hydrides, and hydrogen bond increment values for radical species. All
thermodynamic calculations were completed at the G3//B3LYP level of theory, while the
reaction rates were calculated using M062X/MG3S and classical transition state theory.
For silane thermal decomposition, simulations using the RMG-generated model reason-
ably compare to experimental results, with inconclusive results on the effect of initial silane
concentration. Inclusion of the enabled radical reaction families, hydrogen abstraction and
radical recombination, made little difference in the reactor simulations. Many other reac-
tion families exist in RMG which can be extended to silicon hydrides, but availability of
experimental or high level theoretical calculations of their reaction rates remains a chal-
lenge. This work represents an important first step in extending RMG to model chemical
vapor deposition by more accurately representing the detailed gas phase chemistry in this
process, and validating the results for silane thermal decomposition.
4.6 Recommendations
Several recommendations for SiH4 model improvement are outlined below. These
mainly involve refinement of the thermodynamics and kinetics parameters used in RMG’s
database for the construction of the model. Additionally, the possibility of including surface
chemistry in RMG is discussed.
4.6.1 Expansion of thermodynamic libraries for radical species
As suggested previously, certain effects caused by multiradical or divalent silicon
species cannot be accounted for using lower-level electronic structure methods [198]. For
species with these particular characteristics, the comparison of the values calculated with
G3//B3LYP as well as the group additivity values using new HBI corrections, and the higher
level calculations of Katzer et al., yielded differences in ∆fH◦298 larger than 5 kcal/mol.
The radical species with these chemical characteristics that appear in the core or edge of the
89
RMG-generated models should be recalculated, using high level methods, such as coupled-
cluster methods. Availability of computational chemistry software that has capability to run
these methods is imperative; ORCA is one such open-source software [201]. Furthermore,
adequate computational resources are necessary to run such high-level calculations. Such
resources are available using the Discovery cluster at Northeastern University, accessible
through the Massachusetts Green High Performance Computing Center (MGHPCC).
4.6.2 Calculation of rates
In this work, ten hydrogen abstraction reaction rates were calculated using DFT and
added to RMG’s database. For silylene insertion, silylene-to-silene isomerization, and ring
opening reactions, rates were sourced from the work of Adamczyk et al [177–180]. All
of these rates from literature were calculated with G3//B3LYP. Both levels of theory used
to calculate rates are known to be approximate, with errors in DFT energies contributing
to discrepancies in barrier heights that have a significant effect on the rate. Furthermore,
the M062X/MG3S calculations used the rigid rotor harmonic oscillator (RRHO) approxi-
mation and did not check whether the reactants, products and transition states found were
the lowest energy conformers. While these effects are generally larger for thermodynamics
calculations since some error cancellation might occur in the calculation of kinetics, the
magnitude of the effect is unknown. Thus, one way of improving RMG-generated models
would be to calculate reaction rates that are either estimated, or used directly from lower-
level calculations, at higher levels of theory. If computational resources are a problem,
the lower levels of theory can be used, but the 1-D hindered rotor approximation can be
incorporated to find the lowest energy conformer and include the effects of hindered rotors
[202].
90
4.6.2.1 Using automatic transition state theory calculations
A recently developed method calculates reaction rates automatically, in a high-
throughput manner, using canonical transition state theory. The method, known as Au-
toTST, uses a machine-learning algorithm to estimate transition state geometries via group
contribution (discussed in Chapter 1) and is integrated with computational chemistry soft-
wares and CanTherm [33, 39, 40]. AutoTST could in theory be applied to the reaction fam-
ilies utilized in this work which have a reaction barrier in order to generate many reaction
rates for these families. However, significant challenges arise when extending AutoTST to
new reaction families. Furthermore, AutoTST currently has some limitations, including in-
accuracies that result from using DFT energies, errors in calculation of symmetry number,
use of the RRHO approximation, and improper determination of lowest-energy conformer
[203]. Use of AutoTST, or any other methods used to calculate reaction rates more care-
fully, should be used in conjunction with sensitivity analyses that identify which reactions
in the mechanism are important.
4.6.3 Sensitivity analysis
One type of sensitivity analysis was employed in this work, where the reaction rates
were individually varied to assess the effect on silane decomposition with differing initial
silane concentration. Once three reactions were identified, they were all modified at the
same time to visualize a more pronounced effect. However, such a sensitivity analysis
was semi-manual and rather unsystematic. Further sensitivity analysis is available using
Cantera, and can be used to see how each parameter in the reactor system affects a system
output variable. In this way, it is possible to rank each reaction rate constant as affecting
concentration of SiH4, Si2H6, or any other species concentration of interest.
91
4.6.4 Surface chemistry
Further extension of RMG to surface chemistry could allow for more rigorous com-
parison to CVD experiments. Many published experiments and mechanisms could not be
used for validation, since the gas and surface phase analysis were coupled, unlike in the
Onischuk et al. paper [162]. A collaborative project between Professor Richard West and
Professor C. Franklin Goldsmith at Brown University involves extending RMG to heteroge-
neous catalysis. The framework used here, which allows for the representation of adsorbed
species and heterogeneous reactions, is extensible to silicon surface chemistry [204].
Another avenue which can aid in developing reaction rules for silicon surface re-
actions is metadynamics. Metadynamics is a method fueled by molecular dynamics and
explores phase space through a bias potential, based on collective variables that describe
the system. Such an approach can map the potential energy surface of a system without
knowing the types of reactions that can occur beforehand [205]. Zheng and Pfaendtner
demonstrated that the approach provides meaningful results for methanol oxidation [206].
Preliminary work in our group, in collaboration with Intel Corporation, also shows that
metadynamics can find simple pathways of SiH4 decomposition on a silicon surface, mod-
eled as a silicon hydride cluster. Clever choice of collective variables may yield addi-
tional reaction pathways involved in silane chemistry, allowing them to be added to RMG’s
database.
92
5 CONCLUSION
This dissertation made progress in extending automatic mechanism generation to
two domains not previously or commonly studied in this manner: liquid-phase oxidation
of fuels and gas-phase thermal decomposition of silicon hydrides. These extensions were
made within the framework of the open-source software package, the Reaction Mechanism
Generator (RMG).
5.1 Liquid-phase fuel oxidation
To enable liquid-phase mechanism generation, source code was updated with ex-
isting linear solvation energy relationships and group additivity methods for determining
solute descriptors, in order to calculate solvation thermodynamics. In addition, more ca-
pabilities were added to deal with chemical species with unpaired electrons and lone pairs.
Furthermore, group additivity values for nitrogen and sulfur-containing species, for which
RMG can now generate mechanisms, were added. Addition of a database of known so-
lute descriptors now makes it faster to look up values on the fly rather than using group
additivity every time.
Based on well-known correlations, gas-phase reaction rates in RMG were modified
to account for diffusion effects on bimolecular reactions in the presence of a solvent. In-
trinsic solvent effects were incorporated by development of a group contribution method
to correct gas-phase rates for different solvents systematically, based on molecular struc-
ture of the reaction. This method has been developed for two different reaction families,
intra-hydrogen migration and hydrogen abstraction, for eight different solvents that are
93
categorically different. The corrections have been applied to a detailed kinetic model for
predicting induction period of oxidation of biofuel blends. Used as a training set for the
group contribution method, this work also contributes gas-phase geometries for several
chemical species and transition states calculated at the M062X/MG3S level of theory, and
energies calculated using a continuum solvation model in the eight solvents.
There are several possible avenues for future work on liquid-phase mechanism gen-
eration in RMG-Py. Thermodynamic data can be more accurately represented by perform-
ing high-level quantum chemistry calculations. These may be used to replace some of the
estimation techniques and other assumptions made about species containing lone pairs and
unpaired electrons, or to extend RMG to new applications. Calculations in different sol-
vents, beyond the 26 currently in RMG’s database, can also lead to the development of
new solvent descriptors to be used in the estimation of thermodynamic properties using the
linear solvation energy relationships.
The calculation of kinetic solvent effect could be better automated. At this point, the
easiest feature to implement is the on-the-fly calculation of ∆EA, as most of this code has
been written. This would ensure that important reactions are included in the model, since
a post-processing step only ensures modification of rates of reactions that were included
in the model based on their gas-phase rates. Furthermore, building the decision tree for
determining kinetic solvent effect can also be automated using a software package such as
scikit-learn. Some of this work has already been completed, but the automatically built tree
should be compared to the existing tree for accuracy.
Finally, both the solvation thermodynamics and kinetics estimation methods would
benefit from better benchmarking. Specifically, the values calculated from group contri-
bution should be compared to those calculated exactly from quantum chemistry, using
continuum solvation. If computational power and time is available, benchmarking with
higher-level methods such as coupled cluster methods, and/or more explicit solvent repre-
sentations, could greatly benefit the accuracy of detailed kinetic models and also provide
94
much-needed data to the chemistry community.
5.2 Thermal decomposition of silicon hydrides
The second application of this dissertation, gas-phase silicon hydride decomposi-
tion, demonstrated how to add new classes of elements and chemical reactions to mecha-
nism generation software. While microkinetic models had been developed for some silicon
hydrides previously, this work specifically focused on the ability to include radical reaction
types along with commonly used reaction families. Thermodynamic and kinetic data for
silicon hydrides were added from literature to RMG’s database. The work also contributes
new, calculated data for hydrogen bond increment (HBI) values of silicon hydride species,
as well as hydrogen abstraction reaction rates.
The new additions to RMG were demonstrated by building a model for SiH4 ther-
mal decomposition. Comparison to experimental work shows a qualitative and semi-
quantitative match, within the uncertainty in activation energy of the main SiH4 decom-
position channel. Additionally, this work has added insight on which process conditions,
as well as model generation options, have an effect on the SiH4 decomposition profile,
mainly corroborating early studies.
However, the SiH4 and Si2H6 profiles obtained when initial SiH4 concentration is
varied do not compare well with the experimental data. This motivates further research,
including global sensitivity analysis to see which reactions affect this dependence. If im-
portant reactions are identified, the source of these rates should be investigated and if they
are poorly estimated, should be carefully calculated. To calculate these reactions in a high
throughput way, the automated transition state theory calculator, AutoTST, could be uti-
lized. In its current state, AutoTST has some sources of errors that must be addressed, the
two major errors being in the calculation of symmetry number and improper determination
of lowest energy conformer. Despite these flaws, efforts to extend AutoTST to the silylene
insertion reaction family have been initiated. Alternatively, individual reaction rates could
95
be calculated using quantum chemistry methods at a higher level of theory than DFT.
Another research direction is in extension of RMG further, either to surface chem-
istry, or to new silicon precursors. Enabling surface chemistry is important for comparison
to real chemical vapor deposition experiments. Lastly, extending RMG further to new pre-
cursors (such as those containing germanium or silicon) can shift these generated microki-
netic models from merely confirming prior experiments and modeling, to being predictive.
5.3 Summary
This dissertation reports new thermodynamic and chemical kinetic parameters, as
well as new insights about two systems not previously investigated with automatic mech-
anism generation. Each of these chemistry applications demonstrates how machine learn-
ing can be used to predict parameters during the course of a simulation, where on-the-fly
quantum calculations for every unknown parameter would be computationally infeasible.
Importantly, the work also provides a framework for future developers to add capabilities
for new phases or elements to mechanism generation software.
96
References
[1] B. R. Moser. Comparative oxidative stability of fatty acid alkyl esters by acceleratedmethods. JAOCS, J. Am. Oil Chem. Soc., 86(7):699–706, 2009.
[2] S. Blaine and P. E. Savage. Reaction Pathways in Lubricant Degradation. 2. n-Hexadecane Autoxidation. Ind. Eng. Chem. Res., 30:2185–2191, 1991.
[3] F. Garcia-ochoa, A. Romero, and J. Querol. Modeling of the Thermal n-OctaneOxidation in the Liquid Phase. Ind. Eng. Chem. Res., 28(1):43–48, 1989.
[4] A. Ben Amara, A. Nicolle, M. Alves-Fortunato, and N. Jeuland. Toward PredictiveModeling of Petroleum and Biobased Fuel Stability: Kinetics of Methyl Oleate/ n-Dodecane Autoxidation. Energy & Fuels, 27(10):6125–6133, October 2013.
[5] M. T. Swihart and S. L. Girshick. Thermochemistry and kinetics of silicon hydridecluster formation during thermal decomposition of silane. J. Phys. Chem. B, 103(1):64–76, 1999.
[6] H. W. Wong, X. Li, M. T. Swihart, and L. J. Broadbelt. Detailed kinetic modelingof silicon nanoparticle formation chemistry via automated mechanism generation. J.Phys. Chem. A, 108(46):10122–10132, 2004.
[7] W. H. Green, Jr, J. W. Allen, B. A. Buesser, R. W. Ashcraft, G. J. O. Beran, C. A.Class, C. Gao, C. F. Goldsmith, M. R. Harper, A. Jalan, M. Keceli, G. R. Magoon,D. M. Matheu, S. S. Merchant, J. D. Mo, S. Petway, S. Raman, S. Sharma, J. Song,Y. V. Suleimanov, K. M. Van Geem, J. Wen, R. H. West, A. Wong, H.-W. Wong,P. E. Yelvington, N. Yee, and J. Yu. RMG - Reaction Mechanism Generator, 2013.URL http://greengroup.github.io/RMG-Py/.
[8] C. W. Gao, J. W. Allen, W. H. Green, and R. H. West. Reaction Mechanism Gen-erator : automatic construction of chemical kinetic mechanisms. Comput. Phys.Commun., 2016.
[9] F. Seyedzadeh Khanshan and R. H. West. Developing detailed kinetic models ofsyngas production from bio-oil gasification using Reaction Mechanism Generator(RMG). Fuel, 163:25–33, 2016.
[10] C. A. Class, M. Liu, G. Vandeputte, and W. H. Green. Automatic mechanism gen-eration for pyrolysis of di-tert-butyl sulfide. Phys. Chem. Chem. Phys., 18:21651–21658, 2016.
97
[11] F. S. Khanshan. Automatic generation of detailed kinetic models for complex chem-ical systems. PhD thesis, Northeastern University, College of Engineering, Depart-ment of Chemical Engineering, http://hdl.handle.net/2047/D20213055, 2016.
[12] K. Prozument, Y. V. Suleimanov, B. Buesser, J. M. Oldham, W. H. Green, A. G.Suits, and R. W. Field. A Signature of Roaming Dynamics in the Thermal De-composition of Ethyl Nitrite: Chirped-Pulse Rotational Spectroscopy and KineticModeling. J. Phys. Chem. Lett., 5:3641–3648, 2014.
[13] I. Ugi, J. Bauer, and J. Brandt. New applications of computers in chemistry. Angew.Chemie Int. Ed. English, 18:111–123, 1979.
[14] Y. Yoneda. A Computer Program Packages for the Analysis, Creation, and Esti-mation of Generalized Reactions-GRACE. I. Generation of Elementary ReactionNetwork in Radical Reactions-A/GRACE(I). Bull. Chem. Soc. Jpn., 52(1):8–14,1979.
[15] F. D. Maio and P. Lignola. KING, a KInetic Network Generator. Chem. Eng. Sci.,47(9-11):2713–2718, 1992.
[16] E. Ranzi, T. Faravelli, P. Gaffuri, and A. Sogaro. Low-Temperature Comustion:Automatic Generation of Primary Oxidation Reactions and Lumping Procedures.Combust. Flame, 102:179–192, 1995.
[17] R. G. Susnow, A. M. Dean, W. H. Green, P. Peczak, and L. J. Broadbelt. Rate-BasedConstruction of Kinetic Models for Complex Systems. J. Phys. Chem. A, 5639(96):3731–3740, 1997.
[18] V. Warth, F. Battin-Leclerc, R. Fournet, P. Glaude, G. Come, and G. Scacchi. Com-puter based generation of reaction mechanisms for gas-phase oxidation. Comput.Chem., 24(5):541–60, July 2000.
[19] A. Ratkiewicz and T. N. Truong. Application of chemical graph theory for automatedmechanism generation. J. Chem. Inf. Comput. Sci., 43:36–44, 2003.
[20] S. Rangarajan, A. Bhan, and P. Daoutidis. Language-oriented rule-based reactionnetwork generation and analysis: Description of RING. Comput. Chem. Eng., 45:114–123, October 2012.
[21] R. Van De Vijver, N. M. Vandewiele, P. L. Bhoorasingh, B. L. Slakman, F. S. Khan-shan, H. H. Carstensen, M. F. Reyniers, G. B. Marin, R. H. West, and K. M. VanGeem. Automatic mechanism and kinetic model generation for gas- And solution-phase processes: A perspective on best practices, recent advances, and future chal-lenges. Int. J. Chem. Kinet., 47(4):199–231, 2015.
[22] J. W. Allen, R. W. Ashcraft, G. J. Beran, B. A. Buesser, C. A. Class, C. Gao, C. F.Goldsmith, M. R. Harper, A. Jalan, M. Keceli, G. R. Magoon, D. M. Matheu, S. S.Merchant, J. D. Mo, S. Petway, S. Ruman, S. Sharma, K. M. Van Geem, J. Song,
98
Y. Suleymanov, N. Vandewiele, J. Wen, R. H. West, A. Wong, H.-W. Wong, N. W.-W. Yee, P. E. Yelvington, J. Yu, and W. H. Green. RMG (Reaction MechanismGenerator) version 4.0, 2013. URL http://rmg.sourceforge.net/.
[23] J. Song. Building robust chemical reaction mechanisms: nextgeneration of automatic model construction software. PhD the-sis, Massachusetts Inst. Technol. Dept. Chem. Eng., 2004. URLhttp://dspace.mit.edu/handle/1721.1/30058.
[24] G. P. Smith, D. M. Golden, M. Frenklach, N. W. Moriarty, B. Eite-neer, M. Goldenberg, C. T. Bowman, R. K. Hanson, S. Song, J. WilliamC. Gardiner, V. V. Lissianski, and Z. Qin. Gri-mech 3.0. URLhttp://www.me.berkeley.edu/gri mech/.
[25] J. W. Allen, C. F. Goldsmith, and W. H. Green. Automatic estimation of pressure-dependent rate coefficients. Phys. Chem. Chem. Phys., 14(3):1131–55, jan 2012.
[26] L. J. Broadbelt and J. Pfaendtner. Lexicography of kinetic modeling of complexreaction networks. AIChE J., 51(8):2112–2121, aug 2005.
[27] S. W. Benson and J. H. Buss. Additivity Rules for the Estimation of MolecularProperties. Thermodynamic Properties. J. Chem. Phys., 29(3):546, 1958.
[28] T. H. Lay, J. W. Bozzelli, A. M. Dean, and E. R. Ritter. Hydrogen atom bond incre-ments for calculation of thermodynamic properties of hydrocarbon radical species.J. Phys. Chem., 99(39):14514–14527, 1995.
[29] G. R. Magoon and W. H. Green. Design and implementation of a next-generationsoftware interface for on-the-fly quantum and force field calculations in automatedreaction mechanism generation. Comput. Chem. Eng., 52:35–45, may 2013.
[30] R. Sumathi, H. H. Carstensen, and W. H. Green. Reaction rate prediction via groupadditivity part 1: H abstraction from alkanes by H and CH3. J. Phys. Chem. A, 105(28):6910–6925, 2001.
[31] M. Evans and M. Polanyi. Further considerations on the thermodynamics of chemi-cal equilibria and reaction rates. Trans. Faraday Soc., 32:1333–1360, 1936.
[32] H. Eyring, H. Gershinowitz, and C. E. Sun. The Absolute Rate of HomogeneousAtomic Reactions. J. Chem. Phys., 3(12):786, 1935.
[33] J. W. Allen and W. H. Green. CanTherm: Open-source software for thermodynam-ics and kinetics. Included in: Reaction Mechanism Generator, v2.0.0, 2016. URLhttp://reactionmechanismgenerator.github.io.
[34] A. D. Isaacson, D. G. Truhlar, S. N. Rai, R. Steckler, G. C. Hancock, B. C. Garrett,and M. J. Redmon. Polyrate: A general computer program for variational transi-tion state theory and semiclassical tunneling calculations of chemical reaction rates.Comput. Phys. Commun., 47.
99
[35] S. J. Klippenstein, A. F. Wagner, R. C. Dunbar, D. M. Wardlaw, S. H. Robertson,and J. A. Miller. Variflex: Version 2.02m, 2010.
[36] J. M. P. L. L. L. A. M. P. J. S. T. L. N. J R Baker, N F Ortiz and T. J. D. Kumar.Multiwell-2012.1 software, 2012.
[37] P. M. Zimmerman. Automated discovery of chemically reasonable elementary reac-tion steps. J. Comput. Chem., 34(16):1385–92, jun 2013.
[38] J. Zador and H. Najm. Kinbot: An automated code for exploring reaction pathwaysin the gas phase. Sandia National Laboratories Technical Report SAND2012-8095,2012.
[39] P. Bhoorasingh and R. West. Transition state geometry prediction using moleculargroup contributions. Phys. Chem. Chem. Phys., 17(48):32173–32182, 2015.
[40] P. Bhoorasingh, B. Slakman, F. S. Khanshan, J. Cain, and R. West. Kineticdata for manuscript describing the AutoTST algorithm for automated Transi-tion State Theory calculations of chemical reaction rates. figshare.com, page10.6084/m9.figshare.4234160, 12 2016.
[41] C. Kingsford and S. L. Salzberg. What are decision trees? Nat. Biotechnol., 26(9):1011–3, 2008.
[42] L. Han, Y. Wang, and S. H. Bryant. Developing and validating predictive deci-sion tree models from mining chemical structural fingerprints and highthroughputscreening data in PubChem. BMC Bioinformatics, 9(1):401, 2008.
[43] J. R. Quinlan. C4. 5: programs for machine learning. Elsevier, 2014.
[44] D. Thornley, M. Zverev, and S. Petridis. Machine learned regression for abduc-tive DNA sequencing. Proc. - 6th Int. Conf. Mach. Learn. Appl. ICMLA 2007,(December):254–259, 2007.
[45] Y. Hu and H. Cheng. Application of stochastic models in identification and appor-tionment of heavy metal pollution sources in the surface soils of a large-scale region.Environ. Sci. Technol., 47(8):3752–3760, 2013.
[46] V. F. McNeill, J. L. Woo, D. D. Kim, A. N. Schwier, N. J. Wannell, A. J. Sumner,and J. M. Barakat. Aqueous-phase secondary organic aerosol and organosulfateformation in atmospheric aerosols: a modeling study. Environ. Sci. Technol., 46:8075–8081, aug 2012.
[47] Y. B. Lim and B. J. Turpin. Laboratory evidence of organic peroxide and peroxy-hemiacetal formation in the aqueous phase and implications for aqueous OH. Atmos.Chem. Phys., 15(22):12867–12877, 2015.
[48] K. Chatelain, A. Nicolle, A. Ben Amara, L. Catoire, and L. Starck. Wide RangeExperimental and Kinetic Modeling Study of Chain Length Impact on n -AlkanesAutoxidation. Energy & Fuels, 30:1294–1303, 2016.
100
[49] V. Thavasi, R. P. A. Bettens, and L. P. Leong. Temperature and solvent effects onradical scavenging ability of phenols. J. Phys. Chem. A, 113(13):3068–3077, apr2009.
[50] A. Jalan, R. H. West, and W. H. Green. An extensible framework for capturingsolvent effects in computer generated kinetic models. J. Phys. Chem. B, 117(10):2955–70, March 2013.
[51] H. M. Senn and W. Thiel. QM/MM methods for biomolecular systems. Angew.Chem. Int. Ed. Engl., 48(7):1198–229, January 2009.
[52] J. Tomasi, B. Mennucci, and R. Cammi. Quantum mechanical continuum solvationmodels. Chem. Rev., 105(8):2999–3093, August 2005.
[53] A. Jalan, R. W. Ashcraft, R. H. West, and W. H. Green. Predicting solvation energiesfor kinetic modeling. Annu. Reports Sect. “C” (Physical Chem.), 106:211, 2010.
[54] M. Kamlet and R. Taft. The Solvatochromic Comparison Method. I. The beta-Scaleof Solvent Hydrogen-Bond Acceptor (HBA) Basicities. J. Am. Chem. Soc., 98(2):377, 1976.
[55] M. H. Abraham, A. Ibrahim, and A. M. Zissimos. Determination of sets of solutedescriptors from chromatographic measurements. J. Chromatogr. A, 1037(1-2):29–47, May 2004.
[56] M. Abraham. Scales of solute hydrogen-bonding: their construction and applicationto physicochemical and biochemical processes. Chem. Soc. Rev., 96(5), 1993.
[57] C. Mintz, M. Clark, W. E. Acree, and M. H. Abraham. Enthalpy of solvation corre-lations for gaseous solutes dissolved in water and in 1-octanol based on the Abrahammodel. J. Chem. Inf. Model., 47(1):115–21, January 2007.
[58] J. Platts and D. Butina. Estimation of molecular linear free energy relation descrip-tors using a group contribution approach. J. Chem. Inf. Comput. Sci., 39:835–845,1999.
[59] C. Mintz, M. Clark, K. Burton, W. E. Acree, and M. H. Abraham. Enthalpy of Solva-tion Corrections for Gaseous Solutes Dissolved in Benzene and in Alkane SolventsBased on the Abraham Model. QSAR Comb. Sci., 26(8):881–888, 2007.
[60] R. A. Pierotti. The solubility of gases in liquids. J. Phys. Chem., 67(9):1840–1845,1963.
[61] J. Kongsted, P. Soderhjelm, and U. Ryde. How accurate are continuum solvationmodels for drug-like molecules? J. Comput. Aided. Mol. Des., 23(7):395–409, July2009.
[62] C. Kelly, C. Cramer, and D. Truhlar. SM6: a density functional theory continuumsolvation model for calculating aqueous solvation free energies of neutrals, ions, andsolute-water clusters. J. Chem. Theory . . . , pages 1133–1152, 2005.
101
[63] A. Ahmed and S. I. Sandler. Hydration free energies of multifunctional nitroaromaticcompounds. J. Chem. Theory Comput., 9(6):2774–2785, 2013.
[64] W. H. Green, Jr, J. W. Allen, B. A. Buesser, R. W. Ashcraft, G. J. O. Beran, C. A.Class, C. Gao, C. F. Goldsmith, M. R. Harper, A. Jalan, M. Keceli, G. R. Ma-goon, D. M. Matheu, S. S. Merchant, J. D. Mo, S. Petway, S. Raman, S. Sharma,J. Song, Y. V. Suleimanov, K. M. Van Geem, J. Wen, R. H. West, A. Wong, H.-W. Wong, P. E. Yelvington, N. Yee, and J. Yu. Rmg-database project. URLhttp://github.com/GreenGroup/RMG-database.
[65] M. Abraham. Hydrogen Bonding. Part 34. The Factors that Influence the Solubilityof Gases and Vapours in Water at 298 K, and a New Method for its Determination.J. Chem. Soc., Perkin Trans., 1994.
[66] ACD/Labs. Acd/i-lab, 2010-2013. URLhttp://ilab.acdlabs.com/iLab2/.
[67] G. D. Marenich, A. V.; Kelly, C. P.; Thompson, J. D.; Hawkins and D. G. Cham-bers, C. C.; Giesen, D. J.; Winget, P.; Cramer, C. J.; Truhlar. Minnesota SolvationDatabase-version 2012, 2012. URL http://comp.chem.umn.edu/mnsol/.
[68] D. Tromans. Temperature and pressure dependent solubility of oxygen in water: athermodynamic analysis. Hydrometallurgy, 48(3):327–342, 1998.
[69] R. F. Prini and R. Crovetto. Evaluation of Data on Solubility of Simple Apolar Gasesin Light and Heavy Water at High Temperature, 1989.
[70] H. W. Wong, J. C. A. Nieto, M. T. Swihart, and L. J. Broadbelt. Thermochemistryof silicon-hydrogen compounds generalized from quantum chemical calculations. J.Phys. Chem. A, 108(5):874–897, 2004.
[71] J. A. Platts, M. H. Abraham, Y. H. Zhao, A. Hersey, L. Ijaz, and D. Butina. Corre-lation and prediction of a large blood-brain distribution data set - An LFER study.Eur. J. Med. Chem., 36(9):719–730, 2001.
[72] H. Struebing, Z. Ganase, P. G. Karamertzanis, E. Siougkrou, P. Haycock, P. M.Piccione, A. Armstrong, A. Galindo, and C. S. Adjiman. Computer-aided moleculardesign of solvents for accelerated reaction kinetics. Nat. Chem., (September), sep2013.
[73] F. F. Crim. Molecular reaction dynamics across the phases: similarities and differ-ences. Faraday Discuss., 157:9, 2012.
[74] F. Briers, D. Chapman, and E. Walters. The Influence of the Intensity of Illuminationon the Velocity of Photochemical Changes. The Determination of the Mean Life ofa Hypothetical Catalyst. J. Chem. Soc., 129:562–569, 1926.
102
[75] F. Briers and D. Chapman. The Influence of the Intensity of Illumination on theVelocity of the Photochemical Union of Bromine and Hydrogen, and the Determi-nation of the Mean Life of a Postulated Catalyst. J. Chem. Soc., pages 1802–1811,1928.
[76] E. T. Denisov. Liquid-Phase Reaction Rate Constants.Springer US, 2012. ISBN 9781468483000. URLhttps://books.google.com/books?id=j8LeBwAAQBAJ.
[77] H. W. Melville. The Photochemical Polymerization of Methyl Methacrylate Vapour.Proc. R. Soc. A Math. Phys. Eng. Sci., 163:511–542, dec 1937.
[78] C. Swain and P. Bartlett. Rate Constants of the Steps in Addition Polymerization. II.Use of the Rotating-Sector Method on Liquid Vinyl Acetate. J. Am. Chem. Soc., 68(11):2381–2386, 1946.
[79] A. Shepp. Rate of Recombination of Radicals. I. A General Sector Theory; A Cor-rection to the Methyl Radical Recombination Rate. J. Chem. Phys., 24(5):939–943,1956.
[80] D. Griller and K. U. Ingold. Free-Radical Clocks. Acc. Chem. Res., 13:317–323,1980.
[81] D. Geske and A. Maki. Electrochemical Generation of Free Radicals and TheirStudy by Electron Spin Resonance Spectroscopy; the Nitrobenzene Anion Radical.J. Am. Chem. Soc., 82(9):2671–2676, 1960.
[82] B. Roschek, K. a. Tallman, C. L. Rector, J. G. Gillmore, D. a. Pratt, C. Punta, andN. a. Porter. Peroxyl radical clocks. J. Org. Chem., 71(9):3527–32, apr 2006.
[83] M. Jha and D. a. Pratt. Kinetic solvent effects on peroxyl radical reactions. Chem.Commun., pages 1252–1254, mar 2008.
[84] P. Hohenberg and W. Kohn. Inhomogeneous Electron Gas. Phys. Rev., 155(1962):864, 1964.
[85] W. Kohn and L. J. Sham. Self-consistent equations including exchange and correla-tion effects. Phys. Rev., 385(1951):1133–1138, 1965.
[86] X. Xu, I. M. Alecu, and D. G. Truhlar. How Well Can Modern Density FunctionalsPredict Internuclear Distances at Transition States ? J. Chem. Theory Comput., 7:1667–1676, 2011.
[87] A. J. Cohen, P. Mori-Sanchez, and W. Yang. Insights into current limitations ofdensity functional theory. Science (80-. )., 321:792–794, aug 2008.
[88] R. Arnaud and N. Bugaud. Role of polar and enthalpic effects in the addition ofmethyl radical to substituted alkenes: a density functional study including solventeffects. J. Am. . . . , 7863(13):5733–5740, 1998.
103
[89] A. Berkessel and J. A. Adrio. Dramatic acceleration of olefin epoxidation in fluo-rinated alcohols: activation of hydrogen peroxide by multiple h-bond networks. J.Am. Chem. Soc., 128(41):13412–13420, oct 2006.
[90] J. Herbert. Dielectric continuum solvation models, 2013. URLhttps://chemistry.osu.edu/ herbert/projects/PCM.html.
[91] R. S. Cataliotti, F. Aliotta, and R. Ponterio. Silver nanoparticles behave as hydropho-bic solutes towards the liquid water structure in the interaction shell. A Raman studyin the O-H stretching region. Phys. Chem. Chem. Phys., 11(47):11258–11263, dec2009.
[92] J. R. Pliego. Shells theory of solvation and the long-range Born correction. Theor.Chem. Acc., 128(3):275–283, nov 2011.
[93] B. Mennucci and R. Cammi, editors. Continuum Solvation Models in Chem-ical Physics: From Theory to Applications. John Wiley and Sons Ltd, TheAtrium, Southern Gate, Chichester, West Sussex PO19 8SQ, England, 2007. ISBN9780470029381.
[94] A. V. Marenich, C. J. Cramer, and D. G. Truhlar. Universal solvation model based onsolute electron density and on a continuum model of the solvent defined by the bulkdielectric constant and atomic surface tensions. J. Phys. Chem. B, 113(18):6378–96,May 2009.
[95] M. J. Frisch, G. W. Trucks, H. B. Schlegel, G. E. Scuseria, M. A. Robb, J. R. Cheese-man, G. Scalmani, V. Barone, B. Mennucci, G. A. Petersson, H. Nakatsuji, M. Cari-cato, X. Li, H. P. Hratchian, A. F. Izmaylov, J. Bloino, G. Zheng, J. L. Sonnenberg,M. Had, and D. J. Fox. Gaussian 09, Revision D.01, 2009.
[96] S. Miertus, E. Scrocco, and J. Tomasi. Electrostatic interaction of a solute with acontinuum. A direct utilization of ab initio molecular potentials for the prevision ofsolvent effects. Chem. Phys., 55:117–129, 1981.
[97] A. Klamt and G. Schuurmann. COSMO: a new approach to dielectric screening insolvents with explicit expressions for the screening energy and its gradient. J. Chem.Soc. Perkin Trans., 2, 1993.
[98] K. Baldridge and A. Klamt. First principles implementation of solvent effects with-out outlying charge error. J. Chem. Phys., 106(16):6622, 1997.
[99] A. Pomogaeva and D. M. Chipman. Hydration energy from a composite method forimplicit representation of solvent. J. Chem. Theory Comput., 10(1):211–219, 2014.
[100] N. M. Silva, P. Deglmann, and J. R. Pliego. CMIRS Solvation Model for Methanol:Parametrization, Testing, and Comparison with SMD, SM8, and COSMO-RS. J.Phys. Chem. B, 120(49):12660–12668, 2016.
104
[101] L. Valgimigli and J. Banks. Kinetic Solvent Effects on Hydroxylic Hydrogen AtomAbstractions Are Independent of the Nature of the Abstracting Radical. Two Ex-treme Tests Using Vitamin E and Phenol. J. Am. Chem. Soc., 117(9):9966–9971,1995.
[102] P. Das and M. Encinas. Reactions of tert-Butoxy Radicals with Phenols. Comparisonwith the Reactions of Carbonyl Triplets. J. Am. Chem. Soc., 103(14):4162–4166,1981.
[103] L. Valgimigli, K. U. Ingold, and J. Lusztyk. Antioxidant Activities of Vitamin EAnalogues in Water and a Kamlet-Taft beta-Value for Water. J. Am. Chem. Soc., 118(15):3545–3549, 1996.
[104] Z. Alfassi, R. Huie, and P. Neta. Solvent Effects on the Rate Constants for Reactionof Trichloromethylperoxyl Radicals with Organic Reductants. J. Phys. Chem., 97:7253–7257, 1993.
[105] M. Foti, C. Daquino, and C. Geraci. Electron-Transfer Reaction of Cinnamic Acidsand Their Methyl Esters with the DPPH Radical in Alcoholic Solutions. J. Org.Chem., 69(14):2309–2314, 2004.
[106] D. Snelgrove and J. Lusztyk. Kinetic Solvent Effects on Hydrogen-Atom Abstrac-tions: Reliable, Quantitative Predictions via a Single Empirical Equation. J. Am.Chem. Soc., 123(3):469–477, 2001.
[107] G. Litwinienko and K. U. Ingold. Abnormal solvent effects on hydrogen atom ab-stractions. 1. The reactions of phenols with 2,2-diphenyl-1-picrylhydrazyl (dpph*)in alcohols. J. Org. Chem., 68(9):3433–8, May 2003.
[108] G. Litwinienko and K. U. Ingold. Abnormal solvent effects on hydrogen atom ab-straction. 2. Resolution of the curcumin antioxidant controversy. The role of sequen-tial proton loss electron transfer. J. Org. Chem., 69(18):5888–96, September 2004.
[109] G. Litwinienko and K. Ingold. Abnormal Solvent Effects on Hydrogen Atom Ab-straction. 3. Novel Kinetics in Sequential Proton Loss Electron Transfer Chemistry.J. Org. Chem., 70(22):8982–8990, 2005.
[110] M. Nielsen and K. Ingold. Kinetic Solvent Effects on Proton and Hydrogen AtomTransfers from Phenols/ Similarities and Differences. J. Am. Chem. Soc., 128(4):1172–1182, 2006.
[111] J. J. Warren and J. M. Mayer. Tuning of the thermochemical and kinetic properties ofascorbate by its local environment: solution chemistry and biochemical implications.J. Am. Chem. Soc., 132(22):7784–93, jun 2010.
[112] I. Sajenko, V. Pilepic, C. J. Brala, and S. Ursic. Solvent dependence of the kineticisotope effect in the reaction of ascorbate with the 2,2,6,6-tetramethylpiperidine-1-oxyl radical: tunnelling in a small molecule reaction. J. Phys. Chem. A, 114(10):3423–3430, mar 2010.
105
[113] A. L. Koner, U. Pischel, and W. M. Nau. Kinetic solvent effects on hydrogen ab-straction reactions. Org. Lett., 9(15):2899–902, July 2007.
[114] J. Lalevee, X. Allonas, and J. Fouassier. Reactivity of Carbon-Centered Radicalstoward Acrylate Double Bonds: Relative Contribution of Polar vs. Enthalpy Effects.J. Phys. Chem. A, pages 4326–4334, 2004.
[115] J. Lalevee, X. Allonas, J. P. Fouassier, D. Rinaldi, M. F. Ruiz Lopez, and J. L. Rivail.Solvent effect on the radical addition reaction to double bond: Experimental andquantum chemical investigations. Chem. Phys. Lett., 415(4-6):202–205, November2005.
[116] J. Lalevee, X. Allonas, and J.-P. Fouassier. Addition of carbon-centered radicalsto double bonds: influence of the alkene structure. J. Org. Chem., 70(3):814–9,February 2005.
[117] M. W. Wong and L. Radom. Radical Addition to Alkenes: Further Assessment ofTheoretical Procedures. J. Phys. Chem. A, 102(12):2237–2245, March 1998.
[118] H. Fischer and L. Radom. Factors Controlling the Addition of Carbon-CenteredRadicals to Alkenes- An Experimental and Theoretical Perspective. Angew. ChemieInt. Ed., 2001.
[119] G. P. F. Wood, M. S. Gordon, L. Radom, and D. M. Smith. Nature of Glycine andIts a-Carbon Radical in Aqueous Solution : A Theoretical Investigation. J. Chem.Theory Comput., 4:1788–1794, 2008.
[120] B. Chan, R. J. O’Reilly, C. J. Easton, and L. Radom. Reactivities of Amino AcidDerivatives Toward Hydrogen Abstraction by Cl * and OH *. J. Org. Chem., 77(21):9807–9812, November 2012.
[121] R. S. Mulliken. Electronic Population Analysis on LCAO[Single Bond]MO Molec-ular Wave Functions. I. J. Chem. Phys., 23(10):1833, 1955.
[122] A. Garcıa, D. Domınguez, and A. Navarro-Vazquez. Addition of carbon centeredradicals to methyl 3-(methylamino)acrylate: The regioselectivity of radical additionto enamino esters. Comput. Theor. Chem., 979:17–21, January 2012.
[123] D. V. Avila, U. Ingold, and J. Lusztyk. Solvent Effects on the Competitive beta-Scission and Hydrogen Atom Abstraction Reactions of the Cumyloxyl Radical. Res-olution of a Long-standing Problem. J. Am. Chem. Soc., 115(2):466–470, 1993.
[124] M. Weber and H. Fischer. Absolute Rate Constants for the beta-Scission and Hy-drogen Abstraction Reactions of the tert-Butoxyl Radical and for Several RadicalRearrangements: Evaluating Delayed Radical Formations by Time-Resolved Elec-tron Spin Resonance. J. Am. Chem. Soc., 121:7381–7388, 1999.
106
[125] E. Baciocchi, M. Bietti, M. Salamone, and S. Steenken. Spectral properties andabsolute rate constants for beta-scission of ring-substituted cumyloxyl radicals. Alaser flash photolysis study. J. Org. Chem., 67(7):2266–70, April 2002.
[126] M. Newcomb, P. Daublain, and J. H. Horner. p-Nitrobenzenesulfenate esters asprecursors for laser flash photolysis studies of alkyl radicals. J. Org. Chem., 67(24):8669–71, November 2002.
[127] M. Bietti, G. Gente, and M. Salamone. Structural effects on the beta-scission reac-tion of tertiary arylcarbinyloxyl radicals. The role of alpha-cyclopropyl and alpha-cyclobutyl groups. J. Org. Chem., 70(17):6820–6, August 2005.
[128] C. Reichardt. Empirical Parameters of the Polarity of Solvents. Angew. Chemie Int.Ed. English, 4(1):29, 1965.
[129] M. F. Ruiz-Lopez, X. Assfeld, J. I. Garcia, J. a. Mayoral, and L. Salvatella. Solventeffects on the mechanism and selectivities of asymmetric Diels-Alder reactions. J.Am. Chem. Soc., 115(19):8780–8787, sep 1993.
[130] R. Bini, C. Chiappe, and V. Mestre. A rationalization of the solvent effect on theDiels-Alder reaction in ionic liquids using multiparameter linear solvation energyrelationships. Org. Biomol. Chem., 6:2522–2529, 2008.
[131] R. Breslow and T. Guo. Diels-Alder Reactions in Nonaqueous Polar Solvents. Ki-netic Effects of Chaotropic and Antichaotropic Agents and of beta-Cyclodextrin. J.Am. Chem. Soc., 110(17):5613–5617, 1988.
[132] M. E. Sheehan and P. N. Sharratt. Molecular dynamics methodology for the studyof the solvent effects on a concentrated Diels-Alder reaction and the separation ofthe post-reaction mixture. Comput. Chem. Eng., 22:S27–S33, mar 1998.
[133] J. Soto-Delgado, R. A. Tapia, and J. Torras. Multiscale Treatment for the MolecularMechanism of a Diels-Alder Reaction in Solution: A QM/MM-MD Study. J. Chem.Theory Comput., 12(10):4735–4742, 2016.
[134] V. D. Kiselev, D. A. Kornilov, I. A. Sedov, and A. I. Konovalov. Solvent Influ-ence on the Diels-Alder Reaction Rates of 9-(Hydroxymethyl)anthracene and 9,10-Bis(hydroxymethyl)anthracene with Two Maleimides. Int. J. Chem. Kinet., 49(1):61–68, 2017.
[135] Organic chemistry forum: p-aminophenol n-acylation mechanism question, 2011. URLhttp://www.chemicalforums.com/index.php?topic=51784.0.
[136] S. Xu, I. Held, B. Kempf, H. Mayr, W. Steglich, and H. Zipse. The DMAP-catalyzedacetylation of alcohols–a mechanistic study (DMAP = 4-(dimethylamino)pyridine).Chemistry, 11(16):4751–4757, aug 2005.
107
[137] A. Hassner, L. Krepski, and V. Alexanian. Aminopyridines as acylation catalysts fortertiary alcohols. Tetrahedron, 34:2069–2076, 1978.
[138] A. Berkessel and J. Adrio. Kinetic Studies of Olefin Epoxidation with HydrogenPeroxide in 1,1,1,3,3,3,-Hexafluoro-2-propanol Reveal a Crucial Catalytic Role forSolvent Clusters. Adv. Synth. Catal., 346:275–280, 2004.
[139] A. Berkessel, J. a. Adrio, D. Huttenhain, and J. M. Neudorfl. Unveiling the ”boostereffect” of fluorinated alcohol solvents: aggregation-induced conformational changesand cooperatively enhanced H-bonding. J. Am. Chem. Soc., 128(26):8421–8426, jul2006.
[140] S. D. Visser, J. Kaneti, R. Neumann, and S. Shaik. Fluorinated Alcohols EnableOlefin Epoxidation by H2O2: Template Catalysts. J. Org. Chem., 68(7):2903–2912,2003.
[141] B. Steenackers, A. Neirinckx, L. DeCooman, I. Hermans, and D. DeVos. Thestrained sesquiterpene beta-caryophyllene as a probe for the solvent-assisted epoxi-dation mechanism. ChemPhysChem, 15:966–973, 2014.
[142] G. Almerindo and J. Pliego. Ab initio investigation of the kinetics and mechanismof the neutral hydrolysis of formamide in aqueous solution. J. Braz. Chem. Soc., 18(4):696–702, 2007.
[143] M. Bietti and M. Salamone. Solvent Effects on the O-Neophyl Rearrangement of1,1-Diarylalkoxyl Radicals. A Laser Flash Photolysis Study. J. Org. Chem., 70(25):10603–10606, 2005.
[144] S. Rice. Diffusion-Controlled Reactions in Solution. In C. Bamford and C. Tipper,editors, Compr. Chem. Kinet., chapter 2, pages 3–45. Elsevier Pub. Co., Amsterdam,New York, 1985.
[145] J. McGowan. Molecular Volumes and structural chemistry. Recl. des Trav. Chim.des Pays-Bas, 75(2):193–208, 1956.
[146] R. L. Rowley, W. V. Wilding, J. L. Oscarson, Y. Yang, N. A. Zundel, T. E. Daubert,and R. P. Danner. DIPPR Data Compilation of Pure Chemical Properties. DesignInstitute for Physical Properties, AIChE, New York, NY, 2009.
[147] Y. Zhao and D. G. Truhlar. A new local density functional for main-group ther-mochemistry, transition metal bonding, thermochemical kinetics, and noncovalentinteractions. J. Chem. Phys., 125(19):194101, 2006.
[148] Y. Zhao and D. G. Truhlar. The M06 suite of density functionals for main groupthermochemistry, thermochemical kinetics, noncovalent interactions, excited states,and transition elements: two new functionals and systematic testing of four M06-class functionals and 12 other function. Theor. Chem. Acc., 120(1-3):215–241, 2008.
108
[149] R. Krishnan, J. S. Binkley, R. Seeger, and J. A. Pople. Self-consistent molecularorbital methods. XX. A basis set for correlated wave functions. J. Chem. Phys., 72(1):650–654, 1980.
[150] T. Clark, J. Chandrasekhar, G. W. Spitznagel, and P. V. R. Schleyer. Efficient diffusefunction-augmented basis sets for anion calculations. III. The 3-21+G basis set forfirst-row elements, Li-F. J. Comput. Chem., 4(3):294–301, 1983.
[151] M. J. Frisch, J. A. Pople, and J. S. Binkley. Self-consistent molecular orbital methods25. Supplementary functions for Gaussian basis sets. J. Chem. Phys., 80(7):3265,1984.
[152] L. A. Curtiss, K. Raghavach Ari, P. C. Redfern, V. Rassolov, and J. A. Pople.Gaussian-3 (G3) theory for molecules containing first and second-row atoms. J.Chem. Phys., 109(18):7764–7776, 1998.
[153] P. L. Fast, M. L. Sanchez, and D. G. Truhlar. Multi-coefficient Gaussian-3 methodfor calculating potential energy surfaces. Chem. Phys. Lett., 306(5-6):407–410, jun1999.
[154] B. J. Lynch, Y. Zhao, and D. G. Truhlar. Effectiveness of diffuse basis functions forcalculating relative energies by density functional theory. J. Phys. Chem. A, 107(9):1384–1388, 2003.
[155] R. Schmid. Handbook of Solvents. In G. Wypych, editor, Handbook of Solvents,chapter 13, pages 737–846. William Andrew, ChemTec, 2001.
[156] B. L. Slakman and R. H. West. Intrinsic solvation kinetics for automatic mechanismgeneration. In Gordon Research Conference on Atomic and Molecular Interactions,Easton, MA, July 2014.
[157] F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blon-del, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau,M. Brucher, M. Perrot, and E. Duchesnay. Scikit-learn: Machine learning in Python.Journal of Machine Learning Research, 12:2825–2830, 2011.
[158] N. OBoyle, A. Tenderholt, and K. Langner. cclib: a library for package-independentcomputational chemistry algorithms. J. Comp. Chem., 29.
[159] J. Schreiber, Z. L. Wescoe, R. Abu-Shumays, J. T. Vivian, B. Baatar, K. Karplus,and M. Akeson. Error rates for nanopore discrimination among cytosine, methylcy-tosine, and hydroxymethylcytosine along individual DNA strands. Proc. Natl. Acad.Sci. U. S. A., 110(47):18910–5, 2013.
[160] S. Riniker, N. Fechner, and G. a. Landrum. Heterogeneous Classi fi er Fusion forLigand-Based Virtual Screening: Or, How Decision Making by Committee Can Bea Good Thing. J. Chem. Inf. Model., 53(Ml):2829–2836, 2013.
109
[161] T. Leuchtefeld, A. Maertens, J. M. McKim, T. Hartung, A. Kleensang, and V. Sa-Rochaa. Probabilistic hazard assessment for skin sensitization potency by dosere-sponse modeling using feature elimination instead of quantitative structureactivityrelationships. J Appl Toxicol., 35(11):1261–1371, 2015.
[162] A. A. Onischuk, V. P. Strunin, M. A. Ushakova, and V. N. Panfilov. Studying ofsilane thermal decomposition mechanism. Int. J. Chem. Kinet., 30(2):99–110, 1998.
[163] J. H. Purnell and R. Walsh. The Pyrolysis of Monosilane. Proc. R. Soc. A Math.Phys. Eng. Sci., 293(1435):543–561, 1966.
[164] A. Yuuki, Y. Matsui, and K. Tachibana. A Numerical Study on Gaseous Reactionsin Silane Pyrolysis. Jpn. J. Appl. Phys., 26:747–754, 1987.
[165] M. E. Coltrin, R. J. Kee, and G. H. Evans. A Mathematical Model of the FluidMechanics and Gas-Phase Chemistry in a Rotating Disk Chemical Vapor DepositionReactor. J. Electrochem. Soc., 136(3), 1989.
[166] C. J. Giunta, R. J. McCurdy, J. D. Chapple-Sokol, and R. G. Gordon. Gas–phasekinetics in the atmospheric pressure chemical vapor deposition of silicon from silaneand disilane. J. Appl. Phys., 67(2):1062–1075, 1990.
[167] P. Ho, M. E. Coltrin, and W. G. Breiland. Laser-induced fluorescence measurementsand kinetic analysis of Si atom formation in a rotating disk chemical vapor deposi-tion reactor. J. Phys. Chem., 98(40):10138–10147, 1994.
[168] M. Frenklach, L. Ting, H. Wang, and M. J. Rabinowitz. Silicon particle formationin pyrolysis of silane and disilane. Isr. J. Chem., 36(3):293–303, 1996.
[169] A. Dollet, S. De Persis, and F. Teyssandier. Rate constants of reactions involvingSiH4 as an association product from quantum Rice-Ramsperger-Kassel calculations.Phys. Chem. Chem. Phys., 6(6):1203–1212, 2004.
[170] A. Dollet and S. de Persis. Pressure-dependent rate coefficients of chemical reactionsinvolving Si2H4 isomerization from QRRK calculations. J. Anal. Appl. Pyrolysis, 80(2):460–470, 2007.
[171] C. Newman, H. O’Neal, M. Ring, F. Leska, and N. Shipley. Kinetics and Mechanismof the Silane Decomposition. Int. J. Chem. Kinet., 11:1167–1182, 1979.
[172] M. Bowrey and J. H. Purnell. The Pyrolysis of Disilane and Rate Constants of SileneInsertion Reactions. Proc. R. Soc. A Math. Phys. Eng. Sci., 321(1546):341–359,1971.
[173] R. Becerra and R. Walsh. Mechanism of formation of tri- and tetrasilane in the re-action of atomic hydrogen with monosilane and the thermochemistry of the disileneisomers. J. Phys. Chem., 91(22):5765–5770, 1987.
110
[174] J. D. Chapple-Sokol, C. J. Giunta, and R. G. Gordon. A Kinetics Study of theAtmospheric Pressure CVD Reaction of Silane and Nitrous Oxide. J. Electrochem.Soc., 136(10):2993–3003, 1989.
[175] R. J. McCurdy and R. G. Gordon. Compensating impurities as the limiting factorin atmospheric pressure chemical vapor deposition of a-Si:H from Mg2Si generatedhigher silanes. J. Appl. Phys., 63(9):4669–4676, 1988.
[176] A. G. Baboul, L. A. Curtiss, P. C. Redfern, and K. Raghavachari. Gaussian-3 theoryusing density functional geometries and zero-point energies. J. Chem. Phys., 110(16):7650, 1999.
[177] A. J. Adamczyk, M.-F. Reyniers, G. B. Marin, and L. J. Broadbelt. Exploring 1,2-hydrogen shift in silicon nanoparticles: reaction kinetics from quantum chemicalcalculations and derivation of transition state group additivity database. J. Phys.Chem. A, 113(41):10933–46, 2009.
[178] A. J. Adamczyk, M.-F. Reyniers, G. B. Marin, and L. J. Broadbelt. Kinetics ofsubstituted silylene addition and elimination in silicon nanocluster growth capturedby group additivity. ChemPhysChem, 11(9):1978–94, 2010.
[179] A. J. Adamczyk, M.-F. Reyniers, G. B. Marin, and L. J. Broadbelt. Kinetic corre-lations for H2 addition and elimination reaction mechanisms during silicon hydridepyrolysis. Phys. Chem. Chem. Phys., 12(39):12676–96, 2010.
[180] A. J. Adamczyk, M. F. Reyniers, G. B. Marin, and L. J. Broadbelt. Hydrogenatedamorphous silicon nanostructures: Novel structure-reactivity relationships for cy-clization and ring opening in the gas phase. Theor. Chem. Acc., 128(1):91–113,2011.
[181] H. Emeleus and C. Reid. 220. The pyrolysis of disilane and trisilane. J. Chem. Soc.,pages 1021–1030, 1939.
[182] S. K. Loh and J. M. Jasinski. Direct kinetic studies of SiH3 +SiH3HCCl4SiD4Si2H6,and C3H6 by tunable infrared diode laser spectroscopy. J. Chem. Phys., 95(7):4914,1991.
[183] J. Takahashi, T. Momose, and T. Shida. Thermal Rate Constants for SiH4 −−⇀↽−−SiH3 + H and CH4 −−⇀↽−− CH3 + H by Canonical Variational Transition State Theory.Bull. Chem. Soc. Jpn., 67(1):74–85, 1994.
[184] U. V. Bhandarkar, M. T. Swihart, S. L. Girshick, and U. R. Kortshagen. Modellingof silicon hydride clustering in a low-pressure silane plasma. J. Phys. D. Appl. Phys.,33(21):2731–2746, 2000.
[185] R. Robertson, D. Hils, H. Chatham, and A. Gallagher. Radical species in argon-silane discharges. Appl. Phys. Lett., 43(6):544–546, 1983.
111
[186] Y. Watanabe. Contribution of short lifetime radicals to the growth of particles inSiH4 high frequency discharges and the effects of particles on deposited films. J.Vac. Sci. Technol. A Vacuum, Surfaces, Film., 14(3):995, 1996.
[187] G. A. Petersson, D. K. Malick, W. G. Wilson, J. W. Ochterski, J. A. Montgomery,and M. J. Frisch. Calibration and comparison of the Gaussian-2, complete basis set,and density functional methods for computational thermochemistry. J. Chem. Phys.,109(24):10570–10579, 1998.
[188] L. A. Curtiss, P. C. Redfern, K. Raghavachari, and J. A. Pople. Assessment ofGaussian-2 and density functional theories for the computation of ionization poten-tials and electron affinities. J. Chem. Phys., 109(1):42–55, 1998.
[189] S. Y. Wu, P. Raghunath, J. S. Wu, and M. C. Lin. Ab initio chemical kinetic study forreactions of H atoms with SiH4 and Si2H6: comparison of theory and experiment. J.Phys. Chem. A, 114(1):633–639, 2010.
[190] J. A. Manion, R. E. Huie, R. D. Levin, D. R. Burgess Jr., V. L. Orkin, W. Tsang, W. S.McGivern, J. W. Hudgens, V. D. Knyazev, D. B. Atkinson, E. Chai, A. M. Tereza,C.-Y. Lin, T. C. Allison, W. G. Mallard, F. Westley, J. T. Herron, R. F. Hampson,and D. H. Frizzell. NIST Chemical Kinetics Database, NIST Standard ReferenceDatabase 17, Version 7.0 (Web Version), Release 1.6.8, Data version 2015.12, 2015.
[191] A. D. McLean and G. S. Chandler. Contracted Gaussian basis sets for molecularcalculations. I. Second row atoms, Z=11–18. J. Chem. Phys., 72(10):5639, 1980.
[192] Y. Zhao, B. J. Lynch, and D. G. Truhlar. Development and Assessment of a NewHybrid Density Functional Model for Thermochemical Kinetics. J. Phys. Chem. A,108(14):2715–2719, 2004.
[193] N. L. Arthur, P. Potzinger, B. Reimann, and H. P. Steenbergen. Reaction of H-Atoms with Some Silanes and Disilanes - Rate Constants and Arrhenius Parameters.J. Chem. Soc., Faraday Trans. 2, 85(9):1447–1463, 1989.
[194] N. L. Arthur and L. A. Miles. Rate constants for H + (CH3)4-nSiHn, n = 1–4. Chem.Phys. Lett., 282(January):192–196, 1998.
[195] X. Yu, S.-M. Li, Z.-S. Li, and C.-C. Sun. Direct Ab Initio Dynamics Studies of theReaction Paths and Rate Constants of Hydrogen Atom with Germane and Silane. J.Phys. Chem. A, 104(40):9207–9212, 2000.
[196] L. D. Crosby and H. A. Kurtz. Application of Electronic Structure and TransitionState Theory: Reaction of Hydrogen With Silicon Radicals. Int. J. Quantum Chem.,106(15):3149–3159, 2006.
[197] W. Wang, S. Feng, and Y. Zhao. Quantum instanton evaluation of the thermal rateconstants and kinetic isotope effects for SiH4 + H −−→ SiH3 + H2 reaction in fullCartesian space. J. Chem. Phys., 126:114307, 2007.
112
[198] G. Katzer, M. C. Ernst, A. F. Sax, and J. Kalcher. Computational Thermochemistryof Medium-Sized Silicon Hydrides. J. Phys. Chem. A, 101(21):3942–3958, 1997.
[199] D. G. Goodwin, H. K. Moffat, and R. L. Speth. Cantera: An Object-oriented Soft-ware Toolkit for Chemical Kinetics, Thermodynamics, and Transport Processes,2016. URL http://www.cantera.org.
[200] J. M. Jasinski and J. O. Chu. Absolute rate constants for the reaction of silylene withhydrogen, silane, and disilane. J. Chem. Phys., 88(3):1678, 1988.
[201] F. Neese. The ORCA program system. Wiley Interdiscip. Rev.: Comput. Mol. Sci.,2:73–78, 2012.
[202] J. Pfaendtner, X. Yu, and L. J. Broadbelt. The 1-D hindered rotor approximation.Theor. Chem. Acc., 118(5-6):881–898, jul 2007.
[203] P. L. Bhoorasingh. Automated Calculation of Reaction Kinetics via Transition StateTheory. PhD thesis, Northeastern University, Boston, MA, jun 2016.
[204] C. F. Goldsmith and R. H. West. Automatic Generation of Microkinetic Mechanismsfor Heterogeneous Catalysis. 2017.
[205] A. Laio and F. L. Gervasio. Metadynamics: a method to simulate rare events andreconstruct the free energy in biophysics, chemistry and material science. ReportsProg. Phys., 71(12):126601, 2008.
[206] S. Zheng and J. Pfaendtner. Car-Parrinello molecular dynamics + metadynamicsstudy of high-temperature methanol oxidation reactions using generic collectivevariables. J. Phys. Chem. C, 118(20):10764–10770, 2014.
113
Appendices
114
A SUPPLEMENTARY INFO FOR SOLVATION KINETICS
A.1 Solvation kinetics molecular structure group trees and values
Table A.1: Group tree for hydrogen abstraction reactions
Group label ∆EA
(kJ/mol)# reactionstrained on
L1: X_H_or_Xrad_H_Xbirad_H_Xtrirad_H 1.925 997L2: Xrad_H -0.033 19L2: Xbirad_H -1.305 2
L3: CH2_triplet_H -1.305 2L3: CH2_singlet_H 0L3: NH_triplet_H 0L3: NH_singlet_H 0
L2: Xtrirad_H 0L3: C_quartet_H 0L3: C_doublet_H 0
L2: X_H 0.003 976L3: O_H 0.579 93
L4: O_sec 0.579 93L5: O/H/NonDeC 0.128 51
L6: O/H/Cs\H3 0.252 4L6: O/H/Cs\Cs|H3 0.083 24L6: O/H/Cs\Cs|Cs/H3 0.53 11
L3: Cs_H -0.259 819L4: C_alkane -1.178 154
L5: C_methane -1.606 9L5: C/H3/Cs\H3 -1.302 18L5: C/H3/Cs\Cs|H3 -1.008 60
L4: C/Hx/O -0.158 92L5: C/H3/O -0.004 45
L6: C/H3/O\H 0.363 32L5: C/H2/Cs/O -0.31 47
L6: C/H2/Cs/O\H -0.249 42L6: C/H2/O/Cs\Cs 0
L7: C/H2/O|H/Cs\Cs 0Continued on next page
115
Table A.1 – continued from previous pageGroup label ∆EA
(kJ/mol)# reactionstrained on
L4: C/Hx/Cs\O -0.879 21L5: C/H3/Cs\O 0
L6: C/H3/Cs\O|H 0L5: C/H2/Cs/Cs\O -0.879 21
L6: C/H2/Cs/Cs\O|H -0.879 21L4: C/H3/Cs\Cs|O 0
L5: C/H3/Cs\Cs|O/H 0L4: C/H2/NonDeC_6ring 1.711 1
L3: Cb_H 0L4: Cb/H/Cb 0
L5: Cb/H/Cb/Cb 0L6: Cb/H/Cb/Cb\O 0L6: Cb/H/Cb/Cb\Nb 0L6: Cb/H/Cb/Cb\Cb|Nb 0L6: Cb/H/Cb/Cb\N_pyrrole 0
L3: Cd/H/Cb 0L4: Cd/H/Cb/O 0L4: Cd/H/Cb/Nb 0L4: Cd/H/Cb/N_pyrrole 0
L3: N3_H 0L4: N3s_H 0
L5: N3s/H/Cb/Cb 0L1: Y_rad_birad_trirad_quadrad 0
L2: Y_1centerquadrad 0L3: C_quintet 0L3: C_triplet 0
L2: Y_1centertrirad 0L3: N_atom_quartet 0L3: N_atom_doublet 0L3: CH_quartet 0L3: CH_doublet 0
L2: Y_1centerbirad -2.192 61L2: Y_rad 0.149 936
L3: Y_2centeradjbirad -2.713 24L3: O_rad 1.971 337
L4: O_pri_rad 0.758 79L4: O_sec_rad 2.355 258
L5: O_rad/NonDeO 2.368 210L3: Cs_rad -0.473 282
L4: C_methyl -2.962 64
116
A.2 Script for modifying Chemkin files for solvation kinetics corrections
Included electronically as modifyReactionBarriers.py
A.3 Modified Cantera input file for n-dodecane/ methyl oleate oxidation
Included electronically as liq modified.cti
A.4 Cantera script to simulate liquid fuel oxidation reactor
Included electronically as oxidation.py
A.5 Code for automatic tree building
Included electronically as Building a Tree for Solvation KineticsData using Scikit Learn.ipynb
117
B SUPPLEMENTARY INFO FOR SILICON HYDRIDES
B.1 Geometries of reactants and transition states for hydrogen abstraction reactions
Table B.1: Transition state geometries for silicon hydride hydrogen abstractions. Geome-tries were optimized at M06-2X/6-311+G(3d2f).
Reaction Geometry (xyz)SiH4 + H←→ SiH3 + H2 Si 0.15453 0.00006 -0.00002
H -1.41678 0.00240 0.00055H 0.62492 -0.56812 1.27728H 0.62860 1.38878 -0.14797H 0.62319 -0.82427 -1.12967H -2.62334 0.00039 0.00014
Si2H6 + H←→ Si2H5 + H2 Si 1.08477 -0.21175 0.00000H 1.76873 1.18341 0.00048Si -1.23147 0.06812 0.00000H 1.54266 -0.93570 1.20460H 1.54277 -0.93472 -1.20514H 2.39507 2.29684 0.00003H -1.64097 0.81927 1.20466H -1.91403 -1.24233 -0.00282H -1.64042 0.82406 -1.20185
Continued on next page
118
Table B.1 – continued from previous pageReaction Geometry (xyz)Si2H6 + SiH3 ←→ Si2H5 + SiH4 Si -0.55565 0.74630 -0.57115
H 2.82912 -1.40097 1.04827Si -2.27585 -0.37494 0.52734H -0.67818 0.64407 -2.04227H -0.49801 2.17627 -0.19500Si 2.72787 -0.35594 0.00917H -2.06810 -0.27329 1.98716H -2.25046 -1.80175 0.14252H -3.60717 0.17968 0.19965H 3.22534 -0.86811 -1.28355H 0.98622 0.12059 -0.20376H 3.48999 0.84300 0.41209
Si2H5 + H←→ SiH3SiH + H2 Si 1.09996 -0.23822 -0.10182H 1.92029 1.02459 0.19908Si -1.19412 0.08446 0.01971H 1.64355 -1.36585 0.68301H 2.83364 2.02686 0.17835H -1.63898 0.40468 1.39338H -1.88427 -1.14091 -0.43095H -1.55592 1.20330 -0.87343
Si2H4 + H←→ Si2H3 + H2 Si 0.97302 -0.22919 -0.11437H 1.90926 0.97626 -0.02486Si -1.13964 0.10093 0.07414H 1.55866 -1.43591 0.50100H 2.67744 1.96603 0.49374H -2.07255 -1.04044 0.06669H -1.74016 1.32958 -0.47338
119
B.2 Geometries of silicon hydride species
Table B.2: Geometries of silicon hydride species used to calculate thermodynamics proper-ties and HBI values. Geometries were optimized at G3//B3LYP. (S) and (T) indicate singletor triplet electronic state.
SMILES Geometry (xyz)[SiH2](S) Si -1.43587 0.00246 0.06697
H -1.74175 0.47705 -1.35562H -0.04854 0.64701 0.01782
[SiH2](T) Si -1.33315 0.10876 -0.07282H -1.93728 0.40943 -1.40268H 0.04429 0.60833 0.20466
[SiH2]=[Si] Si -3.60702 1.82709 -0.33428Si -1.49125 1.19881 0.33577H -0.81469 -0.11028 0.07296H -0.51770 2.01734 1.12493
[SiH2]=[SiH2] Si -1.53193 -0.02404 -0.20088Si 0.58853 0.34809 0.13524H -2.04746 -1.40486 -0.00758H -2.19289 0.64071 -1.35459H 1.24951 -0.31663 1.28895H 1.10404 1.72891 -0.05809
[SiH2][SiH3] Si 0.64164 0.63190 -0.07677H 2.12401 0.79844 -0.10622H 0.09716 1.50707 0.99666Si -0.01278 -1.58161 0.31378H 0.09701 1.08711 -1.38474H 0.40941 -2.11507 1.64186H 0.40969 -2.53755 -0.75125
[SiH3] Si 0.61349 0.58486 -0.06567H 2.10205 0.54852 -0.07144H 0.09988 1.44264 -1.16912H 0.09996 1.05209 1.25158
Continued on next page
120
Table B.2 – continued from previous pageSMILES Geometry (xyz)[SiH3][Si][SiH3] Si 1.67253 0.27843 0.63961
H 3.14517 0.40071 0.46884Si 0.63136 1.45854 -1.09549H 1.30287 0.82573 1.97628H 1.31054 -1.16767 0.61548Si -1.67483 1.68344 -1.43917H -2.33983 2.35487 -0.28601H -2.33137 0.36094 -1.64601H -1.88650 2.51252 -2.65597
[SiH3][SiH2][SiH3] Si 1.34707 0.41763 0.52351H 2.83377 0.50351 0.50128Si 0.41113 1.59415 -1.28727H 0.86407 0.97917 1.81566H 0.94075 1.03046 -2.56387H 0.84714 3.02022 -1.22248Si -1.94053 1.50667 -1.32139H -2.40856 0.09624 -1.42074H -2.48341 2.26393 -2.48329H -2.50232 2.09543 -0.07399H 0.95938 -1.01934 0.46801
[SiH3][SiH3] Si 1.25034 0.42662 0.50364H 2.73834 0.48621 0.51730Si 0.39108 1.62122 -1.32867H 0.74771 0.99732 1.78408H 0.84025 -1.00347 0.43617H 0.80116 3.05131 -1.26120H -1.09692 1.56161 -1.34235H 0.89373 1.05052 -2.60911
Continued on next page
121
Table B.2 – continued from previous pageSMILES Geometry (xyz)[SiH3][SiH][SiH3] Si 1.61691 0.33637 0.61343
H 3.09531 0.50916 0.63765Si 0.72147 1.40841 -1.26788H 1.04575 0.94103 1.84764H 1.32225 -1.12597 0.63522Si -1.60568 1.62562 -1.43403H 1.32982 0.90996 -2.53930H -2.13957 2.21855 -0.17765H -2.28972 0.31855 -1.65621H -1.94939 2.52478 -2.56959
[SiH4] Si 1.46180 0.40735 0.51255H 2.94718 0.40735 0.51255H 0.96668 -0.24704 -0.72558H 0.96667 1.80680 0.56489H 0.96667 -0.33770 1.69833
[SiH] Si 0.30787 2.67553 0.05506H -0.20525 2.37574 -1.36497
[SiH][SiH2] Si -2.73075 1.30914 0.07907Si -0.47875 1.78249 0.08754H -2.72465 1.27934 -1.44957H 0.33260 1.85245 1.33822H 0.30767 2.46622 -0.98447
[SiH][SiH3](S) Si -2.71696 1.51477 0.05196Si -0.31118 1.56106 0.04435H -2.68937 1.16938 -1.44010H 0.22225 0.23388 -0.39172H 0.30300 1.87871 1.36442H 0.17481 2.58171 -0.93456
[SiH][SiH3](T) Si -2.53013 1.47300 -0.10995Si -0.19734 1.55835 0.01393H -3.23625 1.16324 -1.39379H 0.42084 0.25197 -0.35026H 0.16206 1.88550 1.41850H 0.36336 2.60747 -0.88406
122
B.3 Largest SiH4 decomposition mechanism
Included electronically as chem annotated.inp and species dictionary.txt.
Converted Cantera input file is included as silane.cti.
B.4 Cantera script for simulating reactor
Included electronically as silane decomp.py
B.5 Code for residence time comparison
Included electronically as Residence Times.ipynb