Neutrinos, Symmetries and the Origin of Matter
João Tiago Neves Penedo
Thesis to obtain the Master of Science Degree in
Engineering Physics
Examination Committee
Chairperson: Prof.a Doutora Maria Teresa Haderer de la Peña Stadler
Supervisor: Prof. Doutor Filipe Rafael Joaquim
Members of the Comittee: Prof. Doutor Gustavo da Fonseca Castelo Branco
Prof. Doutor Ricardo Jorge Gonzalez Felipe
November 2013
“Each piece, or part, of the whole of nature is
always merely an approximation to the complete
truth, or the complete truth so far as we know it.
In fact, everything we know is only some
kind of approximation, because we know
that we do not know all the laws as yet.
Therefore, things must be learned only to be
unlearned again or, more likely, to be corrected.”
– Richard P. Feynman (1918-1988)
The Feynman Lectures on Physics, Vol. I
i
ii
Acknowledgements
This thesis is not a work of mine alone: behind the stage curtain, a larger cast hides. To them I thank
for not only helping me construct a symmetric work, in the Vitruvian sense, but also for keeping me sane
in the process (well, as sane as possible at least).
I would like to start by thanking my supervisor, Professor Filipe Joaquim, for his guidance, patience,
and incessant encouragement. Being one of the few great professors I had, he has been responsible for
introducing me to the world of particle physics research, the do’s and don’ts of the field, and for providing
all the assistance needed in solving problems and answering questions, big or small, without exception.
I am additionally indebted to Fundacao para a Ciencia e Tecnologia (FCT) and Centro de Fısica
Teorica de Partıculas (CFTP), thanking, in particular, the kindness of Claudia Romao, who has lent
a helping hand whenever needed, and the support of Professor Jorge Romao, who worked towards my
participation in the ICTP Summer School on Particle Physics, earlier this year. There, I was given the
chance to learn from the leading experts in the field, and would like to express my gratitude towards
Doctor Alejandro Ibarra for a most helpful lightning-discussion as well as a saving reference.
I would also like to thank my family for their support and encouragement, and for enabling my crazy
endeavors, without understanding, most of the time, what the heck I am doing.
I thank my friends for the chaotic yet enjoyable sequence of events one calls a physics course. A special
mention must be made to Pedro Boavida and Antonio Coutinho, to whom I thank for our spontaneous
(and crucial) discussions about physics, Copernicus and chameleons.
I end by expressing my thanks to Joao Loureiro, who was there even if two thousand kilometers away,
and to Sılvia Conde, for a superposition of all possible reasons.
iii
iv
Este trabalho foi financiado pela Fundacao para a Ciencia e Tecnologia,
sob o projecto PTDC/FIS/102120/2008.
This work was supported by Fundacao para a Ciencia e Tecnologia,
under the grant PTDC/FIS/102120/2008.
v
vi
Resumo
As simetrias como leis de invariancia desempenham um papel fundamental na construcao de teo-
rias fısicas. Em particular, as simetrias de gauge estao na base do presente conhecimento do mundo
subatomico, que assenta no Modelo Padrao da fısica de partıculas. Apesar de repetido sucesso, este
modelo tem que ser necessariamente expandido a luz da existencia de massas e mistura de neutrinos.
Na presente dissertacao sao exploradas extensoes do Modelo Padrao baseadas no mecanismo seesaw
onde a supressao da massa dos neutrinos e naturalmente explicada. Massas de neutrinos nao nulas
conduzem a mistura leptonica, cuja estrutura se aproxima a um padrao tribimaximal, apontando para a
possıvel presenca de simetrias discretas na teoria a altas energias – como a invariancia sob transformacoes
do grupo A4, considerado neste trabalho.
O Modelo Padrao revela-se igualmente insuficiente na explicacao da assimetria barionica do Universo.
Nos modelos seesaw e possıvel gerar dinamicamente essa assimetria atraves dos decaimentos dos novos
estados pesados (fora de equilıbrio termico) mediante o mecanismo de leptogenese, cuja eficiencia e de-
terminada numericamente resolvendo o sistema de equacoes de Boltzmann adequado. Nesta dissertacao,
apresenta-se a analise de um modelo particular para violacao espontanea da simetria CP onde se explicam
as massas e mistura de neutrinos impondo uma simetria discreta A4. A implementacao do mecanismo
de leptogenese neste contexto e discutida em detalhe.
Palavras-chave: Assimetria barionica do Universo; Leptogenese; Massa e mistura
de neutrinos; Mecanismo seesaw; Simetrias; Violacao de CP
vii
viii
Abstract
Symmetries, understood as laws of invariance, play a fundamental role in the development of physics.
In particular, gauge symmetries are at heart of our current understanding of the subatomic world, which
relies on the Standard Model of particle physics. Despite its repeated successes, this model must neces-
sarily be extended to accommodate the experimental observation of nonzero neutrino masses and mixing.
In this thesis, we explore seesaw extensions of the Standard Model, where heavy states mediate
neutrino mass generation and the smallness of these masses is naturally accounted for. Nonvanishing
neutrino masses allow for leptonic mixing, whose structure strongly differs from that of quark mixing.
The closeness of the lepton mixing matrix to the tribimaximal pattern points to the presence of discrete
symmetries in the underlying high-energy theory – such as invariance under transformations of the A4
group, considered in this work.
The Standard Model also fails to provide a satisfactory mechanism for the generation of the baryon
asymmetry of the Universe. A remarkable feature of the seesaw extensions is the possibility that the
out-of-equilibrium decays of the new heavy states are responsible for the dynamical generation of this
asymmetry. This corresponds to the leptogenesis mechanism, whose efficiency is here determined by
numerically solving a system of Boltzmann equations. Additionally, a particular model for spontaneous
leptonic CP violation is analysed where neutrino masses and mixing are explained imposing an A4 discrete
symmetry. The implementation of the leptogenesis mechanism in this context is discussed in detail.
Keywords: Baryon asymmetry of the Universe; CP violation; Leptogenesis; Neu-
trino masses and mixing; Seesaw mechanism; Symmetries
ix
x
Contents
Acknowledgements iii
Resumo vii
Abstract ix
List of Figures xiv
List of Tables xv
List of Abbreviations xvii
1 Symmetries and Asymmetries in Nature 1
1.1 Evolution of the Concept of Symmetry . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.2 Groups and Symmetry . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.3 Symmetry in Physics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
1.3.1 From Classical to Quantum Mechanics . . . . . . . . . . . . . . . . . . . . . . . . . 8
1.3.2 New Kinds of Symmetry . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
1.3.3 The Discrete Symmetries C, P and T . . . . . . . . . . . . . . . . . . . . . . . . . 10
1.3.4 Symmetry Breaking . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
1.4 A Philosophical Interlude . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.5 The Asymmetry of Existence . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
1.5.1 Experimental Evidence . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
1.5.2 The Tuning of Initial Conditions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
1.5.3 The Possibility of a B-Symmetric Universe . . . . . . . . . . . . . . . . . . . . . . 16
2 The Standard Model of Particle Physics and (slightly) Beyond 17
2.1 Recap of the Electroweak Sector of the SM . . . . . . . . . . . . . . . . . . . . . . . . . . 17
2.1.1 Neutral and Charged Electroweak Currents . . . . . . . . . . . . . . . . . . . . . . 19
2.1.2 The Higgs Mechanism . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
2.1.3 Fermion Masses and Mixing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
2.2 Neutrinos Beyond the SM . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
2.2.1 The Neutrino Mass Term . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
2.2.2 The Seesaw Mechanism . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
xi
3 Lepton Mixing and Discrete Family Symmetries 35
3.1 Lepton Mixing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
3.2 Discrete Family Symmetries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
3.2.1 Symmetries of the Mass Matrices . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
3.2.2 Direct vs. Indirect Models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
3.3 An A4 Model with Spontaneous CP Violation . . . . . . . . . . . . . . . . . . . . . . . . . 41
3.3.1 Spontaneous CP Violation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
3.3.2 Neutrino Masses and Mixing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
3.3.3 Nonzero Reactor Neutrino Mixing Angle . . . . . . . . . . . . . . . . . . . . . . . . 47
4 Baryogenesis through Leptogenesis 49
4.1 Topics of Cosmology and Thermodynamics . . . . . . . . . . . . . . . . . . . . . . . . . . 49
4.1.1 Cosmological Inflation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
4.1.2 Equilibrium Thermodynamics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
4.1.3 Expansion, Entropy and Degrees of Freedom . . . . . . . . . . . . . . . . . . . . . 52
4.1.4 Brief Thermal History of the Universe . . . . . . . . . . . . . . . . . . . . . . . . . 53
4.2 The Sakharov Conditions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54
4.3 Is the SM Enough? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
4.4 Thermal Leptogenesis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59
4.4.1 CPT, Unitarity and CP Asymmetries . . . . . . . . . . . . . . . . . . . . . . . . . 60
4.5 Boltzmann Equation(s) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62
4.5.1 Two-body Decays and Inverse Decays . . . . . . . . . . . . . . . . . . . . . . . . . 63
4.5.2 2↔ 2 Scatterings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
5 Type II Seesaw Leptogenesis 69
5.1 Flavoured CP Asymmetries from Triplet Decays . . . . . . . . . . . . . . . . . . . . . . . 69
5.2 Boltzmann Equations for Type II Seesaw . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
5.3 Scattering Reaction Densities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74
5.4 Leptogenesis in an A4 Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
Conclusions 79
Bibliography 88
A Computing Diagrams with Majorana Fermions 89
B Clebsch-Gordan Coefficients for A4 91
B.1 General Description of the Group . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
B.2 Choice of an Explicit 3D Representation . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92
B.3 The Tensor Product Representation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93
B.4 Computing the CGCs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 94
xii
List of Figures
1.1 Selected symmetry drawings of M. C. Escher (1941) . . . . . . . . . . . . . . . . . . . . . 1
1.2 Results of subjecting an artificial “quasi-lattice” based on a Penrose tiling to optical
diffraction (left), obtained by A. Mackay in 1982, and its physical analogue (right): elec-
tron diffraction patterns of an aluminium-based icosahedral quasicrystal, published by D.
Shechtman et al. in 1984 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
2.1 Mass hierarchy of the elementary fermions observed in Nature. Mass values and uncer-
tainties are obtained from J. Beringer et al. (Particle Data Group) 2012 and references
therein (light quarks present the highest relative mass uncertainties) . . . . . . . . . . . . 25
2.2 Exchange interactions which in the effective theory give rise to the Weinberg operator of
(2.45). Seesaw types I and III correspond to the exchange of fermion fields NR and ΣR,
respectively (left diagram), while the type II seesaw mechanism is implemented through
the exchange of scalar fields ∆ (right diagram) . . . . . . . . . . . . . . . . . . . . . . . . 28
2.3 Vertex contributions from the interaction Lagrangian (2.51) for type I seesaw . . . . . . . 29
2.4 Vertex contributions for type II interactions relevant for effective neutrino mass generation 32
3.1 Depiction of lepton mixing for both a normally ordered and an inverted neutrino mass
spectrum, where the global fit data of Table 3.1 has been considered (left), to be compared
with the tribimaximal ansatz (right) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
3.2 Predicted values for neutrino masses as a function of the CP-violating angle β . . . . . . . 46
3.3 Scatter plot of the experimentally allowed regions in the(ε1, ε2
)plane (left), where exact
TBM is seen to be excluded, and corresponding regions of the(JCP, β
)plane (right) . . . 48
3.4 Values for the neutrinoless double beta decay parameter∣∣mee
∣∣, in the exact TBM and
perturbed cases, as a function of β . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
4.1 Brief thermal history of the Universe . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54
4.2 Schematic representation of the vacuum structure of the electroweak theory (left) and
effective diagram for the transition between vacua (right) . . . . . . . . . . . . . . . . . . 57
4.3 Diagram for the expansion of a ‘true vacuum’ bubble . . . . . . . . . . . . . . . . . . . . . 58
4.4 Effect of electroweak sphalerons on the quantum numbers B and L . . . . . . . . . . . . . 60
4.5 Tree-level and one-loop diagrams for the process X → ` ` whose interference generates a
CP asymmetry when compared to the conjugate process . . . . . . . . . . . . . . . . . . . 61
xiii
5.1 Tree-level diagrams for the decays of type II seesaw scalar triplets and one-loop diagrams
contributing to the decay process ∆i → `α `β . . . . . . . . . . . . . . . . . . . . . . . . . 70
5.2 Scalar triplet interactions relevant to the BE out-of-equilibrium analysis, where one con-
siders the diagrams presented for decays, inverse decays and s- and t-channel scatterings
and their charge conjugates, as well as gauge scattering reactions . . . . . . . . . . . . . . 72
5.3 Contours of the (magnitude of the) maximum CP asymmetries in the decays of hierarchical
scalar triplets ∆1,2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76
5.4 Scatter plot of the baryon asymmetry generated in randomly-chosen perturbed versions of
the model, for M1 = 1012 GeV (black) and M1 = 5× 1012 GeV (cyan) . . . . . . . . . . . 77
5.5 Reaction densities normalized to the product H(T )nγ(T ) (left) and evolution of the various
densities considered in the BE network (right) . . . . . . . . . . . . . . . . . . . . . . . . . 78
A.1 Rules for writing propagator, vertex and spinor contributions obtained from the article of
Denner et al. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90
xiv
List of Tables
1.1 Effect of some S4 permutations on an array of different circles . . . . . . . . . . . . . . . . 6
2.1 Number of parameters contained in complex matrices depending on their properties . . . 23
3.1 Global fit results for the three-neutrino oscillation parameters (mass differences, mixing
angles and Dirac phase) and for both ordering possibilities (see text) . . . . . . . . . . . . 36
3.2 Representation assignments of the various fields under the action of the groups A4, Z4,
and gauge SU(2)L×U(1)Y . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
B.1 Character table for the group A4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92
B.2 Explicit decomposition of the elements of A4 in terms of s and t . . . . . . . . . . . . . . 92
xv
xvi
List of Abbreviations
BAU Baryon Asymmetry of the Universe
BBN Big-Bang Nucleosynthesis
BE Boltzmann (transport) Equation
CC Charged Current
CGC Clebsch-Gordan Coefficient
CP Charge Conjugation and Parity
EWPT Electroweak Phase Transition
EWSB Electroweak Symmetry Breaking
FRW Friedmann-Robertson-Walker (metric)
GUT Grand Unified Theory
NC Neutral Current
QCD Quantum Chromodynamics
QED Quantum Electrodynamics
QFT Quantum Field Theory
RIS Real Intermediate State
SM Standard Model
SMC Standard Model of Cosmology
SSB Spontaneous Symmetry Breaking
SUSY Supersymmetry
TBM Tribimaximal
VEV Vacuum Expectation Value
xvii
xviii
Symmetries and
Asymmetries in Nature 1It was from his second visit to the Moorish palace of Alhambra (Granada, Spain), in 1936, that Maurits
C. Escher (1898-1972) drew inspiration to produce sketchbook after sketchbook of patterned drawings in
a style which he called “regular division of the plane” [1]. Unbound by Islamic artistic tradition, which
forbids the depiction of human and animal figures, this Dutch graphic artist, known for his graphical
paradoxes and representation of impossible worlds, was able to produce spectacular and intricate plane
filling motifs as the ones shown in Fig. 1.1.
(a) Symmetry drawing no. 45 (b) Symmetry drawing no. 67 (c) Symmetry drawing no. 88
Figure 1.1: Drawings of M. C. Escher (1941) which exhibit different types of geometrical symmetry. Aside
from translational symmetries, these patterns present (if one ignores colour): a) reflection and four-fold
rotational symmetries, b) glide reflection symmetry, and c) two-fold rotational symmetry.
All M.C. Escher works c© 2013 The M.C. Escher Company - the Netherlands.
All rights reserved. Used with permission. www.mcescher.com
Escher shared his early attempts at plane division with his brother, a geologist at the University of
Leiden, who referred him to the existing work of crystallographers and mathematicians [2]. These were
people who systematically studied the regularities of the flat worlds Escher was trying to recreate. The
artist’s “regular division of the plane” is associated to the mathematical concept of tessellation – which
corresponds to the tiling of a two-dimensional surface – and refers, in particular, to periodic tilings with
translational symmetry in two independent directions. He became especially interested in a 1924 article
by George Polya [3], in which the author proves that any of these tilings can be taken to belong to one
of 17 mathematical classes, corresponding to the so-called wallpaper groups.
1
Figure 1.2: Results of subjecting an artificial “quasi-lattice” based on a Penrose tiling to optical diffraction
(left), obtained by A. Mackay in 1982 [4], and its physical analogue (right): electron diffraction patterns
of an aluminium-based icosahedral quasicrystal, published by D. Shechtman et al. in 1984 [5]. Shechtman
was awarded the 2011 Chemistry Nobel prize for the discovery of quasicrystals.
A wallpaper group is defined by the possible operations one can apply to the aforementioned tilings
while leaving them invariant, i.e. indistinguishable from themselves prior to performing the operation.
This classification of tilings is thus based on the symmetries they possess. The operations are selected
from the set of Euclidean plane isometries, which comprises (aside from translations) rotations, reflections
and glide reflections. If the restriction of demanding translational symmetry in two independent directions
is removed, one might construct tilings which present no periodicity at all, such as Penrose tilings. Yet
may still present rotational and reflection symmetries.
In nature, one finds that crystals can be classified according to a three-dimensional generalization of
the wallpaper groups – the space or Federov groups – of which 230 exist. This classification, of great
use to crystallographers, is possible since perfect crystals present the property of translational invariance
in three independent directions in space. These systems are thus the par excellence example of how
symmetries are present in the physical world: the properties of crystals signal the presence of regularity
at the most fundamental level of matter.
There is, once more, the possibility of constructing nonperiodic physical structures, known as qua-
sicrystals, for which the demand of translational invariance has been lifted. These objects can possess
rotational symmetries not allowed1 in periodic crystals, such as eightfold [6], tenfold (see Fig. 1.2), and
twelvefold [7] rotational symmetries. Examples of symmetry in biological systems proliferate: consider,
for instance, the structure of honeycombs or the fivefold rotational symmetry than can be seen in a
horizontally cut apple. Humans also tend to link symmetry with beauty, be it associated with works of
architecture or the bilateral symmetry of a person’s face [8].
So far we have only considered a geometrical notion of symmetry, associated with regularity. This
corresponds to the layman’s understanding of the word and one might wonder how one got to it.
1In the 2D wallpaper groups and the 3D Federov groups, the only possible rotational symmetries are twofold, threefold,fourfold, and sixfold rotational symmetries.
2
1.1 Evolution of the Concept of Symmetry
The term ‘symmetry’ has its origins in the Ancient Greek word συµµετρια, which results from the
fusion of συν (with) and µετρoν (measure) [9]. As suggested by its etymology, this term was used to
represent a notion of commensurability, the possibility of measuring using a common standard, which
translates into the presence of integer-based proportion relations.
In addition to the mathematical meaning of commensurability, where no subjectivity is implied, the
word συµµετρια became generally associated with beauty, harmony and unity. This is the meaning
which Plato ascribes to the word while referring to the human body in his dialogue Timaeus (c. 360 BC).
It is also in this dialogue that the Greek philosopher theorizes about the nature of the classical elements
– fire, water, air, and earth – which are, based on their properties, associated to four out of five possible
convex regular polyhedra. These are the so-called Platonic solids and Plato’s association is based on the
beauty he sees in them. Jumping forward to the sixteenth century, we see a revival of the attempt of
using platonic solids to describe nature in Johannes Kepler’s Mysterium Cosmographicum (1596). The
solids are now used to describe the geometry of planetary orbits, furnishing Kepler with a reasonable
approximation of the ratios between the radii of orbits, which were first taken to be circular. Despite
Kepler’s appeal to harmony, there is, however, no occurrence of ‘symmetry’ in his work.
Tracing the evolution of the meaning of ‘symmetry’ has proven to be a difficult endeavour to historians
since one is tempted to reinterpret scientific documents in light of a modern view of the concept. G.
Hon and B. Goldstein discriminate between two parallel paths in the evolution of the term, namely a
mathematical path vs. an aesthetic one [10]. In fact, unlike the Greek case in which the two meanings
of συµµετρια were present in a unique word, in Latin these were separated into commensurabiles and
symmetria. The former term was used exclusively in a scientific context, allowing us to follow the
mathematical path, while the latter pertains to a general, aesthetic notion of being well-proportioned,
corresponding to an altogether different evolution. In the remainder of this section, one follows Hon and
Goldstein’s historiographical work.
Concerning the mathematical path, one recognizes the usage of συµµετρια (and derived forms) as
commensurability in ancient works such as Plato’s Theaetetus (c. 369 BC), Aristotle’s Nicomachean
Ethics (c. 350 BC), and Euclid’s Elements (c. 300 BC). Indeed, in the Latin editions of these works,
the term συµµετρια would be translated to commensurabiles. Such is the case with Isaac Barrow’s
(1630-1677) edition of Euclid’s Elements, published in 1655. Barrow sets apart the two meanings of
συµµετρια, showing that they were understood at the time.
Looking into the aesthetic path, where judgement is key, one finds that συµµετρια was used to
describe ‘proper’ proportion, be it in Plato’s Timaeus, where ‘proper’ refers to beauty, in Aristotle’s
Nicomachean Ethics, where certain occurrences of συµµετρια are associated with moderation, or in
Ptolemy’s Almagest (c. 150), where it is used to convey the idea of suitability. It was Vitruvius who
coined the Latin term symmetria to characterize an entity, such as a building or a machine, whose parts
are joined gracefully by imposing well-chosen proportion relations between the parts themselves and
between the parts and the whole. This is how Vitruvius’ transliteration of the Greek word is understood
3
in the context of his theory of architecture, presented in his De architectura (c. 15 BC): a property of
a beautiful, well-coordinated unity. Claude Perrault (1613-1688), who translated the De architectura to
French in 1673, separated Vitruvius’ use of symmetry, which he takes to simply pertain to proportion,
from the common meaning of the word in seventeenth-century France. This common usage was associated
to a kind of geometrical correspondence, such as that which relates the disposition of windows between
the left and right sides of a building’s facade, and it represents an underdeveloped version of what we
today understand as mirror symmetry.
Further uses of symmetry in scientific literature are found to be technical extensions of the concept
of Vitruvius, such as Carl Linnaeus’ (1707-1778) use of the word connected to the idea of functionality
in his classification of plant species or Hauy’s formulation of a ‘law of symmetry’ in the context of
crystallography. The aesthetic path turns into a scientific one. Rene-Just Hauy (1743-1822), a French
mineralogist responsible for important steps in the mathematization of crystallography, initiated the
systematic use of the term ‘symmetry’ in that field. He gives, however, no definition of the term, which
is taken from context to refer to the geometry of the crystal, specifically to the relative disposition of its
facets. What he calls his ‘law of symmetry’ corresponds to the fact that certain rotations of a crystal
yield identical views. This hints towards the modern, geometrical notion of symmetry presented in the
beginning of the current chapter.
Other rare occurrences of the word symmetry in eighteenth-century documents have been recorded
in works of physics such as those by Henri-Louis Duhamel du Monceau (1700-1782) and Gaspard Monge
(1746-1818) regarding ship design and construction. Duhamel du Monceau and Monge appeal, respec-
tively, to equilibrium and to a bilateral equivalence between both sides of a vessel, but give no definition
of symmetry.
Hon and Goldstein argue that it was not until the work of Adrien-Marie Legendre (1752-1833) that
a turning point occurred. His studies of solid geometry in 1794 led him to explicitly define symmetry as
a relation – not a property of a whole – between two solids which are, in our modern terminology, each
other’s mirror image. Solids related in this way cannot be rotated in three-dimensional space such that
they are made to coincide. Even though Legendre gave no reason for his choice of the word symmetry,
one might conceive a link between his definition and Perrault’s geometrical correspondence.
It was after Legendre’s introduction of symmetry in solid geometry that adoption of term gradually
flourished in distinct domains of science. Andre-Marie Ampere (1775-1836) imported Legendre’s defini-
tion into chemistry, while other scientists, who were responsible for the introduction of the concept in
their respective fields, gave distinct, parallel meanings to symmetry. Pierre-Simon Laplace (1749-1827)
introduced it in probability to characterize a sequence of well-ordered events (symmetry as regularity),
while Sylvestre Lacroix (1765-1843) took symmetry in algebra to signal invariability of a function re-
garding permutation of its roots (symmetry as invariance). There is a close relation between Lacroix’s
definition and the way ‘symmetrical function’ is understood in current technical usage. This plurality of
meanings was often taken in stride. Such is the case with Augustin-Louis Cauchy (1789-1857), who used
the term in both algebra and geometry without commenting on the different meanings it was given.
4
A unification of different meanings was accomplished through the group-theoretical definition of sym-
metry. Group theory owes its development to the work of researchers such as Evariste Galois (1811-1832),
Marius Sophus Lie (1842-1899), and Christian Felix Klein (1849-1925). It is concerned with the study of
groups, an algebraic structure to be explored in the following section. The concept of group allows one to
tie togheter the various meanings of symmetry through the idea of invariance under specified operations.
The symmetries of a geometrical figure are thus identified with the possibility of invariance of said figure
under, say, reflections or rotations. The operations need not be geometrical in nature, and thus symmetry
is generalized, in modern science, beyond geometry.
1.2 Groups and Symmetry
In the context of abstract algebra, a group (G, ·) is defined by both an operation ‘·’, designated by
group multiplication or product, and a set2 G of elements g. The product, not necessarily commutative,
of two group elements g ∈ G and g′ ∈ G can thus be denoted by g · g′ or simply by g g′. If the set G is
finite, the group itself is said to be finite or discrete. For an uncountable number of elements the group is
said to be continuous. If, additionally, the group product is commutative, the group is said to be Abelian.
To complete this definition, the elements of G must satisfy the four following axioms:
• Closure: If g, g′ ∈ G and g′′ = g g′, then g′′ ∈ G.
• Identity: There exists an element e ∈ G such that g e = e g = g for every g ∈ G.
• Inverse: For every g ∈ G, there exists an element g−1 ∈ G, such that g g−1 = g−1g = e.
• Associativity: For every g, g′, g′′ ∈ G, the relationship (g g′)g′′ = g(g′g′′) holds.
To relate this abstract definition with geometrical symmetry it suffices to say that the elements of the
group are made to correspond to geometrical operations – such as rotations, reflections or translations
– and the group multiplication denotes composition of operations (associativity is guaranteed). There
should be no confusion between the operations which belong to G and the operation of group multipli-
cation, necessary for defining a group. The group property of closure implies that the composition of
two geometrical operations is itself a geometrical operation. The identity element e corresponds to per-
forming no operation whatsoever, while the existence of an inverse element tells us that for a geometrical
operation, there exists another which cancels its effect.
By looking at one of Escher’s symmetry drawings of Fig. 1.1, one can identify the operations which
leave it invariant (see caption) and thus determine to which wallpaper group it belongs3. The observed
properties of invariance of the system (the drawing), namely the geometrical operations which leave it
indistinguishable from its previous self, define the group.
The converse way of thinking is to specify the group and look for how the system must ‘respond’
to fulfil the conditions of invariance imposed by each group element. Take, for example, the group of
permutations of n elements, denoted Sn (non-Abelian for n > 2). Each of the group elements is a
permutation operation on some abstract space of n objects, such as a queue of n people, or a list of the n
2Often, this set-operation pair description is omitted and the group is identified with the set that defines it.3In the language of the wallpaper groups, Escher’s drawings correspond (if one again ignores colour) to the groups known
as: a) p4g, b) pg, and c) p2.
5
e (12) (13) (24) (234) (432) (132) (1234) (1324) (12)(34) (13)(24) (14)(23)
Table 1.1: Effect of some elements of the permutation group S4 on an array of different circles. Permu-
tations are written in the cycle notation. For example, (123) reads “1 goes to 2, 2 goes to 3, and 3 goes
to 1”, and (13)(24) corresponds to “1 goes to 3, 3 goes to 1, 2 goes to 4, and 4 goes to 2”.
arguments of a function. If one is given the function f(x, y, z) = a1 x2 +a2 y
2 +a3 z2 +b xy+c xyz, where
x, y, z are real variables and ai, b, c ∈ C are taken to be constant, imposing invariance under permutation
of the arguments of f , i.e. imposing invariance under S3, translates into demanding a1 = a2 = a3 and
b = 0 (no restriction arises for c). No exchange of arguments can change the value of the function.
To proceed, it is useful to define what is known as a representation of the group – a mapping between
each element ofG and ‘something’ that carries out the operation. For a drawing of Escher, this ‘something’
could be the command “Take the picture and carefully rotate it by 90 degrees!”. In a mathematical
language, group elements g are mapped into matrices Ug which act as linear transformations on some
vector space. In order to define a representation, the matrices must obey the relation Ug Ug′ = Ug g′ .
The vectors which belong to the vector space correspond to parts of the system under study.
To understand this, take the example of the group S4. Like all permutation groups, S4 is a discrete
group: it has a finite number of elements, #S4 = 4! = 24. Each element of S4 can be denoted, in what is
called the cycle notation, by a collection of number sequences separated by parenthesis. Each sequence
of numbers determines which objects to cycle. This notation can be clarified by looking at Table 1.1,
where some S4 permutations are applied to an array of four different circles. Since the notation relies on
cycles, one has, for example, the equivalence (432) = (324) = (243). The full set can be written as:
S4 = e, (12), (13), (14), (23), (24), (34), (12)(34), (13)(24), (14)(23), (234), (243), (134),
(143), (124), (142), (123), (132), (1234), (1243), (1324), (1342), (1423), (1432).(1.1)
A natural way of representing the group is to map each element g ∈ S4 into a 4× 4 matrix Ug in the
way suggested by the following examples (as one might guess, Ue = 14×4):
U(12)(34) =
0 1 0 0
1 0 0 0
0 0 0 1
0 0 1 0
, U(1234) =
0 0 0 1
1 0 0 0
0 1 0 0
0 0 1 0
, U(243) =
1 0 0 0
0 0 1 0
0 0 0 1
0 1 0 0
. (1.2)
These matrices form a four-dimensional representation of S4 and act on the space (C4) spanned by
the following vectors:
|1〉 ≡
1
0
0
0
, |2〉 ≡
0
1
0
0
, |3〉 ≡
0
0
1
0
, |4〉 ≡
0
0
0
1
. (1.3)
6
The choice of representation matrices of Eq. (1.2) becomes clear if one considers their action on the
vectors of (1.3). In looking for an object which is invariant under the group S4, one finds that multiples
of the vector |1〉+ |2〉+ |3〉+ |4〉 are clearly left unchanged after the action of any group element4, which
is to say after |i〉 → Ug |i〉, ∀ g ∈ S4. Other invariant expressions can be constructed, a simple example
being the scalar 〈y|x〉 = y∗i xi, where |x〉, |y〉 ∈ C4, i.e. |x〉 = xi|i〉, |y〉 = yi|i〉 and xi, yi ∈ C. This is
so since U†g = UTg = (Ug)
−1 = Ug−1 , and so 〈y|x〉 → 〈y|U†gUg|x〉 = 〈y|U(g−1g)|x〉 = 〈y|Ue|x〉 = 〈y|x〉.The operations act on parts of the system and, for a symmetric one, they cancel out. In fact, the above
example of a S3-symmetric function can be recast in this new language of representations. One rewrites
the function as f(x, y, z) = Aij rirj + Cijk rirjrk, with ~r = (x, y, z), the only nonzero entries of Aij and
Cijk being Aii = a for each i and C123 = c. Suppose now that ~r transforms according to5 the natural 3D
representation of S3, meaning that under the action of a certain g ∈ S3, ~r transforms as ri → (U′g)ij rj .
Here, the matrices U′g form a representation of S3 analogous to the one given for S4 in (1.2). One notices
that, given the structure of Aij and Cijk, the function remains unchanged under the action of any group
element, as expected.
A remark should be made regarding group construction. It is possible to take a group G and reduce the
set that defines it in such a way that the remaining structure H is still a group – H is said to be a subgroup
of G. S3 is a subgroup of S4 obtained by restricting ourselves to three objects instead of four, removing
all permutations which would affect that fourth object. Another subgroup of interest to us is A4, which is
obtained from S4 by keeping only elements, denoted even permutations, which can be decomposed into a
product of an even number of ‘pair switches’ – permutations which switch only two objects. Any element
of Sn can be decomposed into a product of ‘pair switches’: e.g. (1234) = (14) · (13) · (12), where we stress
that the present convention implies combining permutations from right to left (first we apply (12), then
(13), and finally (14)). The full set of even permutations of four elements is (#An = #Sn/2⇒ #A4 = 12):
A4 = e, (234), (243), (134), (143), (124), (142), (123), (132), (12)(34), (13)(24), (14)(23). (1.4)
To give an example concerning continuous groups, consider SO(n), the group of rotations in n dimen-
sions, of which SO(m) with m < n is trivially a subgroup (rotations in a plane can be seen as a subset
of rotations in 3D space). These groups are naturally represented by orthogonal rotation matrices.
One can also build groups by joining rather than reducing them. Starting from two groups G1 and
G2, one can construct a third group by considering their direct product, G3 = G1×G2. Here, the set G3
is indeed constructed through the Cartesian product of the others: each g3 can be identified by a pair
(g1, g2) and one has g3 · g′3 = (g1 · g′1, g2 · g′2). Demanding invariance of a system under G3 corresponds to
independently demanding its invariance under G1 and G2.
4This means that there is a subspace of C4 which is invariant under the action of these matrices, and thus there existsa basis in which all matrices Ug take the form:
Ug =
1 [Ug
] .
The matrices Ug correspond to a 3D representation in their own right, and one has thus decomposed (through a change ofbasis) our natural 4D representation into two of smaller dimension. When further reduction is impossible, one is left withso-called irreducible representations (or irreps.) along the diagonal.
5It is common, yet misleading, to phrase this as ~r ‘is in’ or ‘belongs to’ the representation under discussion.
7
To give an example of how the direct product works, consider the Abelian group Zn, which corresponds
to the set a, a2, . . . , an−1, an = e, where the relations between group elements are made explicit. The
direct product Z2 × Z2 produces a group with elements (e1, e2) ≡ e, (a, e2) ≡ a, (e1, b) ≡ b, (a, b) ≡ ab,where one has denoted the elements of the first Z2 by e1, a and the elements of the second Z2 by
e2, b. Although this group presents four elements it is not isomorphic6 to Z4, since a2 = b2 = ab2 = e,
corresponding to the only other possible group with four elements, the Klein group K ≡ Z2 × Z2.
1.3 Symmetry in Physics
1.3.1 From Classical to Quantum Mechanics
Symmetry, taken henceforth to signify invariance under specified operations, has been instrumental
to the development of modern physics. This group-theoretical notion was first imported from crys-
tallography into physics thanks to Pierre Curie who, citing the work of Auguste Bravais and Evgraf
Federov, investigated the connection between symmetries and physical phenomena in his 1894 paper Sur
la Symetrie dans les Phenomenes Physiques [11]. In this work, Curie formulated what is now known as
Curie’s Principle, which recognizes that the symmetries of a physical medium restrict which phenomena
can occur. Hence, to allow certain phenomena, the presence of certain asymmetries is required.
By looking at what one considers to be the laws of nature, one can identify symmetries of nature (or,
as Eugene Wigner called them, laws of invariance). According to Wigner [12], it was Einstein’s work on
special relativity in 1905 that brought the “reversal of a trend”: instead of reading off laws of invariance
from laws of nature, it became natural to establish laws of invariance and from there go on to derive
the laws of nature. Indeed, special relativity admits invariance under the Poincare group (also called
the inhomogeneous Lorentz group) which comprises Lorentz boosts, translations in space, translations in
time, and spatial rotations7. One can regard these invariances as a fundamental part of the structure of
the theory, the last three corresponding respectively to homogeneity, uniformity in time, and isotropy of
space (Wigner’s “older principles of invariance”).
Laws of invariance in theories described by the Lagrangian formalism can be associated with the
conservation of a physical quantity through Emmy Noether’s first theorem (proved in 1915 and published
in 1918 [13]). Invariance under translations in space is thus linked to the conservation of linear momen-
tum, invariance under translations in time to the conservation of energy, and invariance under spatial
rotations to the conservation of angular momentum. Additionally, an interesting connection between
symmetries and non-observable quantities is given by T.D. Lee [14]. Invariance under space translations,
time translations, or rotations implies that no absolute, preferred position, time, or spatial direction can
be observed, respectively.
6Two groups (G, ·) and (G′, ?) are said to be isomorphic, (G, ·) ∼= (G′, ?) if there exists a map ϕ : G→ G′ such that, forevery g1, g2 ∈ G, both ϕ(g1 · g2) = ϕ(g1) ? ϕ(g2) and ϕ(g1) = ϕ(g2) ⇒ g1 = g2 hold. An isomorphism denotes a strongequivalence which translates into saying that the two groups are essentially the same.
7Technically, we are restricting ourselves to transformations Λ which can be connected continuously to the group identity,ignoring time and space inversions for which det Λ = −1. Although these are isometries of Minkowski space, they mightnot be symmetries of a quantum field theory.
8
The advent of (non-relativistic) quantum mechanics brought with it the study of symmetries in a
novel context. Wigner pioneered this study, introducing his eponymous theorem (1931) [15] which states
that symmetry transformations U must be either unitary or antiunitary. This can be seen by taking
quantum states |φi〉, which transform as |φi〉 → |φ′i〉 = |Uφi〉, and noticing that by demanding |〈φ′i|φ′j〉| =|〈φi|φj〉| ∀ i, j one obtains that either 〈Uφi|Uφj〉 = 〈φi|φj〉 (U is a unitary operator) or 〈Uφi|Uφj〉 =
〈φj |φi〉 (U is an antiunitary, nonlinear operator). For infinitesimal transformations, parametrized as
U(ε) = 1 + i ε G (G is denoted the transformation generator), demanding unitarity – the majority
of operators of interest are unitary – implies that the generator is Hermitian, G† = G, allowing it
to be an observable. In the limit of a finite transformation, U(α) = limn→+∞ U(α/n)n, one obtains
U(α) = exp(i α G). In either case, the transformation operator U must commute with the system’s
Hamiltonian, [U , H] = 0, since a proper symmetry should not be spoilt by dynamics. This means that
G also commutes with H, which can be seen in Heisenberg’s representation to directly imply that G is a
conserved quantity, in line with Noether’s theorem.
1.3.2 New Kinds of Symmetry
A shift from spacetime to other kinds of symmetries eventually occurred. Such new symmetries
included permutation symmetry, which was introduced by Heisenberg and pertains to the indistinguisha-
bility of particles, the charge conjugation symmetry C (such discrete symmetries will be addressed in
the following section), and the so-called internal symmetries, such as isospin symmetry. Also due to
Heisenberg, isospin symmetry refers to the invariance of strong interactions when one transforms the
doublet (proton,neutron)T through an element of SU(2). U(n) is the (continuous) group of n×n unitary
matrices and SU(n) is the subgroup of U(n) obtained by keeping only matrices with unit determinant.
These transformations represent rotations in an abstract, ‘internal’ space. Disregarding electromagnetic
interactions and small mass differences, the validity of isospin symmetry allows one to consider protons
and neutrons as different states of the same particle – the nucleon. SU(2) rotations mix these two states
while leaving strong interactions unchanged. It is worth noting that isospin transformations are inherently
global: the same SU(2) rotation is applied in all points of space at the same time [16].
A generalization can be made from global to local symmetries, where the above abstract rotations can
be chosen to differ from point to point. Field theories which are based on invariance under internal, local
transformations are called gauge theories. In this context, the transformations themselves are denoted
gauge transformations. One of the simplest examples of a gauge theory is quantum electrodynamics
(QED), the theory which describes the interactions between light and electrically charged matter. The
corresponding gauge symmetry is U(1) or local phase invariance, which corresponds to admitting invari-
ance under transformations of the charged matter fields of the form ψ(x)→ ei q α(x) ψ(x), where q is the
electric charge associated with the field and α(x) is an arbitrary function of space and time. The matter
fields are said to belong to the defining representation of U(1). The introduction of a so-called gauge
boson with certain transformation properties, in this case the photon field, is crucial to ensure gauge
invariance of the QED Lagrangian. The idea of gauge invariance was first put forward by Hermann Weyl
9
in 1918 [17], as an (ultimately unfruitful) extension of Einstein’s work on general relativity8. It was only
in 1929 that Weyl established a connection between electromagnetism and local phase invariance [18].
Field theories based on local invariance under the non-Abelian gauge group SU(n) are known as
Yang-Mills theories and are at the heart of our current understanding of the subatomic world. They owe
their name to C. N. Yang and Robert Mills who, in 1954, explored the consequences of turning isospin
symmetry into a local symmetry [19]. In such a transition, several incompatibilities with experimental
evidence arise, namely the appearance of several gauge bosons which are massless, leading to forces with
an infinite range, and self-interacting, due to the non-Abelian character of the underlying theory [9, 16].
In 1961, S. Glashow put forward a proposal for the unification of the electromagnetic and weak
interactions based on the enlarged gauge group SU(2)×U(1) [20]. In this electroweak theory, whose
renormalizability (which translates into its calculability) was an issue, mass terms for the gauge bosons
had to be put by hand, explicitly invalidating the symmetry. Glashow’s theory was later independently
extended by S. Weinberg [21] and A. Salam [22] to include the Higgs mechanism (see Section 1.3.4).
Proof that this new theory was renormalizable was given by Gerard ’t Hooft in 1971 [23], while working
under Martinus Veltman. For their scientific contributions, Glashow, Weinberg and Salam were awarded
the 1979 Nobel Prize in Physics, while Veltman and ’t Hooft were similarly honoured in 1999.
The idea of invariance under a group of gauge transformations is a fruitful one. Quantum chromo-
dynamics, the standard theory of the strong interaction, is a gauge theory based on the group SU(3).
Electroweak and strong interactions are unified in the Standard Model of particle physics, which will be
presented in the following chapter. Additional examples of the use of symmetries in modern physics in-
clude grand unified theories (GUTs), based on groups which include the Standard Model as a subgroup,
supersymmetry (SUSY), where a symmetry between bosons and fermions exists, and discrete flavour
symmetries, to be explored in Chapter 3.
1.3.3 The Discrete Symmetries C, P and T
In quantum field theory (QFT), a special status is given to the discrete symmetries under the oper-
ations of parity (P), time reversal (T) and charge conjugation (C). The first two operations correspond
to spacetime symmetries already present in classical mechanics which were imported by Wigner into the
quantum context. The last operation, charge conjugation, presents no classical analogue. In light of
Wigner’s theorem, P and C correspond to unitary operators, while T can only be implemented as an
antiunitary operator [24].
A parity transformation corresponds to an inversion of all three spatial coordinates, (x, y, z) →(−x,−y,−z). This inversion can also be obtained by changing the sign of one coordinate and per-
forming a 180o rotation, hence the association of parity to reflections and mirror symmetry. A time
reversal transformation corresponds to a change in the sign of the time coordinate, t→ −t. Both parity
and time reversal correspond to improper Lorentz transformations (see Footnote 7).
8Weyl considered the possibility of arbitrary and local rescalings of the spacetime metric, which correspond to the changeof a local unit length or gauge, hence the current usage of the latter term [9].
10
Concerning parity, one might ask what goes on the other side of the mirror. As Lewis Carroll’s Alice
puts it, “things go the other way”: a right hand is converted into a left hand and vice-versa. One is
thus confronted with the concept of chirality, which corresponds to Legendre’s use of the word symmetry,
presented in Section 1.1. A object which differs from its mirror image is said to be chiral. The question is
whether or not nature cares about chirality at all. In fact, while neither humans nor cars are bilaterally
symmetric, there seems to be no reason to expect that their mirror images9, with inverted organs and
engines, represent an impossible biology or a faulty vehicle. The chirality of people and cars are considered
to be accidents of evolution and design [26] and parity is still expected to be a symmetry of microphysics.
However, reality defies intuition: parity is not a symmetry of weak interactions. The idea that this
might be so was put forward by T. D. Lee and C. N. Yang in 1956 [27], who proposed an experimental
test to parity by considering the beta decay of cobalt-60. This test, performed in 1957 by C. S. Wu et
al. [28], confirmed their hypothesis: there is a preferred direction for the emission of β radiation.
To clarify what is meant by saying that parity is not a symmetry of nature or, equivalently, that
parity symmetry is violated, consider the following dialogue:
Alice: Imagine I have a ball rotating ‘clockwise’ and invert the axes. I then get a ball rotating
‘counterclockwise’. Is this what parity violation means?
Bob: No, not at all. That’s just a matter of description.
Alice: How so?
Bob: Well, the ball is rotating in a certain way. Whether it is clockwise or counterclockwise
depends on your perspective, on how you define these words.
Alice: I see. Then what does parity violation mean?
Bob: Simple – in its extreme version it means that it is not possible to have the ball rotate in
the other direction. Nature just doesn’t allow it.
Alice: So in a frame where I say it rotates clockwise, no counterclockwise balls can be seen?
Bob: And vice-versa. Exactly.
Regarding time reversal, few observed macroscopic phenomena, like a performance of Bach’s crab
canon, exhibit a symmetry under such operation. However, the irreversibility of processes like burning a
piece of firewood answers to a thermodynamical arrow of time, which is dependent on macroscopically
probable configurations of a system and does not imply microscopical irreversability. In fact, the laws of
classical mechanics are T-symmetric. Nevertheless, nature is not classical, but quantum, and one begs the
question of whether time reversal is a symmetry of the laws of physics. Indeed, compelling experimental
evidence that time reversal symmetry is violated has been found in B-meson decay chains by the BaBar
collaboration [29]. As we shall briefly see, there is a fundamental link between T, P and C.
Charge conjugation is defined in the context of relativistic quantum mechanics and refers to the
exchange of particles and antiparticles. The latter correspond to the physically meaningful states resulting
from reinterpreting the negative energy solutions of the Dirac equation as positive energy solutions with
opposite U(1) charges. Like parity, charge conjugation is a symmetry of electromagnetic and strong
9To consider the mirror image of the world, one must go as far as the molecular level, since certain molecules, likesugars or gasoline, are chiral. The interesting phenomenon of tunnelling between molecular chiral states was investigatedby Friedrich Hund in 1927 [25].
11
interactions, but not of weak interactions. In general, both C-symmetry and P-symmetry are violated
by the same processes while the product CP of these operations seems to be a valid symmetry. This
suggests that it is CP and not C or P individually that constitutes a symmetry of the physical world.
Alas, this is not strictly true: in the neutral kaon system – the study of which owes much to the work
of Murray Gell-Mann and Abraham Pais [30] – a small but nonzero departure from CP-symmetry was
found in 1964 [31] and earned James Cronin and Val Fitch the 1980 Nobel Prize in Physics.
There is still hope for salvaging the fundamental role of the above discrete symmetries in physics if one
considers the CPT theorem, whose discovery and proof in the 1950s is credited to J. Schwinger [32], W.
Pauli [33], J. Bell [34], and G. Luders [35]. This theorem states that a combination of the three presented
discrete symmetries should be a symmetry of any reasonable quantum field theory, i.e. a theory which
has Lorentz invariance, positive energy and local causality [36]. Thus, since CPT is conserved10, CP
violation and T violation are fundamentally connected: one demands the other. So far, no deviation
from CPT-symmetry has been observed [29,37].
1.3.4 Symmetry Breaking
Symmetries can be defined as either exact, meaning unconditional validity, approximate, meaning valid
only under certain conditions, or broken [9]. Symmetry breaking refers to the destruction of symmetry,
the transition (even if abstract) between a situation where the invariance exists to one where it does not.
This ‘break’ can be said to occur explicitly or spontaneously. Let us first consider the former case.
Explicit symmetry breaking refers to a destruction of symmetry that can be traced to the physical
law, in particular, to the presence of Lagrangian terms which spoil the invariance. These terms can arise
either by construction, anomalously, or through higher-order effects. An example of symmetry-breaking
terms which arise by construction corresponds to the parity violating structure of weak interactions.
The transition from a classical description to a quantum one can be responsible for the appearance
of so-called anomalous terms in the Lagrangian which can arise, for example, from the regularization
procedure. Finally, symmetry breaking terms may arise due to non-renormalizable effects. To understand
this, consider a field theory which provides an effective description of reality, meaning it approximates
a broader theory at low energies: symmetries present in the effective theory might not be symmetries
of the high-energy theory. Symmetries which, although not postulated, are found to be present in the
theory and can be broken by quantum corrections or non-renormalizable effects are dubbed accidental.
Accidental symmetries of the SM include the global baryon number (B) and lepton flavour number (Li)
symmetries. These are anomalously broken (see Section 4.3), while B−L (L =∑i Li) is not. However, in
seesaw extensions of the SM (see Section 2.2), the accidental B−L is broken by the non-renormalizable
Majorana mass terms for neutrinos, which arise from the interactions of heavy states in the high-energy
theory.
Spontaneous symmetry breaking (SSB), in turn, corresponds not to an actual destruction of the sym-
metries of the laws but to their hiding, since it is the lowest energy state of the system (or vacuum state,
10This is a blatant abuse of language. To say, for example, that CP is (not) conserved means simply that nature is (not)CP-symmetric. There is no reference to a conserved quantity, but to a symmetry which is maintained.
12
|0〉) which presents an asymmetry11. Following M. Guidry [38], one can classify broken symmetry sys-
tems according to whether U |0〉 = |0〉 (Wigner mode), U |0〉 6= |0〉 and the symmetry is global (Goldstone
mode), or U |0〉 6= |0〉 and the symmetry is local (Higgs mode), U being the operator which realizes the
symmetry operation. The first case may correspond to the above explicit breaking examples while the
last two correspond to spontaneous symmetry breaking scenarios.
Classically, one can conceive of situations in which the choice of the lower energy state spoils existing
symmetries: the fall of a vertically held pole spontaneously breaks rotational symmetry as a particular
direction is chosen out of the existing infinite possibilities. One can see that the symmetry is hidden and
not destroyed because it is still present in the full set of solutions. Likewise, if the pole is constrained to
move in a plane, there are only two possible ground states and the choice of any of them spontaneously
breaks the symmetry. In quantum systems, however, spontaneous symmetry breaking will not occur
if the number of degenerate ground states is finite, since it is possible to construct a state from their
superposition [39]. The concept is only applicable to idealized infinite systems, such as a ferromagnet.
If a ferromagnetic material is heated above the Curie (critical) temperature, Tc, no preferred ori-
entation for the magnetic dipoles exists, resulting in zero net magnetization. However, as soon as the
temperature drops below Tc, the system transitions to a ground state where a net magnetization develops
in one of the infinite possible directions. A link to Curie’s principle is here readily found: it is only the
asymmetry of the situation that allows the phenomenon, i.e. the appearance of magnetization in the
absence of an applied magnetic field.
An important result which arises in the context of SSB is Goldstone’s Theorem [40], which refers to
the appearance of massless bosons – termed (Nambu-)Goldstone bosons – when a continuous symmetry
is spontaneously broken. The number of such bosons matches the number of generators12 of the broken
continuous group. Goldstone himself remarks that although “a method of losing symmetry is [...] highly
desirable in elementary particle theory”, there seems to be no way to avoid in this context the introduction
of “non-existent massless bosons”. A solution to this unphysical problem is found in the Higgs mechanism,
which owes its origins to the work of P. Higgs [43, 44], R. Brout and F. Englert [45], and G. Guralnik,
C. Hagen, and T. Kibble [46]. This mechanism consists in the appearance of gauge boson mass terms
when the gauge symmetry of the theory is spontaneously broken, thanks to the Goldstone bosons which
one expects to arise from the break. These unphysical massless bosons are ‘absorbed’ as mass degrees
of freedom by the previously massless gauge bosons. The Higgs mechanism is a crucial feature of the
Standard Model of particle physics, since it allows the generation of mass terms for fermions and gauge
bosons without explicitly compromising the underlying symmetry group. The recent discovery by the
ATLAS and CMS collaborations [47, 48] of a boson with a mass of 125.9 ± 0.4 GeV [37], compatible
with the Standard Model Higgs particle (which corresponds to an unabsorbed degree of freedom), will
certainly allow for a deeper look into the question of the origin of particle mass.
11Thus, unlike what one might suppose, the elliptical motion of planets as opposed to the isotropic character of Newton’slaw of universal gravitation does not constitute an example of spontaneous symmetry breaking.
12One might regard the group Zn = e, a, a2, . . . , an−1 as being generated by powers of one single element (to wit, a).Similarly, the elements of a continuous (Lie) group which lie infinitesimally close to the identity element form a vector space– tangent to the manifold which the group defines – spanned by basis vectors which are called the group generators [41].Exponentiation of these infinitesimal elements yields the remainder of the group (recall Section 1.3.1). In special cases,which will be of no concern to us, the number of Goldstone bosons might be less than the number of group generators [42].
13
1.4 A Philosophical Interlude
According to E. Castellani [49], symmetries in physical theories play four separate roles. A classifica-
tory role is easily identified in the crystallographic enterprise as well as in the classification of elementary
particles, an example being the work of Wigner on the representation theory of the Poincare group [50].
Symmetries also possess a normative role, in the sense that they regulate the form of the theory, as well
as a unifying role, present in the construction of theories, such as GUTs, which seek to join the funda-
mental interactions under a simple symmetry group. Lastly, symmetries can be attributed an explanatory
role, as they are taken to be fundamental principles which dictate how nature must behave. A notion
of simplification is transversal to the four roles. According to Curie’s principle, it is asymmetry which
allows a diversity of phenomena, which is to say, complexity.
The normative and explanatory roles lead to a methodological aspect: model building in modern
particle physics often relies on postulating the presence of certain symmetry properties, as well as the
breaking of these invariances, determining which Lagrangian terms are allowed. The increasingly central
role which has been given to symmetries is not without justification: both the prediction of the Ω−
particle, based on an incomplete classification of baryons and mesons (the Eightfold Way of Gell-Mann
and Ne’eman [51]), and its subsequent discovery, along with the prediction and discovery of the W and
Z bosons, mark extraordinary successes of the use of symmetry in physics. This predictive power seems
to imply that symmetries as a basis for the description of the physical world are ‘here to stay’13.
It is important to emphasize the epistemological character of symmetries: as Wigner points out [12],
the artificial (but fruitful) division of reality into initial conditions and laws of nature would not be
possible in the absence of invariace under spacetime displacements. Physics would then differ from place
to place and time to time, compromising the way scientific knowledge is obtained. There is also a close
connection between symmetries and objectivity. Lorentz invariance, for instance, establishes a physical
equivalence between different observers, with different perspectives. Equivalence renders conventions
irrelevant and only that which is invariant under the symmetry group is considered physical.
The effectiveness of symmetry in physics is analysed by P. Kosso [9], who makes a case against any
association of symmetry with design, stating that such a link is incompatible with the objective nature of
symmetries: these are introduced, in part, to remove dependences on the decisions of a conscient observer
from the physical laws. To end this interlude, one once again turns to Wigner, who ponders about the
unreasonable effectiveness of mathematics in the study of natural sciences [52]. In his words, “the miracle
of the appropriateness of [...] mathematics for the formulation of the laws of physics is a wonderful gift
which we neither understand nor deserve”.
13An interesting connection is established by Wigner [12] concerning the relationship between physical laws and eventsand that of symmetries and laws. Were we to know all future events, the correlations between them (the laws) wouldbecome unnecessary. In the same way, the full knowledge of the laws of nature (if such a thing is possible) would maketheir symmetry properties a mere curiosity.
14
1.5 The Asymmetry of Existence
We live in an apparently biased Universe in which there is, as far as one can see [53], a clear preference
for the presence of matter over antimatter. One of the greatest challenges of modern physics is to explain
the observed asymmetry (in the state of the system, not in the laws which govern it) in light of our present
knowledge of cosmology and particle physics. How could this imbalance come to be? One assumes that
it is either a consequence of the initial conditions of the Universe, i.e. an accident, or it is dynamically
generated as the Universe cools down. The latter hypothesis calls for a mechanism – baryogenesis – that
produces an excess of baryons over antibaryons at some point in the past and, therefore, allows for our
own existence.
1.5.1 Experimental Evidence
Although antimatter has been produced and studied, no primordial antimatter has been detected in
the observable Universe as of today. Local evidence for the baryon asymmetry of the Universe (BAU)
includes the scarcity of antimatter in the Earth, the absence of γ-ray production due to solar wind, and
the successful landing of planetary probes in the planets of our solar system.
Cosmic rays offer an additional probe into far away regions of the Universe: nonvanishing ratios
between the number of cosmic antiprotons and protons have been measured, the magnitude of which is
consistent with secondary proton production in collisions of cosmic rays with matter in the interstellar
medium, in processes such as p + p → 3p + p [54]. Additionally, no antinuclei such as antihelium (He)
have been found in cosmic radiation [55]. In short, the cosmic ray data is consistent with the absence of
large structures of antimatter in the Universe. An even more compelling evidence for the absence of such
structures comes from the missing γ-ray background one would expect from the collision of galaxies and
anti-galaxies, at the hypothetical boundaries of matter-antimatter domains.
1.5.2 The Tuning of Initial Conditions
One turns now to the problem of whether the observed BAU was generated dynamically or accidentally.
The reason why the accidental asymmetry scenario, based on proper initial conditions, is usually rejected
is twofold. On the one hand, generating the observed asymmetry would require one extra quark for every
∼ 107 antiquarks in the early Universe [56]. This presents itself as a problem of naturalness, and one
usually avoids this much fine-tuning. On the other hand, a constraint arises if the effects of inflation
are considered. Inflation corresponds to a rapid expansion of the scale factor in the early Universe (see
Section 4.1.1) and is predicted to exponentially dilute any initial asymmetry.
In the dynamical case, the symmetry violating processes should be weak enough in order to produce
the observed values of the BAU, which can be quantified in the present epoch through the parameter η,
defined as the ratio between the number density of baryonic number, nB, and that of photons, nγ :
η ≡ nB
nγ≡ nb − nb
nγ' nbnγ. (1.5)
15
The quantity nB is itself defined as the difference between the number densities of baryons (nb) and
antibaryons (nb).
Apart from the baryon-antibaryon imbalance, the value of η also signals the imbalance between matter
and radiation in the present-day Universe. The abundances of light elements and the anisotropies of the
cosmic microwave background (CMB) constrain the value of η independently. One verifies that both
conditions give compatible values for the BAU parameter, which when combined give [37]:
η = (6.19± 0.15)× 10−10. (1.6)
An equivalent description is given by the quantity YB, corresponding to the ratio between nB and the
current entropy density of the Universe. The approximate relationship YB ' η/7.04 holds.
1.5.3 The Possibility of a B-Symmetric Universe
The observation of matter dominance does not seem to exclude the existence of large quantities of
antimatter in far away regions of the Universe that would effectively make it baryon symmetric. Let
us assume a B-symmetric Universe, with any possible initial asymmetry being washed out by inflation.
One can then compute baryon and antibaryon densities at freeze-out – the time when the expansion rate
of the Universe, measured by the Hubble parameter H(t), surpasses the rate Γ of nucleon-antinucleon
annihilation (for details, see Section 4.1). These densities will coincide due to B-symmetry of the laws.
The temperature at freeze-out (H(T ) ∼ Γ) can be estimated as T ∼ 20 MeV [57], well below the nucleon
mass, mN ∼ 1 GeV, for which the following relation holds (cf. Eq. (4.17)):
nbnγ
=nbnγ∼(mN
T
)3/2
exp(−mN/T ). (1.7)
An unreasonable value of nb/nγ ∼ 10−19 is thus obtained at freeze-out. One concludes that some
mechanism must have segregated matter and antimatter when nb/nγ ∼ 10−10, which corresponds to
T ∼ 40 MeV, or earlier. However, such mechanism would need to operate at the scale of the observable
Universe and, at this time, the horizon (causally connected region) only contained a negligible amount
of matter (∼ 10−7M) [54]. Therefore one cannot avoid the conclusion that the Universe must have
possessed a baryon asymmetry already at early times.
To understand the creation of this asymmetry, one has to go beyond the realms of pure cosmology and
consider the role that particle physics may have played in its production. The following chapter is thus
dedicated to the Standard Model of particle physics and to extensions which can account for neutrino
masses, whose smallness will be a concern. Their presence allows for lepton mixing, which will be discussed
in Chapter 3. A compelling case was presented for a dynamically generated baryon asymmetry, which
implies the presence of an asymmetry within the physical laws (recall Curie’s principle). As previously
hinted, the construction of the SM relies heavily on the concepts of symmetry and symmetry breaking,
and one should investigate whether the asymmetry conditions for generating the BAU – the Sakharov
conditions – are satisfied in its context. This will be done in Chapter 4, where a popular baryogenesis
scenario accommodating naturally small neutrino masses – leptogenesis – is presented. In Chapter 5, the
viability of leptogenesis in a model which includes a discrete flavour symmetry and spontaneous breaking
of the CP symmetry is analysed.
16
The Standard Model of Particle
Physics and (slightly) Beyond 2Particle physics relies on the principle that the interactions between the constituents of the subatomic
world can be described, to a remarkable approximation, by the Standard Model. In spite of its successes
and repeated experimental verification, the SM cannot provide a satisfatory answer to some questions
which remain open. Among these is the fact that neutrino masses cannot be accounted for in the model,
as required by neutrino oscillations. In this chapter we review relevant aspects of the SM and consider
extensions in which small neutrino masses arise in a natural way.
2.1 Recap of the Electroweak Sector of the SM
The Standard Model is a relativistic quantum Yang-Mills theory built by postulating both an un-
derlying SU(3)c×SU(2)L×U(1)Y gauge symmetry group – the subscripts are labels which correspond,
respectively, to colour, left-handedness, and hypercharge – as well as the following field content:
φ ≡(φ+
φ0
)∼(2, 1/2
), `αL ≡
(ναLlαL
)∼(2,−1/2
), qαL ≡
(uαLdαL
)∼(2, 1/6
),
lαR ∼(1,−1
), uαR ∼
(1, 2/3
), dαR ∼
(1,−1/3
).
(2.1)
In the above, φ, `α and qα represent the Higgs, lepton, and quark doublets, respectively, while the fields
uα and dα correspond to up- and down-type quarks, and να and lα to neutrinos and charged leptons.
The index α runs over three generations (or families) of fermions.
Absent from the postulated field content are the twelve gauge bosons which appear in the theory
following the demand of local gauge invariance. One will henceforth focus on the electroweak sector,
which corresponds to the SU(2)L×U(1)Y gauge group, ignoring the (unbroken) strong SU(3)c symmetry.
The quantities given between parentheses in (2.1) refer to representation assigments under this electroweak
subgroup. The first one specifies how the field transforms under SU(2)L, to which three generators Ii
are associated. A value of ‘1’ signals that the field transforms as a singlet, Ii = 0, whereas ‘2’ indicates
that it transforms as a doublet, Ii = τ i/2 (τ i are the Pauli matrices). The second quantity refers to the
value taken by the hypercharge operator Y , the single generator of U(1)Y , for each (non-gauge) field.
The SM is also a chiral theory, meaning that its basic fermionic ingredients are eigenfunctions of the
chirality matrix γ5. In fact, it is possible to decompose a spinor field ψ into these right-handed and
17
left-handed chiral eigenfunctions (with eigenvalues +1 and −1 respectively):
ψ = ψR + ψL. (2.2)
One thus defines the operators PR,L which obey:
ψR =1 + γ5
2ψ ≡ PR ψ, ψL =
1− γ5
2ψ ≡ PL ψ. (2.3)
These operators are said to be chiral projectors since (PR,L)2 = PL,R, PR+PL = 1, and PRPL = PLPR =
0. The chiral fields ψR,L are two-component spinors and belong to the simplest nontrivial representations
of the Lorentz group [58], which grants them a fundamental role in the construction of the SM. The
dynamics of the fields are encoded in the Lagrangian (density) of the theory which reads1:
LSM = (Dµ φ)†(Dµ φ)− V (φ)− 1
4AiµνA
iµν − 1
4BµνB
µν
+ i `αL /D `αL + i qαL /D qαL + i lαR /D lαR + i uαR /D uαR + i dαR /D dαR
−(Ylαβ `αL φ lβR + H.c.
)−(Yuαβ qαL φ uβR + H.c.
)−(Ydαβ qαL φdβR + H.c.
).
(2.4)
One recognizes, in the above expression, the Klein-Gordon, Dirac and Proca kinetic Lagrangian terms
for scalar (spin 0), spinor (spin 1/2) and vector (spin 1) fields, respectively. However, the requirement of
gauge invariance under SU(2)L×U(1)Y implies a change to the usual kinetic terms of the fermions and
scalar Higgs doublet fields, namely the substitution of the ordinary derivative by a covariant one [41]:
∂µ → Dµ ≡ ∂µ − i g2Aiµ I
i − i gY Bµ Y, (2.5)
where g2 and gY are coupling constants associated with the gauge groups SU(2)L and U(1)Y , respectively.
This leads to the introduction of four real boson fields Aiµ and Bµ in the theory, whose transformation
properties are such that the kinetic terms are kept invariant under the action of the gauge group.
A modification is also in order for the Proca Lagrangian. Whereas the term for the Bµ field is a
typical one, with the usual definition Bµν ≡ ∂µBν − ∂νBµ, the term for the Aiµ gauge fields relies on the
following definition of Aiµν , to ensure invariance of the kinetic Proca term under the group SU(2)L:
Aiµν ≡ ∂µAiν − ∂νAiµ + g2 εijkAjµA
kν , (2.6)
where εijk is the totally antisymmetric (Levi-Civita) tensor of rank 3.
Further inspection of Eq. (2.4) reveals the absence of mass terms for all SM particles. Charged
fermion masses cannot be written down without explicitly breaking the symmetry group, since – as one
can see from Eq. (2.9) – there is no invariant way of combining an SU(2)L doublet with an SU(2)L
singlet, as required by the form of a Dirac mass term(PR,Lψ = ψPL,R
):
−mψψ = −m(ψR ψR + ψR ψL + ψL ψR + ψL ψL
)= −m
(ψR ψL + ψL ψR
). (2.7)
1 Aside from ignoring the strong sector, one omits gauge fixing and Faddeev-Popov terms in LSM. The Feynmanslash notation is also employed: /A ≡ γµAµ, where γµ are the Dirac matrices (µ = 0, 1, 2, 3) obeying γµ = γ0㵆γ0 and
γµ, γν = 2gµν . The chirality matrix is given by γ5 ≡ γ5 ≡ iγ0γ1γ2γ3 = γ5† = (γ5)2 and anticommutes with every γµ
(the number ‘5’ is a mere label, not a Dirac index). Additionally, one defines ψ ≡ ψ†γ0 and φ ≡ iτ2φ∗.
18
This clash with reality is resolved though the Higgs mechanism: after electroweak symmetry breaking
(EWSB), the Yukawa terms in the Lagrangian will give rise to the desired fermionic mass terms. The
former make up the last line of (2.4), where Yl, Yu and Yd are general complex matrices and ‘H.c.’
denotes Hermitian conjugation. Mass terms for the SM bosons will also arise after EWSB. In particular,
one might suppose that, due to the form of the scalar potential in LSM,
V (φ) ≡ µ2φ†φ+ λ(φ†φ
)2, (2.8)
there is a mass term for the scalar doublet components. However, this is not so straightforward since one
requires µ2 < 0 for EWSB to occur. Thus, this term is not a Klein-Gordon mass term.
Before proceeding, we present, for the sake of completeness, the explicit way in which the fields trans-
form under and element g of the gauge group, parametrized by the four local parameters(θi(x), η(x)
):
ψ → Ug ψ = exp(i θi(x) Ii
)︸ ︷︷ ︸
SU(2)L
exp(i η(x)Y
)︸ ︷︷ ︸
U(1)Y
ψ. (2.9)
2.1.1 Neutral and Charged Electroweak Currents
Expanding the covariant derivative (2.5) in LSM, one can extract the way in which the gauge bosons
interact with fermions. For the three generations of leptons,
L`Gauge =1
2`αL
(g2 /Ai τ i − gY /B
)`αL − gY lαR /B lαR
=1
2
(ναL lαL
) ( g2 /A3 − gY /B g2
(/A1 − i /A2
)
g2
(/A1 + i /A2
)−g2 /A3 − gY /B
) (ναLlαL
)− gY lαR /B lαR.
(2.10)
The diagonal interaction terms in weak isospin space – the space of SU(2)L rotations – are grouped
into the so-called neutral current (NC) Lagrangian, while the off-diagonal terms correspond to a charged
current (CC) Lagrangian. These terms determine the interactions of fermions with electroweak bosons:
L`NC =1
2ναL
[(g2 sW − gY cW
)γµAµ +
(g2 cW + gY sW
)γµZµ
]ναL
− 1
2lαL
[(g2 sW + gY cW
)γµAµ +
(g2 cW − gY sW
)γµZµ
]lαL
− gY lαR[cW γµAµ − sW γµZµ
]lαR,
(2.11)
L`CC =g2√
2
(ναL γ
µWµ lαL + lαL γµW †µ ναL
)≡ g2√
2j`,µCCWµ + H.c.. (2.12)
In the previous expressions one has employed the definitions
Wµ ≡A1µ + i A2
µ√2
, Zµ ≡ cos θW A3µ − sin θW Bµ , Aµ ≡ sin θW A3
µ + cos θW Bµ, (2.13)
where θW is the weak mixing or Weinberg angle, actually introduced by Glashow in 1961 [20], and the
shorthands cW ≡ cos θW and sW ≡ sin θW . The fields Wµ and Zµ correspond to the bosons responsible
for weak interactions while Aµ (no numerical index) corresponds to the photon.
By requiring that the neutrinos do not couple to the electromagnetic field, it follows that:
g2 sin θW = gY cos θW ⇒ gY /g2 = tan θW . (2.14)
19
Plugging this relation back into (2.11), one can extract the electromagnetic coupling of charged leptons:
L`A = −1
2lαL(g2 sW + gY cW
)γµAµ lαL − gY cW lαRγ
µAµ lαR
= −gY cW(lαL γ
µAµ lαL − lαR γµAµ lαR)
= −gY cW lL γµ lLAµ.
(2.15)
One recognizes, in the last line of this expression, the electromagnetic current and establishes the iden-
tification e = gY cos θW = g2 sin θW , where e represents the elementary charge. The procedure here
outlined can be similarly applied to extract the expressions for quark CCs and NCs. The electroweak
gauge interaction terms can thus be brought to the form:
LGauge = e jµAAµ +g
cos θWjµZ Zµ +
(g√2jµCCWµ + H.c.
). (2.16)
Here, jµA represents the electromagnetic NC for quarks and charged leptons, jµZ the weak NC for all quarks
and leptons, and jµCC the (weak) CCs, defined for the case of leptons in Eq. (2.12).
The presence of different conventions in the literature demands careful bookkeeping of signs entering
the definitions of couplings and fields – see Ref. [59] for a convention-independent notation. The choices
taken above coincide with those of Peskin and Schroeder [41].
2.1.2 The Higgs Mechanism
In the SM, the vacuum (recall Section 1.3.4) is identified by minimizing the scalar potential V (φ) of
(2.8) with respect to the fields φ. If µ2 > 0 and λ > 0, this is trivially satisfied at 〈φ+〉 = 〈φ0〉 = 0,
where the brackets denote the vacuum expectation value (VEV). In order to spontaneously break the
gauge symmetry, one takes µ2 < 0 and λ > 0. Now, some of the fields must possess a nonzero value in
the vacuum, where the potential is minimum. However, in order to preserve the charge neutrality and
isotropy of the vacuum, only neutral scalar fields can acquire a nonzero VEV, which is possible for the
lower isospin component of the Higgs doublet alone:
〈φ〉 =
(〈φ+〉〈φ0〉
)≡(
0
v
), v ∈ C, v 6= 0. (2.17)
The minimum of the potential will then be given by:
∂V (〈φ〉)∂〈φ〉 = 0 ⇒ |v|2 = −µ
2
2λ⇒ v = eiϑ
√−µ2
2λ. (2.18)
Choosing a particular minimum, hereafter ϑ = 0, realizes the spontaneous breaking of the symmetry.
There is still a residual, unbroken U(1) symmetry corresponding to electric charge conservation. The
electric charge operator Q is related to the weak isospin generator I3 (assumed to correspond to a
diagonal matrix) and to the hypercharge operator Y by the Gell-Mann–Nishijima relation, which reads:
Q = I3 + Y. (2.19)
To see which symmetry groups are preserved by the vacuum, one simply needs to check if the VEV of
the fields belongs to the kernel of the group generatorsG, since in this case ei αG〈φ〉 = (1+i αG+. . . )〈φ〉 =
20
〈φ〉 + i αG 〈φ〉 + · · · = 〈φ〉. This ceases to happen for the generators Ii and Y when the Higgs doublet
acquires a nonzero VEV, but not for the combination of (2.19), as expected:
Q 〈φ〉 =[I3 + Y
]〈φ〉 =
[1
2
(1 0
0 −1
)+
1
2
(1 0
0 1
)](0
v
)=
(1 0
0 0
)(0
v
)= 0. (2.20)
Since physical fields must have a zero value in the vacuum, one reparametrizes the Higgs doublet as:
φ(x) = exp
(iξi(x)
v
τ i
2
)
0
v +H(x)√
2
, (2.21)
with 〈H(x)〉 = 〈ξi(x)〉 = 0. H(x) and ξi(x) correspond to real fields, unlike φ0(x) and φ+(x) which are
complex, making transparent that there are four degrees of freedom in φ.
Prior to EWSB, one has the freedom to perform gauge transformations, which leave the Lagrangian
invariant. Choosing a particular gauge – the unitary gauge – one can bring φ to the following form by
cancelling the exponential of (2.21):
φ(x) =
0
v +H(x)√
2
. (2.22)
After EWSB, H(x) will thus correspond to the physical Higgs field and physics can be read off directly
from LSM by replacing φ with that of (2.22). In particular, one has:
Dµφ =
(0
∂µ(H/√
2))
+i g2
2
( √2W †µ
sec θW Zµ
)(v +H/
√2), (2.23)
V (φ) = λ
(H4
4+√
2 v H3 + 2 v2H2
)+ const., (2.24)
which imply, disregarding the constant as well as interaction terms:
∥∥Dµ φ∥∥2 − V (φ) =
1
2(∂µH)(∂µH) +
1
2(4v2λ)H2 +
g22 v
2
2W †µW
µ +1
2
g22 v
2
2 cos2 θWZµZ
µ + . . . (2.25)
From this last expression, one extracts the masses of the Higgs, W and Z bosons2:
mH = 2 v√λ =
√−2µ2, mZ =
g2 v√2 cos θW
, mW =g2 v√
2= cos θW mZ . (2.26)
Experiment yields the values mH = 125.9 ± 0.4 GeV, mZ = 91.1876 ± 0.0021 GeV, and mW =
80.385 ± 0.015 GeV [37] for the massive SM bosons. One sees that, in line with what was discussed
in Section 1.3.4, the ‘gauging-away’ of the three would-be Goldstone bosons ξi corresponds to their ab-
sorption as longitudinal/mass degrees of freedom of the gauge bosons, W± and Z. In the SM, by virtue
of (2.26), one expects the below defined ρ parameter to be unit at tree level:
ρ ≡ m2W
cos2 θW m2Z
. (2.27)
2The normalization of the kinetic Klein-Gordon term in (2.25) implies removing a factor of 1/2 in order to identify m2H .
For a Proca mass term, the factor to remove is also 1/2. While this works for mZ , the W mass term is an exception dueto the complex nature of the field, possessing an additional degree of freedom. Therefore, m2
W is simply the coefficient of
the W †µWµ term.
21
The inclusion of additional Higgs multiplets Φk in the standard recipe results in a deviation to this
formula, since all of the new VEVs will contribute to the masses of gauge bosons (Ik and I3k are the weak
isospin of the multiplet Φk and third isospin component of the multiplet component which acquires the
VEV, respectively) [60]:
ρ =
∑k
[Ik(1 + Ik)− (I3
k)2]v2k
2∑k (I3
k)2 v2k
. (2.28)
A surplus of Higgs singlets or doublets is seen to be inconsequential, whereas VEVs of higher isospin
multiplets are severely constrained by the experimental value ρexp = 1.0004+0.0003−0.0004 [37]. The closeness of
ρ to one is protected by the custodial symmetry [61,62], an accidental approximate symmetry of the SM
which becomes exact in the limit gY → 0, where θW = 0 by (2.14).
2.1.3 Fermion Masses and Mixing
Upon spontaneous breaking of the gauge symmetry, the Yukawa part of the Lagrangian (2.4) reads:
LYuk. = −(v +
H√2
)(Ylαβ lαL lβR + Yu
αβ uαL uβR + Ydαβ dαL dβR
)+ H.c.
= −Mlαβ lαL lβR −Mu
αβ uαL uβR −Mdαβ dαL dβR + H.c. + LHint.,
(2.29)
where Mψ ≡ vYψ is the mass matrix for the ψ fields and LHint. contains the interaction terms H ψLψR.
The fermion mass terms arising from EWSB are generally not diagonal, meaning one must ‘rotate’ the
fields to bring them to the physical basis. The mass eigenstates are obtained by diagonalizing3 Ml, Mu
and Md. This (bi-)diagonalization is achieved by unitary matrices V l,u,dL,R :
V lL†Ml V lR = diag(me,mµ,mτ ) ≡ Dl,
V uL†Mu V uR = diag(mu,mc,mt) ≡ Du,
V dL†Md V dR = diag(md,ms,mb) ≡ Dd,
(2.30)
where the mi are real and positive (for a proof, see Section 4.1 of [58]).
In light of (2.30), one sees that performing the following rotations4 corresponds to changing to a basis
where fermion fields have a definite mass:
lαL →(V lL)αβlβL, lαR →
(V lR)αβlβR,
uαL →(V uL)αβuβL, uαR →
(V uR)αβuβR,
dαL →(V dL)αβdβL, dαR →
(V dR)αβdβR.
(2.31)
Since, unlike what happens in a so-called weak-basis transformation, the previous transformations
distinguish SU(2)L doublet components, there will be a misalignment in the quark gauge interaction
terms. In particular, the quark charged-current Lagrangian terms, analogous to those of Eq. (2.12), now
3The word ‘diagonalizing’ might be misleading since one does not seek the eigenvalues of these matrices, but insteadwants to bring them to diagonal form through generally unrelated rotations (in flavour space) of the fields which make upthe mass term in the Lagrangian.
4Writing ψ → Uψ, for an invertible matrix U , is a quick way of saying that one can define a field ψ′ ≡ U−1ψ, then usethe relation ψ = Uψ′ to write the Lagrangian as a function of ψ′, and finally drop the primes.
22
Property # Parameters # Moduli # Phases
n×m General 2nm nm nm
n× n Symmetric n(n+ 1) n(n+ 1)/2 n(n+ 1)/2
n× n Unitary n2 n(n− 1)/2 n(n+ 1)/2
n× n Hermitian n2 n(n+ 1)/2 n(n− 1)/2
Table 2.1: Number of real parameters contained in complex matrices depending on their properties.
become (in terms of the rotated fields):
LqCC =g2√
2uβL
(V uL)∗αβγµ(V dL)αρdρLWµ + H.c.
=g2√
2uβL γ
µ(V uL† V dL
)βρdρLWµ + H.c.
≡ g2√2uαL γ
µ (VCKM)αβ dβLWµ + H.c.,
(2.32)
while the neutral current of Eq. (2.11) remains unchanged, implying that there are no flavour chang-
ing neutral currents at tree-level (GIM mechanism [63]). The matrix VCKM is known as the Cabibbo-
-Kobayashi-Maskawa matrix [64,65] or quark mixing matrix, and can be seen as a transformation which
relates massive down-type quarks with the linear combinations of them, dβ′L ≡ (VCKM)αβ dβL, that
interact with massive up-type quarks through the CC interaction.
The number of parameters for various classes of complex matrices can be found in Table 2.1. Being
unitary, the VCKM matrix can be described by n(n − 1)/2 moduli, which correspond to mixing angles,
since all entries of a unitary matrix have moduli ≤ 1, and n(n + 1)/2 phases, where n is the number
of generations. However, not all phases are physical, since some can be removed through the rephasing
of quark fields. This is possible because, aside from the CC terms, the Lagrangian possesses a U(1)2n
symmetry, i.e. is invariant under:
uαL,R → ei ϕuα uαL,R, dαL,R → ei ϕ
dα dαL,R (2.33)
Thus, one expects to identify 2n phases as unphysical. However, this is not the case since one can
decompose this transformation as a global rotation of, say, ei ϕu2 , and a subsequent rephasing where only
2n − 1 phases are available. The Lagrangian is unaffected by the first rotation, due to a global U(1)B
symmetry – corresponding to baryon number conservation – and so 2n − 1 phases are removable by
rephasing. For the case of interest, n = 3, one obtains three mixing angles θ12, θ13, θ23 ∈ [0, π/2], and a
single phase δ ∈ [0, 2π[. The standard parametrization of the VCKM is given by [66,67]:
VCKM =
c12 c13 s12 c13 s13 e−iδ
−s12 c23 − c12 s23 s13 eiδ c12 c23 − s12 s23 s13 e
iδ s23 c13
s12 s23 − c12 c23 s13 eiδ −c12 s23 − s12 c23 s13 e
iδ c23 c13
, (2.34)
where sij ≡ sin θij and cij ≡ cos θij . The δ phase is the only source of CP violation in the SM [26].
One now turns to the case of leptons. Since a right-handed component for the neutrino field is missing
from the SM field content of (2.1), one sees that the appearance of a Dirac mass term in LSM upon EWSB,
23
as in the case of other fermion fields, is not possible. Hence, neutrinos are strictly massless in the SM.
This implies that performing a rotation of the neutrino fields in flavour space is inconsequential as far as
LYuk. is concerned, since there is no mass term to spoil: neutrinos have definite (zero) mass in all bases.
This freedom can be used to cancel the effects that diagonalizing the charged lepton mass matrix has on
the lepton charged current Lagrangian, L`CC. By performing the transformation (recall (2.31)):
ναL →(V lL)αβνβL, (2.35)
the Lagrangian of (2.12) remains unchanged when expressed in terms of fields with definite mass. Lepton
mixing is thus absent from the SM.
Flavour neutrino fields νe,µ,τ are defined as the neutrino field combinations which are coupled by the
charged current to each massive charged lepton e, µ, τ . Since interactions do not mix lepton flavours,
lepton flavour numbers Li (i = e, µ, τ) – associated to the possibility of invariant rephasings of lepton
fields – and, by extension, total lepton number L =∑i Li, are conserved in the SM.
One is finally in the position to count the number of physical parameters in the SM. Aside from two
parameters which pertain to the strong sector, namely the strong coupling constant and a parameter
which quantifies the strong CP problem (see below), the SM depends on seventeen parameters, to be
taken as experimental input. These are the two electroweak couping constants gY and g2, six quark
masses mu,...,t and three charged lepton masses me,µ,τ , two scalar potential parameters, which can be
chosen to be mH and v, and the four VCKM mixing parameters.
Fermion masses comprise a large portion of this parameter list. Since the theory offers no prediction
for their values, their origin remains a mystery. One might also wonder why are there three generations
of fermions. The question of why the experimental values are what they are seems in general to be
metaphysical and unproductive. However, both the closeness of the VCKM to the identity matrix as well
as the presence of mass hierarchies between generations can be seen as suggestive of hidden relations
between parameters. Disparities between the orders of magnitude of fermion masses are made clear in
Fig. 2.1, where a summary of the fermionic content of the SM is given.
Other shortcomings of the theory include the absence of a description of gravity and of a viable dark
matter candidate, as well as the lack of solutions for the hierarchy and strong CP problems. The hierarchy
problem is related to the fine-tuning of parameters required to keep mH near the electroweak scale (∼ 102
GeV) as one demands validity of the model up to the Planck scale (∼ 1019 GeV). The strong CP problem
is in turn concerned with the absence of a CP violating term in the strong sector of the SM, despite there
being no symmetry which forbids it. Additionally, the SM cannot achieve gauge coupling unification at
high-energies, unlike what happens in supersymmetric models or GUTs, and, as we shall see in Section
4.3, is incapable of explaining the observed BAU.
These issues support the idea that the SM is an effective low-energy theory and not a final one.
Although some of these problems can be classified as theoretical prejudice, undeniable evidence for
physics beyond the Standard Model arises when one considers the experimental evidence for neutrino
oscillations [68]: neutrinos have small but nonzero masses. In the next section, simple extensions of the
SM will be considered in order to generate neutrino masses which are naturally small.
24
Figure 2.1: Mass hierarchy of the elementary fermions observed in Nature. Mass values and uncertainties
are obtained from [37] and references therein (light quarks present the highest relative mass uncertainties).
Charges are presented in units of elementary charge. Kinematical bounds from beta decay are considered
for neutrino masses.
2.2 Neutrinos Beyond the SM
2.2.1 The Neutrino Mass Term
A Dirac mass term for neutrinos can be defined consistently with the gauge symmetry if one considers a
straightforward extension obtained by adding three right-handed sterile neutrino fields ναR to the particle
content. These extra fields are two-component spinors and weak isospin singlets with null hypercharge
(hence ‘sterile’), and the extra Lagrangian terms read, before and after spontaneous symmetry breaking:
LνYuk. = −Yναβ `αL φ νβR + H.c.
EWSB−−−−−→ −(v +
H√2
)Yναβ ναL νβR + H.c. . (2.36)
Diagonalization of the neutrino mass matrix Mν = vYν is achieved by rotating the neutrino fields into a
basis of massive states ν1,2,3. The rotation (as was the case for quarks) spoils the CC alignment, making
lepton mixing possible. This mixing will be completely analogous to the quark case presented in the
previous section, with a mixing matrix parametrized as that of Eq. (2.34). However, by looking at Fig.
2.1, one sees that the entries of Yν must be unnaturally smaller than those of Yl,u,d in order to generate
neutrino masses . 1 eV. In fact, one might expect that by virtue of having the same origin, namely
electroweak symmetry breaking, the twelve elementary fermions would have comparable masses. Even
though this is not the case for charged fermions, which span over five orders of magnitude in mass, the
fact that neutrinos are such a strong outlier seems to signal the presence of an alternative mass generation
mechanism at work.
Since neutrinos are neutral fermions, it is possible to construct for them a Majorana mass term [69].
This is achieved by assuming that the chiral components of the neutrino field are not independent.
Consider the Dirac equation for a (free) fermion field with mass m, and the equations obtained by
25
applying to it the projectors PL and PR:
(i γµ∂µ −m
)ψ = 0 ⇒
i γµ∂µψR = mψL
i γµ∂µψL = mψR. (2.37)
If the chiral fields ψR and ψL are not independent, then one of these equations is redundant. Manipulating
the first one yields:
i γµ∂µψR = mψLH.c.−−−→ −i ∂µψ†R 㵆 = mψ†L
×γ0
−−−→ −i ∂µψR γµ = mψLtranspose−−−−−−−→ −i γµT ∂µψR
T= mψL
T.
(2.38)
Since the matrices −γµT also satisfy the anticommutation relation of Footnote 1 for the γµ matrices,
there is a unitary matrix C in Dirac space such that (see for instance section 5.2 of Ref. [70]):
γµT = −C−1 γµ C. (2.39)
This matrix, which also obeys CT = −C, is known as the charge conjugation matrix, as applying the
charge conjugation transformation (discussed in Section 1.3.3) to a field ψ results, with a standard phase
choice, in ψ → ψC ≡ C ψ T . Rewriting (2.38) and multiplying it by C to the left gives:
−i γµT ∂µψRT
= mψLT ⇔ i C−1 γµ ∂µ
(C ψR
T)
= mψLT
C×−−−→ i γµ ∂µ
(C ψR
T)
= m(C ψL
T)⇔ i γµ ∂µψ
CR = mψCL .
(2.40)
Performing the identification ψR = ψCL (⇒ ψL = ψCR), leads to a single, independent equation:
i γµ ∂µψL = mψCL , (2.41)
and the decomposition of Eq. (2.2) now reads ψ = ψL + ψCL , which implies
ψ = ψC , (2.42)
known as the Majorana condition. This condition entails that the particles associated with the field ψ,
now called a Majorana field, are their own antiparticles. Since charges of antiparticles are reversed, only
neutral fermions such as neutrinos can fulfil the said condition. Taking ψ = ν, one can build a neutrino
Majorana mass term, leading to a Majorana Lagrangian of the form (2.41) as follows:
LνMaj. =1
2ν(i /∂ −m
)ν . (2.43)
The 1/2 factor was introduced to prevent double counting of independent fields, guaranteeing consistency
in the normalization of QFT field operators [71]. The mass term alone reads:
Lν massMaj. = −1
2mν ν = −1
2m(νCL νL + H.c.
), (2.44)
where νCL can be rewritten by considering that νCL = −νTL C†. One notices that the above mass term is
allowed since νCL not only behaves as a right-handed field (PL νCL = 0, as expected), but also because it
transforms as νL under a Lorentz transformation and, thus, LνMaj. preserves Lorentz invariance. There
is, nonetheless, a symmetry that is clearly broken by the Majorana Lagrangian, namely lepton number L
26
– which automatically implies the breaking of B−L and B+L – as one has lost the liberty to rephase the
neutrino field. The rephasing ν → eiϕ ν would imply νC → eiϕ νC , since ν = νC , which is inconsistent
with the very definition of νC . If the mass m is null, however, one is still free to rephase the component
νL → eiϕ νL, since the kinetic term dissociates νL from νCL . The Majorana mass term can then be
understood, if m is small, as a perturbation to an effective Lagrangian which generates transitions ∆L = 2.
The fact that lepton number is broken implies that not as many phases can be removed from the lepton
mixing matrix as in the quark case. An overview of lepton mixing is postponed to Section 3.1.
One might ask if it is possible to introduce Majorana masses for neutrinos in the SM, given the available
field content. A mass term of the type (2.44) is forbidden since the νL are components of an SU(2)L
doublet and there is no field in the SM with the required electroweak assignments to aid in producing
an invariant term of that type which is also renormalizable. If one forgoes this last requirement, then
the lowest dimensional term which can be constructed using SM fields and induces Majorana neutrino
masses is the 5-dimensional Weinberg operator [72], which can be written as [73]:
LWein. = cαβ1
Λ
(`αCL φ∗)(φ† `βL
)+ H.c. = − cαβ
1
Λ
(φ† `αL
)TC†(φ† `βL
)+ H.c., (2.45)
where the cαβ are complex coefficients and Λ corresponds to an energy cutoff. The presence of nonrenor-
malizable terms in the Lagrangian implies that the SM is an effective theory, valid at low-energy, whereas
the ‘real’ theory becomes manifest only at energies of the order of a new, high-energy scale Λ.
It is worth noting that φ transforms under the electroweak group as a doublet with hypercharge
Y = −1/2, opposite to that of φ, and that after EWSB one gets5
φ(x) =
v +
H(x)√2
0
. (2.46)
As such, upon EWSB, one indeed obtains a Majorana neutrino mass term from (2.45):
LWein.EWSB−−−−−→ v2
Λ
(cαβ ναCL νβL + H.c.
)+ LHνint. ≡ −
1
2
(Mν
αβ ναCL νβL + H.c.
)+ LHνint. , (2.47)
where LHνint. contains interaction terms of neutrinos with the Higgs boson H(x) and Mν is the neutrino
(Majorana) mass matrix, for which flavour is taken into account.
The above considerations are independent of the form of the high-energy theory which gives rise to
the effective Weinberg operator LWein. at low-energy. In the next section, we consider seesaw extensions
of the SM in which Majorana neutrino masses arise. Their values turn out to be very small compared to
the rest of the fermions if a large enough scale Λ is considered, as suggested by (2.47).
2.2.2 The Seesaw Mechanism
If one regards the SM as an effective description of particle physics, one forcibly has to add new degrees
of freedom to the theory, taken to be heavy (with masses of order Λ), which are typically available at high-
-energies but decouple from the theory at low-energy. In seesaw extensions of the SM, the 5-dimensional
Weinberg operator arises after integrating out massive states from tree-level interactions in which they
5This fact was taken into account in (2.29) in order to obtain the mass terms of up-type quarks.
27
`L `L
φ φ
NR,ΣR
`L
`L φ
φ
∆YN,Σ YN,Σ Y∆ µ∗
Figure 2.2: Exchange interactions which in the effective theory give rise to the Weinberg operator of
(2.45). Seesaw types I and III correspond to the exchange of fermion fields NR and ΣR, respectively (left
diagram), while the type II seesaw mechanism is implemented through the exchange of scalar fields ∆
(right diagram).
are exchanged. Such an interaction reduces, at lower energy, to a four-point interaction of the form
φφ``, which produces neutrino mass terms following EWSB. Therefore, one must introduce fields ψ with
interactions terms of the form ψφ`, corresponding to type I or type III seesaw mechanisms, or both ψφφ
and ψ``, as is the case with the type II seesaw mechanism (see Fig. 2.2).
The Lagrangian terms which include the new fields must not spoil the SM gauge symmetry, which is
not accidental and must therefore be respected by the high-energy theory. Consequently, if one wishes to
include interaction terms of the form ψφ`, one immediately realizes that ψ must have zero hypercharge.
In order to preserve invariance under SU(2)L, one has only two possibilities: either ψ transforms as
a singlet (type I) or as a triplet (type III). That no other possibility exists follows from the fact that
the decomposition of the Kronecker product 2 ⊗ 2 ⊗ n of SU(2) representations into a direct sum (see
Footnote 4 of Chapter 1 and Ref. [74]) contains only a trivial representation, 1, for the cases n = 1,3.
Additionally, angular momentum conservation mandates these ψ to be fermionic (spin 1/2) fields. For
interaction terms of the form ψφφ and ψ``, one needs a field of hypercharge Y = 1 which could also be
an SU(2)L singlet or triplet. However, (2.19) dictates that such a singlet would be charged, making an
effective neutrino mass term impossible. Therefore, one is left with the triplet option, corresponding to
the type II mechanism. Each of these three standard extensions is now reviewed.
Type I Seesaw
In the type I seesaw paradigm [75–79], n′ sterile neutrino fields NiR ∼ (1, 0) are added to the SM
field content. Aside from a Yukawa term of the form (2.36), one is free to add a Majorana mass term as
that of (2.44) for the NiR. The extended Lagrangian of the theory will then read [73,80]:
Ltype I = LSM + iNiR /∂ NiR −[(
YN †)αi`αL φ NiR +
1
2MN
ij NiCRNjR + H.c.
], (2.48)
where greek indices run over SM generations while roman ones are tied to the n′ new fields. Thanks
to the sterility of the NiR, the covariant derivative in the kinetic term reduces to a regular one and the
presence of an L-violating Majorana mass term is not forbidden by gauge symmetry. While YN is a
general n′ × 3 complex matrix, MN is an n′ × n′ matrix which can be taken to be symmetric in flavour
28
space, since it is fully contracted with a symmetric quantity:
NiCRNjR = −NiTR C†NjR = −
(NiR
)aC†ab
(NjR
)b
=(NjR
)bC∗ba
(NiR
)a
= −NjTR C†NiR = NjCRNiR,
(2.49)
where the indices a, b refer to Dirac space and one has taken into account the anticommutation of fermion
fields. In order to write Feynman rules with which the amplitudes of physical processes are computed
one considers interacting fields of definite mass. Before EWSB, and neglecting finite temperature QFT
corrections, lepton and Higgs fields are massless and the focus is shifted to rotating the heavy mediators:
NiR →(V N)ijNjR, (2.50)
where V N is an n′ × n′ unitary matrix. By looking at Table 2.1 one verifies that V N contains enough
moduli and phases such that the above rotation brings MN to diagonal form, with real and positive
entries Mi. The mass eigenstates will then be the fields Ni ≡ NiR + NiCR, with NiR = PRNi. The
Lagrangian of (2.48) is henceforth assumed to be written in this mass basis.
Feynman rules are directly read off from the expanded interaction term in Ltype I:
Lint.type I = −
(YN †)
αi`αL φ NiR + H.c. = −YN∗
iα `αL ε φ∗ PRNiR + H.c.
= `kαL φj∗[YN∗iα εjk PR
]NiR +NiR
[YNiα ε
jk PL
]φj `kαL,
(2.51)
where superscript roman indices refer to weak isospin space and εij = i(τ2)ij
is the rank 2 antisymmetric
tensor. In these diagrams, no arrow is associated to Majorana fields. Rules for the proper treatment
of Feynman diagrams involving Majorana fermions are presented in Appendix A. Making use of the
φj
`kαL
NiRiYN∗
iα εjk PR
φj
`kαL
NiRiYN
iα εjk PL
Figure 2.3: Vertex contributions from the interaction Lagrangian (2.51) for type I seesaw. These contri-
butions do not depend on the choice of a fermion flow (see Appendix A).
rules presented in Fig. 2.3, one can write the amplitude for the type I exchange of the form `φ → `φ
corresponding to both the s-channel diagram of Fig. 2.2 and a t-channel contribution:
M(`iαL(p1)φj(p2)
NmR−−−→ φk(p3) `lβL(p4))
= u(p4)[iYN
mβ εkl PL
] i(/p1 + /p2 +Mm
)
(p1 + p2)2 −M2m + iε
[iYN
mα εji PL
]u(p1)
+ u(p4)[iYN
mβ εjl PL
] i(/p1 − /p3 +Mm
)
(p1 − p3)2 −M2m + iε
[iYN
mα εki PL
]u(p1).
(2.52)
In the above, no index sums are implied. In order to obtain a description at low-energy, the heavy
sterile fields must be integrated out from this amplitude, which can be achieved by considering the limit
29
M2m p2
n for all n = 1, . . . , 4 and m = 1, . . . , n′. One then obtains:
M(`iαL(p1)φj(p2)
NmR−−−→ φk(p3) `lβL(p4))' i u(p4) YN
mβ
1
MmYNmα
[εklεji + εkiεjl
]PL u(p1). (2.53)
Summing (2.53) over all possible NmR mediators yields:
M(`iαL(p1)φj(p2)→ φk(p3) `lβL(p4)
)' i u(p4)
(YNT 1
MNYN
)
αβ
[εklεji + εkiεjl
]PL u(p1), (2.54)
and matching the above to a four-point interaction results in the following effective Lagrangian term:
L4-pointtype I =
1
4`iαCL `
lβL
φk φj(
YNT 1
MNYN
)
αβ
[εklεji + εkiεjl
], (2.55)
where a 1/4 factor has been included since an identical particle factor of 4 arises when the vertex con-
tribution, present in (2.54), is extracted from (2.55). Upon closer inspection, one sees that this effective
Lagrangian term coincides6 with the Weinberg operator of (2.45) if one takes:
cαβ1
Λ=
1
2
(YNT 1
MNYN
)
αβ
. (2.56)
A direct consequence of this result is the relation between the high-energy matrices MN ,YN and the
low-energy neutrino mass matrix Mν (cf. (2.47)), which reads:
Mναβ = −2 v2 cαβ
1
Λ⇒ Mν = −YNT v2
MNYN . (2.57)
The number of extra parameters that the type I seesaw extension adds to the SM can be determined
by considering the basis where sterile fields have definite masses and lepton doublets and right-handed
charged-lepton fields have been rotated by V lL and V lR, respectively, so that Yl is diagonal. For n
SM lepton generations, the matrix YN contains 2n′n parameters (see Table 2.1) from which n can
be removed by rephasing the lepton doublets `αL (and singlets lαR). This amounts to n′ + 2n′n − nphysical parameters, of which n′(n + 1) are moduli and n(n′ − 1) are phases. For n = 3 and n′ = 3,
the high-energy theory relies on 7n′ − 3 = 18 parameters, to be contrasted with only 9 parameters at
low-energy: 3 masses, 3 mixing angles and 3 phases (see Section 3.1). An excess is present even when
one considers a minimal implementation of the type I mechanism, where n′ = 2. In such a case, only
two out of three SM neutrinos acquire nonzero masses and hence the low-energy description depends on
7 parameters (2 masses, 3 mixing angles, and 2 phases), while the high-energy theory is described by 11
physical quantities.
Type II Seesaw
The minimal type II seesaw scenario [81–85] requires the addition of a single scalar7 triplet ~∆ with
hypercharge Y = 1 to the SM content. One can choose a basis for the 3 × 3 SU(2)L generators Ii = T i
where the third generator T 3 is diagonal:
T 1 =1√2
0 1 0
1 0 1
0 1 0
, T 2 =
1√2
0 −i 0
i 0 −i0 i 0
, T 3 =
1 0 0
0 0 0
0 0 −1
. (2.58)
6The H.c. term of (2.45) follows from repeating the above procedure for the conjugated (reversed arrows) diagrams.7Renormalizability of the high-energy theory requires the new fields ψ to be bosonic. Although angular momentum
conservation forbids fields with spin higher than two, it does not in principle preclude the choice of a vector mediator.
The imposition of Lorentz invariance would in that case lead to a ψ`` term of the form ψµ `CL Γµ `L, where Γµ is a linearcombination of γµ and γ5γµ. However, due to the presence of left projectors PL, said term would automatically vanish.
30
The triplet components thus have a definite charge (see (2.19)) and one may write:
~∆ ≡
∆++
∆+
∆0
∼
(3, 1). (2.59)
In wanting to include a Lagrangian term involving two lepton doublets `L and one scalar triplet ~∆ one
must choose a combination of these fields which is invariant under SU(2)L. Consider the Clebsch-Gordan
decomposition of the Kronecker product of two doublets and one triplet of SU(2):
2⊗ 2⊗ 3 =(1⊕ 3
)⊗ 3 = 3⊗ 3⊕ 3 = 1⊕ 3⊕ 5⊕ 3. (2.60)
By analysing this decomposition, one notices that in order to extract an invariant term (associated with
the singlet representation, 1) one must obtain it from the product of two triplet representations, one of
which is obtained from the product of the two doublet representations 2 ⊗ 2. Looking at the Clebsch-
-Gordan coefficient table for SU(2) [86], one sees that for two doublets ~a = (a1, a2) and ~b = (b1, b2) the
combination which transforms as a triplet is:
a1 b11√2
(a1 b2 + a2 b1
)
a2 b2
∼ 3 , (2.61)
whereas the combination of the elements of two triplets ~c = (c1, c2, c3) and ~d = (d1, d2, d3) which trans-
forms as a singlet is given by:
1√3
(c1 d3 − c2 d2 + c3 d1
)∼ 1 . (2.62)
Overall factors, such as the 1/√
3 above can be ignored, since the combination is still invariant under the
gauge group. By taking ~d = ~∆ and ~c as the triplet found in (2.61), with ~a = ~b = `L, one finds that:
a1 b1 d3 −1√2
(a1 b2 + a2 b1
)d2 + a2 b2 d1 ∼ 1
⇒ νL νL ∆0 − 1√2νL lL ∆+ − 1√
2lL νL ∆+ + lL lL ∆++ ∼ 1
⇒(νL lL
)( ∆0 −∆+/√
2
−∆+/√
2 ∆++
)
︸ ︷︷ ︸≡∆
(νL
lL
)= `TL ∆ `L ∼ 1 ,
(2.63)
where in going from the second to the third line one has collected the components of ~∆ in the here
defined ∆ matrix. It is worth noting that the structure of this invariant term in isospin space has been
established independently of its structure in both Dirac and flavour spaces. In fact, to preserve Lorentz
invariance, `cL must appear to the left of `L, which is achieved by adding the matrix −C† to the term.
One thus concludes that the term −`TL C†∆ `L = `CL ∆ `L can be included in the type II Lagrangian, as
well as a similar term containing φ doublets (notice that φ∗ does not transform as an SU(2) doublet,
while φ = εφ∗ does). Under the SU(2)L×U(1)Y transformation of (2.9), one has ∆→ U∗g ∆ U†g.
Consider a straightforward generalization of the above which consists in adding not one but n′ scalar
triplets ∆i with Y = 1 to the field content. The above matrix notation allows one to write the extended
31
theory Lagrangian in the following compact form [80,87]:
Ltype II = LSM +(Dµ ~∆i
)†(Dµ
~∆i
)− V (φ,∆i)−
(Y∆i
αβ `αCL ∆i `βL + µi φ
T ∆i φ+ H.c.), (2.64)
where the remainder of the scalar potential reads (the Higgs-only term V (φ) is already included in LSM):
V (φ,∆i) =(M∆
ij
)2Tr[∆†i∆j
]+ λ1
ij φ†∆†i ∆j φ+ λ2
ij φ† φTr
[∆†i ∆j
]
+(λ3ijkl Tr
[∆†i ∆j
]Tr[∆†k ∆l
]+ λ4
ijkl Tr[∆†i ∆j ∆†k ∆l
]+ H.c.
).
(2.65)
While the Y∆i are n × n symmetric matrices, λ1,2 and M∆ are n′ × n′ Hermitian matrices (λ3,4 have
general complex-valued entries). A glance at Table 2.1 allows one to see that the unitary transformation
∆i → (V ∆)ij ∆j (2.66)
contains enough parameters to remove all phases from M∆ and all moduli except n′, which correspond
to triplet masses. Mimicking what was done in the type I case, one will henceforth work on this basis
where the ∆i have definite masses, i.e. M∆ij = Mi δij . The complex parameter µi has dimensions of mass
and one is now free to define the dimensionless parameter λi ≡ µi/Mi.
The Feynman rules can be read off8 from the Lagrangian, yielding the vertices of Fig. 2.4.
∆×i
`kβL
−i(−√2)1+δjk
Y∆i∗αβ PR
`jαL
∆×i
`kβL
−i(−√2)1+δjk
Y∆iαβ PL
`jαL
φj
φk
∆×i −i
(√2)1+δjk
µi
φj
φk
∆×i −i
(√2)1+δjk
µ∗i
Figure 2.4: Vertex contributions for type II interactions relevant for effective neutrino mass generation.
For each diagram, the appropriate ∆×i ∈ ∆0i ,∆
+i ,∆
++i is chosen such that electric charge is conserved.
The presented contributions are once again independent on the choice of a fermion flow.
The amplitude for the type II ``→ φφ exchange (rightmost diagram of Fig. 2.2) reads:
M(`iαL(p1) `jβL(p2)
∆×m−−→ φk(p3)φl(p4))
= v(p1)[− i(−√
2)1+δij
Y∆m
αβ PL
] i
(p1 + p2)2 −M2m + iε
[− i(√
2)1+δkl µ∗m
]u(p2),
(2.67)
8Technically one here applies functional derivatives with respect to the interacting fields to the related Lagrangian term.
32
where, once more, no index sums are implied. Integrating out of the heavy triplets is achieved by
considering the limit M2m p2
n for all n = 1, . . . , 4 and m = 1, . . . , n′:
M(`iαL(p1) `jβL(p2)
∆×m−−→ φk(p3)φl(p4))
' i v(p1)[(−√
2)1+δij
Y∆m
αβ
] 1
M2m
[(√2)1+δkl µ∗m
]PL u(p2)
= i v(p1)
(−(−√
2)δij(√
2)δkl 2µ∗m
M2m
Y∆m
αβ
)PL u(p2).
(2.68)
Taking into account that, for the reactions allowed by charge conservation, −(−√
2)δij(√
2)δkl = εklεji +
εkiεjl, one sees that this tree-level amplitude matches the following four-point Lagrangian:
L4-pointtype II =
1
4`iαCL `
lβL
φk φj(
2µ∗mM2m
Y∆m
αβ
)[εklεji + εkiεjl
], (2.69)
which is seen to correspond to the Weinberg operator for the choice:
cαβ1
Λ=
µ∗mM2m
Y∆m
αβ . (2.70)
The low-energy neutrino mass matrix is then given by (where a sum over the n′ triplets is considered):
Mν = −2λ∗i v2
MiY∆i ≡ 2ui Y
∆i . (2.71)
An alternative way of obtaining the above contribution for the neutrino mass matrix is by considering
that, thanks to EWSB, both φ0 and ∆0i acquire nonvanishing VEVs. In the approximation Mi v, the
Higgs VEV v remains unchanged, while the triplet VEV – which is said to be an induced VEV – is given
by [87]:
ui ≡ 〈∆0i 〉 = −λ
∗i v
2
Mi, (2.72)
and one has |ui| v for all i. In fact, due to the experimental constraint of a ρ parameter very close to
one, there is not much room available for the values of the ui. Considering Eq. (2.28), one sees that:
ρtype II =1 + 2
∑i u
2i /v
2
1 + 4∑i u
2i /v
2, (2.73)
and so the quantity√∑
i u2i is constrained by electroweak precision data to be of magnitude . 1 − 10
GeV [88,89]. However, for coupling constants of O(10−9) or higher, the most stringent constraint on the
scale of the Mi comes not from precision measurements but from the absolute scale of neutrino masses.
An unsatisfactory aspect of seesaw models pertains to a potential aggravation of the (Higgs) hierarchy
problem, which can be avoided by lowering the seesaw scale or considering supersymmetric versions of
the theory [90]. In the type II case, the presence of quartic couplings of the type φ2∆2, for instance,
would affect the running of the Higgs mass mH . Ignoring hereafter the scalar potential quartic couplings,
one sees that the type II extension introduces n′ triplet masses Mi, n′ Higgs-type couplings λi, and
12n′ − 3 Yukawa-type couplings (recall that the Y∆i are symmetric) to the parameter count, which in
the minimal case (n′ = 1) reduce to 11 undetermined high-energy quantities. A desirable feature of the
present model is the fact that it preserves the flavour structure, i.e. there is a direct correspondence
between high-energy and low-energy parameters in flavour space, as one can see from (2.71), as opposed
to the case of Eq. (2.57). Additionally, the presence of gauge interactions with triplet components opens
up new phenomenology, previously unavailable in the sterile type I scenario.
33
Type III Seesaw
One finally turns to the type III seesaw extension [91], in which n′ fermion triplets ~ΣiR with null
hypercharge are added to the SM. Taking the SU(2) generators to be those defined in (2.58), one has:
~ΣiR ≡
Σ+i R
Σ0i R
Σ−i R
∼
(3, 0). (2.74)
In order to see what extra Lagrangian terms are allowed by gauge symmetry, one refers to Eqs. (2.62)
and (2.63). From (2.62) one sees that invariantly combining two triplets ~ΣiR and ~ΣjR gives:
−MΣij
(1
2Σ0iC
R Σ0jR− Σ+
i
C
R Σ−j R
)+ H.c. ∼ 1, (2.75)
where MΣij is a symmetric n′ × n′ matrix. One can additionally extract from (2.63) an invariant combi-
nation of fields involving fermionic triplets and the doublets ˜L ≡ ε `∗L and φ = ε φ∗:
(νL lL
)(Σ0i R/√
2 −Σ+i R
Σ−i R −Σ0i R/√
2
)
︸ ︷︷ ︸≡ΣiR
(φ0∗
−φ−
)+ H.c. = `L ΣiR φ+ H.c. ∼ 1. (2.76)
In the above, one has employed φ− ≡ (φ+)∗ and a γ0 matrix has been included. Taking into account
flavour, the high-energy Lagrangian can then read [87,88]:
Ltype III = LSM + i ~ΣiR /D ~ΣiR −((
YΣ†)αi`αL ΣiR φ−+
1
2
(MΣ
ij Tr[ΣiCR ΣjR
]+ H.c.
), (2.77)
where one has made the redefinition Σ+i R → −Σ+
i R, making sense of (2.75) as a mass term: the fields
Σ0i = Σ0
iC
R + Σ0i R are given Majorana masses, whereas the Ψi ≡ Σ+
i
C
R + Σ−i R correspond to Dirac
fermions [92]. Feynman Rules can be straightforwardly obtained from the above Lagrangian. By following
an integrating-out procedure similar to that of the type I seesaw case, one arrives in this context at the
following effective neutrino mass matrix:
Mν = −YΣT v2
MΣYΣ . (2.78)
Turning to parameter counting one sees that, since MΣ is symmetric, there is enough freedom to diag-
onalize the triplet mass matrix, obtaining from this n′ parameters. Additionally, by rephasing lepton
doublets (and charged lepton singlets) one can remove n phases from the 2n′n parameters of MΣ. This
totals 7n′ − 3 undetermined high-energy quantities, as was the case for the type I seesaw mechanism.
The given canonical seesaw realizations of the Weinberg operator can be motivated by GUTs and
correspond only to a subset of possible SM extensions in which small neutrino masses are naturally
generated. Alternative mechanisms at tree-level include the inverse-seesaw model [93, 94], where the
conventional type I framework is extended through the addition of fermionic singlets whose nonzero
lepton number assignment is softly broken. Neutrino masses can also be generated radiatively, as is the
case of the Zee-Babu model [95,96], where two charged scalar singlets are added to the SM and Majorana
neutrino masses arise due to two-loop quantum corrections. Other exotic options for neutrino mass
generation include the breaking of R-parity without inducing proton decay in SUSY models [97, 98], as
well as theories with extra dimensions or expanded gauge symmetries (see [99] and references therein).
34
Lepton Mixing and
Discrete Family Symmetries 3In the previous chapter we have focused on the Standard Model of particle physics and seesaw ex-
tensions, in which naturally small neutrino masses are generated. As discussed of Section 2.1.3, lepton
mixing becomes non trivial if massive neutrinos are brought into the picture. At low-energies, significant
differences between the quark and lepton mixing patterns are experimentally observed. In particular, the
structure of lepton mixing hints towards the presence of discrete symmetries in the lepton sector, which
play a normative role in the physical theory (see Section 1.4). In the present chapter, lepton mixing
is briefly addressed and the role of family symmetries in the shaping of the mass matrices and mixing
pattern is explored. A model which generates near tribimaximal mixing, accommodating all available
neutrino data, is reviewed.
3.1 Lepton Mixing
As in the case of quarks, lepton mixing arises from the mismatch of lepton states which are massive
and those which participate in CC interactions. Assuming henceforth that neutrinos possess Majorana
masses, one sees that it is possible to rotate the fields to the mass basis through:
lαL →(V lL)αβlβL, lαR →
(V lR)αβlβR, ναL →
(V νL)αiνiL, (3.1)
since a unitary transformation V νL contains enough parameters to diagonalize the symmetric matrix
arising in the Majorana mass term (cf. Eq. (2.47) and Table 2.1). The neutrino fields with definite
masses are denoted by νi (i = 1, 2, 3). The neutrino mass matrix Mν then becomes:
V νLT Mν V νL = diag(m1,m2,m3) ≡ Dν . (3.2)
This rotation produces a misalignment in the Lagrangian term of Eq. (2.12):
L`CC →g2√
2νiL γ
µ(V νL† V lL
)iαlαLWµ + H.c. ≡ g2√
2νiL γ
µ (U†PMNS)iα lαLWµ + H.c., (3.3)
where the dagger in the definition of the lepton mixing matrix UPMNS, known as the Pontecorvo-Maki-
-Nakagawa-Sakata matrix [100–102], is purely conventional. The relation between states of definite flavour
(greek indices) and definite mass (roman indices) is given by:
ναL =(UPMNS
)αiνiL ⇔
νeLνµLντL
=
Ue1 Ue2 Ue3
Uµ1 Uµ2 Uµ3
Uτ1 Uτ2 Uτ3
ν1L
ν2L
ν3L
. (3.4)
35
ParameterResult of Global Fit with 1σ 3σ errors
Normal Ordering Inverted Ordering
∆m221
(10−5 eV2
)7.62 ± 0.19
+0.58−0.50
∆m231
(10−3 eV2
)2.53 +0.08
−0.10
+0.24−0.27
−(
2.40 +0.10−0.07
+0.28−0.25
)
sin2 θ12 0.320 +0.015−0.017
±0.050
sin2 θ23 0.49 +0.08−0.05
+0.15−0.10
0.53 +0.05
−0.07
+0.11−0.14
sin2 θ13 0.026 +0.003−0.004
+0.010−0.011
0.027 +0.003
−0.004
+0.010−0.011
δ(
0.83 +0.54−0.64
)π
any (
0.07 +1.93−0.07
)π
any
Table 3.1: Global fit results taken from Ref. [107] for the three-neutrino oscillation parameters (mass
differences, mixing angles and Dirac phase) and for both ordering possibilities (see text). At the 3σ level
there is no constraint on the value of δ (this is also true for the inverted ordering case already at 1σ).
The number of physical parameters contained in UPMNS can be determined as was done for the quark
case in Section 2.1.3 with one important caveat: due to the Majorana nature of neutrinos one has lost the
freedom to invariantly rephase the three corresponding fields. Therefore, one can only remove n phases
through the rotation of charged leptons instead of 2n − 1 from the unitary mixing matrix (the global
U(1)L is also broken), ending up, for n = 3 SM generations, with n(n − 1)/2 = 3 mixing angles and
n(n+ 1)/2− n = n(n− 1)/2 = 3 physical phases. A parametrization of the UPMNS is given by [37]:
UPMNS =
c12 c13 s12 c13 s13 e−iδ
−s12 c23 − c12 s23 s13 eiδ c12 c23 − s12 s23 s13 e
iδ s23 c13
s12 s23 − c12 c23 s13 eiδ −c12 s23 − s12 c23 s13 e
iδ c23 c13
︸ ︷︷ ︸≡VPMNS
1 0 0
0 eiα1 0
0 0 eiα2
︸ ︷︷ ︸≡KPMNS
, (3.5)
where sij ≡ sin θij and cij ≡ cos θij refer to the mixing angles θ12, θ13, θ23 ∈ [0, π/2] specific to the lepton
sector, not to be confused with those of (2.34). The δ ∈ [0, 2π[ phase is a Dirac-type phase, in an analogy
to quark mixing, while α1, α2 ∈ [0, 2π[ arise due to the Majorana character of neutrinos and are thus
called Majorana-type phases [58]. As mentioned in Section 2.2.1, lepton mixing breaks lepton flavour
numbers Li (and the total L in the Majorana case). This leads to the possibility of observing charged
lepton flavour violating processes which would arise radiatively, namely µ → e γ, µ − e conversion in
nuclei, and µ → 3 e [103]. So far, searches have yielded negative results, setting upper limits of order
10−12 on branching ratios and conversion rates [104–106].
As a consequence of lepton mixing, neutrinos produced with a definite flavour are allowed to oscillate
between flavours since they constitute quantum superpositions of massive states whose evolution in time
depends on the values of their (different) masses. Neutrino oscillations are only sensitive to the mass-
squared differences ∆m2ij ≡ m2
i −m2j , providing no information on the absolute neutrino mass scale. Since
the sign of ∆m231 is indeterminate, whereas ∆m2
21 is positive, there is room for two possible orderings of the
mass spectrum: normal ordering, with m1 < m2 < m3, and inverted ordering, for which m3 < m1 < m2.
If, furthermore, the absolute neutrino mass scale (corresponding to the mass of the lightest νi) is small
36
νe ν ντ
ν1
ν2
ν3
Figure 3.1: Depiction of lepton mixing for both a normally ordered and an inverted neutrino mass
spectrum, where the global fit data of Table 3.1 has been considered (left), to be compared with the
tribimaximal ansatz (right). The probability that a massive state νi is found as an α-flavour neutrino is
given by∣∣(UPMNS)αi
∣∣2.
enough, the spectrum can be regarded as hierarchical. One then has either a normal hierarchy, with
m1 < m2 m3, or an inverted hierarchy, for which m3 m1 < m2.
Results from the latest global fit to neutrino oscillation data [107] are summarized in Table 3.1. The
best fit values correspond to a solar neutrino mixing angle θ12 ' 34o, an atmospheric angle θ23 ' 44o
or 47o, depending on whether the mass spectrum displays normal or inverted ordering respectively, and
a reactor neutrino angle θ13 ' 9o, no longer consistent with zero at the 3σ level. The dimensionless
parameter r ≡ ∆m221 / |∆m2
31| ∼ 0.03 quantifies the hierarchy between the two mass-squared differences
∆m221 and ∆m2
31, which drive solar and atmospheric oscillations, respectively. The above results can be
given a visual interpretation through the diagram of Fig. 3.1.
Oscillation experiments are not sensitive to the Dirac or Majorana nature of neutrinos [108], and the
phases α1,2 cannot be measured through them. CP violation in the leptonic sector will depend on the
values of the three mixing phases. Thus, CP and T violations potentially measurable by neutrino oscilla-
tions will depend solely on the Dirac phase δ (Dirac-type CP violation). The experimental determination
of its value is sensitive to both the magnitude of the reactor angle and to the ordering of the neutrino
mass spectrum [80]. One gateway to the determination of the Majorana-type phases is neutrinoless dou-
ble beta decay [109], a rare process which is only possible if neutrinos are their own antiparticles, i.e. if
they are described by the Majorana formalism. The relevant model-dependent quantity for the rate of
said process is |mee|, defined as the absolute value of the element Mν11 computed in the basis where the
charged leptons have definite masses and the charged current is diagonal. It is useful, in this context
alone, to transform α2 → α2 + δ and regard the new α2 as the Majorana phase [110]. As such, one has:
∣∣mee
∣∣ =
∣∣∣∣∣∑
i
(UPMNS
)2eimi
∣∣∣∣∣ =∣∣∣(m1 c
212 +m2 s
212 e
2 i α1)c213 + m3 s
213 e
2 i α2
∣∣∣. (3.6)
37
A recent upper limit on |mee| corresponds to 0.2− 0.4 eV, given by the GERDA collaboration [111].
As was mentioned above, neutrino oscillations are not sensitive to the absolute neutrino mass scale,
but only to mass-squared differences. The value of this scale can in principle be directly obtained through
the observation of the beta decay spectrum end-point. In this case, one would measure an effective mass
mνe ≡√∑
i
∣∣UPMNS
∣∣2eim2i . Current tritium decay searches give an upper limit of mνe < 2 eV [37].
An additional (indirect) constraint on the absolute scale of neutrino masses is given by cosmological
considerations, which limit the sum mtotal =∑imi. Recent results from the Planck collaboration have
established an upper limit of mtotal < 0.66 eV (95% CL) [112] (where WMAP polarization data has
been considered). This value can however be seen to vary depending on the analysis performed. In
particular, adding baryon acoustic oscillation data to the analysis constrains the neutrino mass sum to
be mtotal < 0.23 eV (95% CL) [113].
From inspection of Fig. 3.1 it is apparent that the experimental mixing data bears a close resemblance
to the tribimaximal (TBM) pattern. The TBM hypothesis, put forward by Harrison, Perkins, and
Scott [114], corresponds to taking |Ve3|2 = 0, |Vµ3|2 = 1/2, and |Ve2|2 = 1/3, where V here denotes the
quark-like part VPMNS of the mixing matrix in (3.5). The matrix VPMNS is unitary and, under the TBM
ansatz, orthogonal, due to s13 = 0. The above assumptions therefore allow one to write:
V ′TBM =
√23
1√3
0
− 1√6
1√3
1√2
1√6− 1√
31√2
, (3.7)
where θ12 = arccos(√
2/3)' 35o, θ23 = 45o, said to be maximal1, and θ13 = 0o. Alternatively, by
making a redefinition of the Majorana phases αi → αi + π and by resorting to the rephasing of the
charged lepton fields (including a global phase eiϕ), one may write:
UTBM ≡
−eiϕ 0 0
0 −eiϕ 0
0 0 eiϕ
V ′TBM
1 0 0
0 −eiα1 0
0 0 −eiα2
= eiϕ
−√
23
1√3
0
1√6
1√3
1√2
1√6
1√3− 1√
2
︸ ︷︷ ︸≡VTBM
KPMNS, (3.8)
where KPMNS = diag(1, eiα1 , eiα2) is given in terms of the new redefined phases. The rephasing of
charged leptons, in conjunction with the Majorana phase redefinition, amounts to changing the mixing
angle ranges from [0, π/2] to [π/2, π]. The new mixing angles are then obtained through θij → π − θij .The possibility of a TBM mixing pattern motivated the study of discrete family symmetries (consid-
ered in the remainder of this chapter) which might govern the leptonic sector. However, exact TBM is
excluded by the aforementioned results on a nonzero reactor angle, given by the Daya-Bay, RENO and
Double Chooz collaborations [115–117] after hints from the T2K and MINOS collaborations [118, 119].
Nevertheless, one can still pursue the family symmetry approach, without having to resort to mass an-
archy [120]. Taking symmetry as a guiding principle, one may accommodate the above results by either
considering larger discrete groups in model building or regarding specific mixing patterns – like the TBM –
as a leading-order approximation, to be perturbed when higher-order corrections are taken into account.
1Since the νµ ↔ ντ oscillation probability is proportional, in the two-neutrino mixing approximation, to sin2(2 θ23), onesees that it is the choice θ23 = 45o which maximizes said probability.
38
3.2 Discrete Family Symmetries
Appealing to the normative role of symmetries, one sees that, in order to regulate the structure of
the leptonic mixing matrix, the symmetry operations must act on leptons of different flavours. Symme-
tries whose action is carried out in flavour space are known as flavour, family or horizontal symmetries,
to be contrasted with the case of GUT symmetries, which act on different members of the same fam-
ily/generation and are therefore vertical, in the sense of Fig. 2.1.
Family symmetries may be based on Abelian groups, such as the U(1)FN of the Froggatt-Nielsen
mechanism [121], which can account for hierarchies between masses and mixing matrix elements. For
non-hierarchical structures, however, one turns to non-Abelian symmetries, where different flavour fields
can be arranged into multiplets of the family symmetry group. One finally considers discrete (non-
-Abelian) family symmetry groups, which in general provide a simpler solution to the vacuum alignment
problem (to be defined in Section 3.2.2) than continuous groups [122]. For a review on lepton mixing and
family symmetries, we address the reader to Ref. [123].
Since family symmetries are to be imposed while respecting the underlying SM gauge group, one sees
that the fact that charged leptons and neutrinos belong to the same multiplet of SU(2)L forbids shaping
the charged lepton mass matrix Ml and the neutrino mass matrix Mν independently [124]. One therefore
resorts to a dynamical origin of the lepton Yukawa couplings adding new heavy (scalar) fields, dubbed
flavons, transforming non-trivially under the family symmetry, which is then spontaneously broken by
the flavon VEVs, at a scale typically much higher than that of EWSB.
3.2.1 Symmetries of the Mass Matrices
Due to the specific structure of the charged-lepton and neutrino mass terms, one can identify transfor-
mations of the fields in flavour space which leave those terms invariant, thus establishing the symmetries
of the mass matrices. Unless otherwise stated, we will work in a basis where both the charged-lepton
mass matrix and the CC Lagrangian are diagonal2. Thus, for charged leptons, one has in this basis:
Ml = Dl = diag(me,mµ,mτ ). (3.9)
By asking which are the symmetries of the mass matrix upon (unitary) rotations of the lepton fields in
flavour space, one sees that Ml is clearly invariant under the transformations lL → T lL and lR → T lR:
T †Ml T = Ml, (3.10)
where one has ommited flavour indices for simplicity and T corresponds to the three-dimensional matrix:
T =
1 0 0
0 ω 0
0 0 ω2
, (3.11)
where ω ≡ ei 2π/3. Hence, one identifies Z3 – the lowest-order group which constrains Ml to be diagonal
– with the symmetry of the charged lepton mass matrix3.
2This is often simply phrased as “working in the basis of diagonal charged leptons”.3In generalizing the present considerations to grand unified theories, it would be useful to consider a lepton mass matrix
only approximately diagonal, since in GUTs this matrix is related to the down-type quark matrix, which in light of quarkmixing is often taken to be non-diagonal.
39
Regarding the neutrino mass matrix, one sees that the Majorana mass term of Eq. (2.47) constrains
Mν to be symmetric, since, as was the case with (2.49), one can check that ναCL νβL is symmetric under
the exchange α ↔ β. The (bi)diagonalization of Mν is achieved through the UPMNS mixing matrix
(UPMNS = V lL†V νL = V νL in the basis we are working with):
Mν = U∗PMNS Dν U†PMNS = U∗PMNS diag(m1,m2,m3)U†PMNS. (3.12)
The matrix Mν will then be invariant under the (unitary) transformation νL → Ki νL, where
Ki = UPMNS ηi U†PMNS, η =
(13×3, diag(−,+,−), diag(+,+,−), diag(−,+,+)
), (3.13)
since:
KiT Mν Ki =(UPMNS η
i U†PMNS
)TMν
(UPMNS η
i U†PMNS
)
= U∗PMNS ηiT(UTPMNS U
∗PMNS
)Dν(U†PMNS UPMNS
)ηi U†PMNS
= U∗PMNS Dν U†PMNS = Mν .
(3.14)
Other four possible ηi matrices exist, obtained from the above by changing their sign, ηi → −ηi, but are
redundant in the description of the symmetry of Mν since a global sign cancels independently of its form.
By noticing that(η2)2
=(η3)2
= 1 and η4 = η2η3 = η3η2, one identifies the symmetry of the neutrino
mass matrix with the Klein group K = Z2 × Z2 of Section 1.2.
What was presented up to now regarding the symmetries of mass matrices is independent of the
form of UPMNS. For the particular TBM pattern of Eq. (3.8), one obtains the specific form of the Ki
transformation matrices from (3.13):
K1 = 1, K2 ≡ S =1
3
−1 2 2
2 −1 2
2 2 −1
, K3 ≡ U =
1 0 0
0 0 1
0 1 0
, K4 = S U = U S. (3.15)
Rephasing charged leptons and redefining Majorana phases would yield the same symmetry group but
different Ki, corresponding simply to a different choice of basis for the group representation.
3.2.2 Direct vs. Indirect Models
Invoking a discrete non-Abelian family symmetry, associated with a symmetry group G, one might
ask what are the available directions in which model building may proceed. The charged-lepton and
neutrino mass matrices exhibit the Z3 and K symmetries presented above, respectively, in the basis of
diagonal charged leptons. Focusing on a specific mixing pattern, one must consider matrices T , S and U
of a specific form (e.g. those of Eqs. (3.11) and (3.15)), which generate symmetry groups GT , GS and
GU , respectively [125]. One then wishes to obtain such symmetries at low-energy, after the spontaneous
breaking of the family symmetry by the VEVs of flavon fields. One obvious model building option is to
take GT , GS and GU as subgroups of G. Such class of models, where the desired symmetries of both
charged leptons and neutrinos are already present in the family group and are preserved by the flavon
VEVs, are called direct models [124].
In alternative to direct models, one may choose to work with a group which contains none of the
low-energy symmetries as subgroups, but nonetheless gives rise to them, accidentally, upon SSB. Such
40
models are termed indirect models, and the group G plays the indirect role of determining the allowed
directions for the flavon VEVs. This speaks directly to the so-called vacuum alignment problem, as the
model must account for both the GT symmetry in the charged-lepton sector, as well as the GS and GU
symmetries in the neutrino sector. These are here absent from the high-energy theory and must arise
due to a particular alignment of VEVs. Direct models are not free from the alignment problem since the
directions of the VEVs must preserve the relevant subgroups. Although the source of the flavon VEV
alignment is out of the scope of the present work, solutions of the alignment problem can be found, for
example, in the context of supersymmetric theories (see, for instance, [126]). Besides the two solutions
described above, an intermediate solution is found in semi-direct models, for which the family group G
contains some but not all of the desired subgroups generated by T , S and U .
Considering the particular case of TBM mixing, one sees that the generators take the forms given in
(3.11) and in (3.15). The simplest group which allows for exact TBM in the context of direct models
is G = S4 [127], presented in Section 1.2, as it is the smallest group which contains the TBM GT , GS
and GU as subgroups, i.e. can accomodate having the generators in the desired form in a common
basis. Regarding semi-direct models, a popular choice of G corresponds to the alternating group A4
[125,128–130], presented in the same section as above. This is not without justification, since A4 already
contains the TBM GT and GS as subgroups and is the smallest group which possesses a three-dimensional
irreducible representation. The three lepton flavours are thus naturally connected in A4-based models
as components of a single multiplet. The structure of the charged lepton mass matrix arises due to a
preserved GT ∼= Z3, while that of the neutrino mass matrix is established by both the presence of an
intact GS ∼= Z2 and of an accidental GU ∼= Z2.
In passing, we remark that extending A4-based models to the quark sector interestingly leads to
a unit mixing matrix at leading order, if one requires nondegenerate and nonvanishing quark masses
[131]. However, higher-order corrections seem to be too small to account for the observed VCKM mixing
angles [125], prompting the study of larger-order groups to achieve such unification (see, for instance,
Ref. [132]).
3.3 An A4 Model with Spontaneous CP Violation
Having established the philosophy by which one must abide, we turn, in the present section, to
the analysis of the model of Ref. [133], which implements an A4 family symmetry in a type II seesaw
framework with spontaneous CP violation. In this model, deviations to exact TBM are obtained by
perturbing the flavon VEV alignment. Aside from two real flavon fields, Φ and Ψ, which are singlets of
SU(2)L with null hypercharge, one adds two SU(2)L triplets ∆1,2 with Y = 1 and one complex scalar
field S, which is a singlet of the whole gauge group, to the particle content of the SM. Apart from the SM
gauge group and the A4 flavour symmetry, a Z4 symmetry which constrains the form of the Lagrangian
is also considered. The symmetry assignments of the fields are given in Table 3.2.
An interesting property of the present model is the absence of explicit CP violation at the Lagrangian
level, i.e. all parameters are real in the fundamental theory. The source of low and high-energy CP
41
Symmetry `L eR, µR, τR ∆1 ∆2 φ S Φ Ψ
A4 3 1,1′,1′′ 1 1 1 1 3 3
Z4 i −i 1 −1 i −1 i 1
SU(2)L × U(1)Y (2,−1/2) (1,−1) (3, 1) (3, 1) (2, 1/2) (1, 0) (1, 0) (1, 0)
Table 3.2: Representation assignments of the model fields under the action of the groups A4, Z4, and
gauge SU(2)L×U(1)Y . SM fields not covered in the present table transform trivially under A4 × Z4.
violation, namely that necessary to account for the BAU (see Chapter 4), is unique: the complex phase
of the VEV which S acquires. With the assignments of Table 3.2 in mind, one can find out which terms
are allowed in the Lagrangian of the model, the relevant part of which reads:
L = Ll + Lν − V CP×Z4 − V (Φ,Ψ), (3.16)
where Ll and Lν make up the Yukawa Lagrangian and V (Φ,Ψ) denotes the flavon part of the scalar
potential while V CP×Z4 corresponds to its remainder. The flavon-free potential is then given by:
V CP×Z4 = VS + Vφ + V∆ + VSφ + VS∆ + Vφ∆ + VSφ∆, (3.17)
which presents the following explicit form (one can straightforwardly check that these terms are the only
ones allowed by the imposed symmetries):
Vφ = µ2(φ) φ
†φ+ λ(φ)
(φ†φ
)2,
Vφ∆ =∑
i
(λ1i
(φ†∆†i∆iφ
)+ λ2
i
(φ†φ
)Tr[∆†i∆i
])+(λ2M2 φ
T ∆2 φ+ H.c.),
V∆ =∑
i
M2i Tr
[∆†i∆i
]+∑
i,j
[λ3ij Tr
[∆†i∆i
]Tr[∆†j∆j
]+ λ4
ij Tr[∆†i∆i∆
†j∆j
]],
VS = µ2S
(S2 + S∗2
)+m2
S S∗S + λS
(S∗S
)2+ λ′S
(S4 + S∗4
)+ λ′′S S
∗S(S2 + S∗2
),
VSφ = ηS(S∗S
) (φ†φ
)+ η′S
(S2 + S∗2
) (φ†φ
),
VS∆ =∑
i
Tr[∆†i∆i
](ξi(S2 + S∗2
)+ ξ′i S
∗S),
VSφ∆ = φT ∆1 φ(λ∆ S + λ′∆ S∗
)+ H.c..
(3.18)
As for the Yukawa Lagrangian, one sees that the usual SM charged lepton Yukawa term is forbidden
by the Z4 symmetry, as well as the standard type II seesaw coupling of triplets ∆1,2 to lepton doublets
(a coupling of the Higgs type, ∆φφ, is only allowed for ∆2). Instead, terms of these forms will arise as
effective operators – suppressed by a certain scale – where flavon fields and the singlet field S enter.
A detailed analysis of the structure of A4 is performed in Appendix B, where a specific basis for the
three-dimensional representation is chosen. This is the basis in which the representations of the group
elements s = (12)(34) and t = (123) (see Appendix B) simultaneously agree with the forms (3.11) and
(3.15) for the generators S and T : U3(s) = S, U3(t) = T .
In wanting to write A4-invariant terms involving lepton triplets, one must consider the Clebsch-Gordan
decomposition of the tensor product of two A4 triplet representations, 3⊗ 3 = 1⊕ 1′ ⊕ 1′′ ⊕ 3s ⊕ 3a.
42
The full computation of the (Clebsch-Gordan) coefficients which govern this decomposition is performed
in Appendix B. Due to the fact that the Majorana mass matrix is symmetric in flavour space, one restricts
oneself to products where the symmetric 3s arises. To obtain A4-invariant terms which include the barred
fields `L, one considers(`eL, `τL, `µL
), which transforms as an A4 triplet. For completeness, one presents
the invariant arising from the symmetric product of three triplets ~a, ~b and ~c, obtainable from (B.51) (a
global factor of 1/3 is included to keep in touch with the conventions of [133]):
1
3
(2a1b1c1 + 2a2b2c2 + 2a3b3c3 − a1b2c3 − a1b3c2 − a2b1c3 − a2b3c1 − a3b1c2 − a3b2c1
)∼ 1. (3.19)
Up to O(1/Λ, 1/Λ′), where Λ and Λ′ are taken to correspond to an assumed unique flavon scale and to
the scale of S decoupling, respectively, one obtains the following Yukawa Lagrangian, compatible4 with
the postulated symmetries (l1,2,3 = e, µ, τ):
Ll = −yle
Λ
(`L Φ
)1φ eR −
ylµΛ
(`L Φ
)1′′φµR −
ylτΛ
(`L Φ
)1′φ τR + H.c.,
Lν =1
Λ′∆1
(`TL C
† `L)1
(y1 S + y′1 S
∗)+y2
Λ∆2
(`TL C
† `L Ψ)1
+ H.c.,
(3.20)
where the bold superscripts indicate which field combination is chosen from the decomposition of the
triplet tensor product. Suppose now that the singlet S acquires a complex VEV 〈S〉 = vS eiα and the
flavon fields develop VEVs, which must be global minima of the scalar potential, in the generic directions
(ri, si ∈ R):
〈Φ〉 = (r1, r2, r3) , 〈Ψ〉 = (s1, s2, s3) . (3.21)
The (leptonic) Yukawa Lagrangian then becomes:
Ll = −Ylαβ `αL φ lβR + H.c. , Lν = −Y∆1
αβ `αCL ∆1 `βL −Y∆2
αβ `αCL ∆2 `βL + H.c., (3.22)
where one recognizes the familiar charged lepton Yukawa and type II seesaw terms, with:
Yl =1
Λ
r1 r3 r2
r2 r1 r3
r3 r2 r1
yle 0 0
0 ylµ 0
0 0 ylτ
, (3.23)
Y∆1 = y∆1
1 0 0
0 0 1
0 1 0
, y∆1
≡ vSΛ′(y1 e
iα + y′1 e−iα), (3.24)
Y∆2 =1
3
y2
Λ
2 s1 −s3 −s2
−s3 2 s2 −s1
−s2 −s1 2 s3
. (3.25)
Imposing the symmetries GT,S,U (with the desired generator representations T , S, and U) to be
manifest at low-energy constrains the direction of the flavon expectation values. Requiring Yl to be
symmetric under GT implies that it is diagonal and, as such, r1 ≡ r, while r2, r3 = 0. Demanding Y∆2
4This Lagrangian is not, however, the most general one allowed by the symmetries, up to the chosen order, as botha Weinberg operator term and the renormalizable ∆2
(`TL C
† `L)1
term can be included. Although the former might beremoved by a shaping symmetry, there is no possible additional symmetry which can forbid the latter without destroyingother desirable terms. Schematically, if (φφ)(∆1S), (`L`L)(∆1S), and (φφ)(∆2) are allowed, then so must (`L`L)(∆2).Such a term would modify Y∆2 by a term ∝ Y∆1 without affecting the mixing pattern, as one can see from (3.36).
43
to be symmetric under GU (often referred to as a ‘µ− τ symmetry’) yields s2 = s3 ≡ s. Finally, one can
check that for symmetry under the chosen representation of GS (commonly denoted as ‘magic symmetry’)
the constraint s1 = s arises. The VEV directions which allow for the TBM mixing pattern are thus:
〈Φ〉TBM = (r, 0, 0) , 〈Ψ〉TBM = (s, s, s) . (3.26)
One could equivalently have found this solution by requiring T 〈Φ〉 = 〈Φ〉 and S〈Ψ〉 = U〈Ψ〉 = 〈Ψ〉.
3.3.1 Spontaneous CP Violation
The spontaneous breaking of CP results from a non-trivial complex phase of the singlet VEV, 〈S〉 =
vS eiα. Assuming the scale of S decoupling to be much higher than both the electroweak and seesaw
scales, we can restrict ourselves to the VS term of the scalar potential (no induced VEVs arise) to verify
whether the potential minimization conditions allow for a non-trivial VEV S = 〈S〉 = vS eiα. For that,
we take
V0 ≡ VS∣∣S=〈S〉 = m2
S v2S + λ2
S v4S + 2
(µ2S + λ′′S v
2S
)v2S cos(2α) + 2λ′S v
4S cos(4α). (3.27)
The values of α and vS are obtained through the minimisation conditions:
∂V0
∂vS= 0
∂V0
∂α= 0
⇒
vS
[m2S + 2λS v
2S +
(2µ2
S + 4λ′′S v2S
)cos(2α) + 4λ′S v
2S cos(4α)
]= 0
v2S
[2(µ2S + λ′′S v
2S
)sin(2α) + 4λ′S v
2S sin(4α)
]= 0
. (3.28)
Since sin(4α) = 2 sin(2α) cos(2α), the lower condition defines three classes of solutions:
vS = 0 ∨ ( vS 6= 0 ∧ sin(2α) = 0 ) ∨ ( vS 6= 0 ∧ sin(2α) 6= 0 ) (3.29)
No CP violation arises from either the first (trivial) solution or from the second possibility (see Section
23.6 of Ref. [26]). One is then left with sin(2α) 6= 0, for which
m2S + 2λS v
2S + 4λ′S v
2S cos(4α)
+(2µ2
S + 4λ′′S v2S
)cos(2α) = 0
µ2S + λ′′S v
2S + 4λ′S v
2S cos(2α) = 0
⇒
vS =
√µ2S λ′′S − 2m2
S λ′S
4λS λ′S − 8λ′S2 − λ′′S
2
cos(2α) = −µ2S + λ′′S v
2S
4λ′S v2S
(3.30)
holds. Since the angle α is defined in the interval [0, 2π[, the above condition yields four possible solutions:
α, −α, π+α, and π−α (all to be taken mod 2π). One can see that the solution of Eq. (3.30) corresponds
to a global minimum in the region λS > 2λ′S > 0 ∧ λ′′S ' µS ' 0. In fact, for the CP-conserving solutions
which obey vS 6= 0 ∧ sin(2α) = 0, Eqs. (3.27) and (3.28) in this region give:
v '√
−m2S
2(λS + 2λ′S
) ⇒ V0 ' −m4S
4(λS + 2λ′S
) , (3.31)
since cos(4α) = 1−sin2(2α) = 1 for the considered class of solutions, while for the non-trivial CP-violating
solutions vS 6= 0 ∧ sin(2α) 6= 0 one gets:
vS '√
−m2S
2(λS − 2λ′S
) ⇒ V0 ' −m4S
4(λS − 2λ′S
) , (3.32)
since cos(2α) ' 0, corresponding to the four solutions: π/4, 3π/4, 5π/4, and 7π/4. Consistency mandates
that m2S < 0 for the given solutions to be viable. The assumption λS > 2λ′S > 0 selects the non-trivial
CP-breaking solution as that which minimizes the potential.
44
Parametrizing the complex singlet field as S = vS eiα + σ1 + i σ2, with 〈σi〉 = 0 ⇒ 〈S〉 = vS e
iα, one
can extract from the potential VS the mass eigenvalues for the two σi degrees of freedom. Collecting
the terms quadratic in the new fields – which amounts to taking the second derivatives of the potential
with respect to σi and σj – allows one to reach, for both the non-trivial minimization solution and the
particular parameter region considered above:
V σ1,σ2mass '
−m2S
λS − 2λ′S
[(λS + 2λ′S
)(σ2
1 + σ22
)+ 2(λS − 6λ′S
)σ1 σ2
]. (3.33)
Diagonalization of this quadratic form finally yields the positive squared masses M2σ1' −4m2
S and
M2σ2' −16m2
S λ′S/(λS − 2λ′S).
3.3.2 Neutrino Masses and Mixing
We have seen that, upon symmetry breaking, the Yukawa terms reduce to the leptonic part of the
type II seesaw Lagrangian. For the alignment condition of Eq. (3.26),
Yl =
ye 0 0
0 yµ 0
0 0 yτ
, Y∆1 = y∆1
1 0 0
0 0 1
0 1 0
, Y∆2 =
y∆2
3
2 −1 −1
−1 2 −1
−1 −1 2
, (3.34)
where ye,µ,τ ≡ r yle,µ,τ/Λ and y∆2≡ s y2/Λ. The coupling of heavy triplets to the Higgs bosons is also
available for both ∆i, since VSφ∆ produces such a term for ∆1 as soon as S acquires a VEV:
VSφ∆ = φT ∆1 φ(λ∆ S + λ′∆ S∗
)+ H.c.
SSB−−−→ λ1M1 φT ∆1 φ+ H.c., (3.35)
with λ1 ≡ vS(λ∆ eiα + λ′∆ e−iα
)/M1. Relying on the formalism already developed in Section 2.2.2, one
sees that the neutrino mass matrix reads (cf. Eq. (2.71)):
Mν =
2∑
i=1
−2λ∗i v2
MiY∆i = 2u1 Y∆1 + 2u2 Y∆2 , (3.36)
where the ui represent the triplet VEVs defined in (2.72). The differing sign of (3.36) with respect to [133]
is due to a different definition of the Yukawa couplings, y(′)1,2 → − y
(′)1,2 .
Neutrino masses can be obtained, in light of (3.2), by taking the square root of the eigenvalues of
Mν†Mν , with UPMNS as the change of basis matrix. It is useful to define beforehand the quantities zi
(> 0) and β through 2u1 y∆1≡ z1 e
iβ and 2u2 y∆2≡ z2. All information regarding CP violation is then
encoded5 in the phase β. The diagonalization procedure yields:
Dν = diag(m1,m2,m3) = diag(∣∣z1 e
iβ + z2
∣∣, z1,∣∣z1 e
iβ − z2
∣∣), (3.37)
and one finds that, as expected, the mixing matrix is of TBM form: UPMNS = UTBM, with indeterminate
ϕ and KPMNS. To obtain them, one notices that, since KPMNS is diagonal,
e−iϕK∗PMNS = ±√V TTBM Mν VTBM (Dν)−1. (3.38)
Recalling that KPMNS = diag(1, eiα1 , eiα2), and defining σ± ≡ arg(z2 ± z1 eiβ), one arrives at:
ϕ = −σ+/2 , α1 = (σ+ − β)/2 , α2 = (σ+ − σ−)/2, (3.39)
where we have restricted ourselves to the positive sign solution.
5In fact, as the mixing pattern is TBM, there will be no Dirac-type CP violation and β enters solely in Majorana phases.
45
mtotal
m1
m2
m3
Π 9 Π
8
5 Π
4
11 Π
8
3 Π
2
1
10-1
10-2
mi and mtotal HeVL
Β
Planck
Exclusion
maximum mtotal allowed by cosmology
Figure 3.2: Predicted values for neutrino masses as a function of the CP-violating angle β. The plot is
restricted to the region [π, 3π/2] since the functions mi(β) are symmetric around β = π. Due to Eqs.
(3.41) and (3.42), the absolute neutrino mass scale is bounded from below: m1 & 1.6 × 10−2 eV. The
spectrum is the most hierarchical for β = π, while it presents the highest degeneracy near β = π/2, 3π/2.
The Planck upper limit of mtotal < 0.66 eV has been considered (see Section 3.1).
As for the mass spectrum, one sees that m2 > m1 implies z2 + 2 z1 cosβ < 0, while z2 − 2 z1 cosβ is
positive for normal ordering (m3 > m2) and negative for inverted ordering (m3 < m2). Normal ordering
(considered herafter) then entails cosβ < 0 ⇒ β ∈ [π/2, 3π/2], while it is impossible to implement
inverted ordering since the above conditions demand that both cosβ and − cosβ be negative (z1,2 > 0).
By manipulating (3.37), one can express z1 and z2 as a function of β and mass-squared differences:
z2 =
√m2
1 +m23 − 2m2
2
2=
√∆m2
31 − 2 ∆m221
2
z1 =m2
1 −m23
4 z2 cosβ= − 1
2 cosβ
∆m231√
2(∆m2
31 − 2 ∆m221
). (3.40)
Taking into account the large hierarchy bewteen solar and atmospheric squared-mass differences, one
obtains the approximate solution (using the best fit data of Table 3.1):
z2 '√
∆m231
2' 3.56× 10−2 eV ⇒ z1 ' −
1
2 cosβ
√∆m2
31
2=−z2
2 cosβ' −1.78× 10−2 eV
cosβ. (3.41)
Also, since m2 = z1, one is able to express neutrino masses in terms of z1(β) and ∆m221,31:
m1 =√m2
2 −∆m221 =
√z2
1 −∆m221 , m3 =
√m2
2 −∆m223 =
√z2
1 −∆m221 + ∆m2
31, (3.42)
a result which is shown in Fig. 3.2.
In the approximation of Eq. (3.41), the Majorana phases α1,2 are given by:
σ1 ' −β
σ2 ' arctan
(1
3tanβ
) ⇒
α1 ' −β
α2 ' −1
2
[β + arctan
(1
3tanβ
)] . (3.43)
46
3.3.3 Nonzero Reactor Neutrino Mixing Angle
In order to attain an acceptable agreement with experimental data within the presented framework,
one must account for the deviation θ13 6= 0 from the TBM pattern. This is achieved by considering
perturbations to the flavon alignment of Eq. (3.26), which might arise due to non-renormalizable correc-
tions to the potential V (Φ,Ψ). In particular, one focuses one the case where only the flavon VEV 〈Ψ〉is modified, with small ε1,2, to:
〈Ψ〉 = s(1, 1 + ε1, 1 + ε2). (3.44)
This case, which is the one considered in Chapter 5 for the analysis of the viability of leptogenesis in
the present model, allows for Dirac-type CP violation, in principle observable via neutrino oscillations,
unlike what would happen if 〈Φ〉 alone were perturbed6. Perturbing the 〈Ψ〉 VEV does not affect the
charged lepton Yukawa matrix – which conveniently remains diagonal – nor Y∆1 , but modifies Y∆2 ,
yielding, according to Eq. (3.25):
Y∆2 =y∆2
3
2 −1− ε2 −1− ε1
−1− ε2 2(1 + ε1
)−1
−1− ε1 −1 2(1 + ε2
)
. (3.45)
Through the same diagonalization procedure described in the previous section, keeping terms up to
O(ε1, ε2), one arrives at the perturbed neutrino mass eigenvalues7:
m21,3 ' z2
1 +1
3
(z2
2 (3 + 2ε1 + 2ε2)± 2 z1 z2 (3 + ε1 + ε2) cosβ)
, m22 ' z2
1 . (3.46)
For small enough perturbations, |ε1,2| 1, only normal ordering is once again allowed, and Fig. 3.2
remains qualitatively unaltered. As for the mixing pattern, one obtains the following deviations to TBM:
sin2 θ12 '1
3
[1 +
2
3(ε1 + ε2)
], sin2 θ23 '
1
2
[1 +
1
3(ε1 − ε2)
], sin2 θ13 '
1
2
(ε1 − ε2
6 cosβ
)2
. (3.47)
The above result can be straightforwardly obtained in the context of nondegenerate perturbation theory
(see, for instance, section 6.1 of [86]) and by considering the approximation ∆m231 ' ∆m2
32 ∆m221.
As far as mixing phases are concerned, one can measure Dirac-type CP violation through the following
invariant [80,134], which depends on the Dirac phase δ:
JCP ≡ Im(U11 U22 U
∗12 U
∗21
)=
1
8sin(2 θ12) sin(2 θ13) sin(2 θ23) cos θ13 sin δ, (3.48)
where one has used U as short for UPMNS. For ∆m231 ∆m2
21 and at first order in ε1,2, one obtains the
result JCP ' tanβ (ε2− ε1)/36⇒ sin δ ' ± sinβ (the sign depends on the direction of the perturbation).
One is now in a position to consider experimental constraints on the perturbed version of model.
This is done in Fig. 3.3: the left part illustrates the allowed regions of the (ε1, ε2) plane, while the right
part shows the corresponding possible values of JCP. One obtains a limit 0.02 . |JCP| . 0.05 (at 3σ)
associated with the ranges π/2 . β . 5π/8 and 11π/8 . β . 3π/2, which imply 3π/8 . δ . 5π/8 or
11π/8 . δ . 13π/8.
6Such a case is explored in detail in Ref. [133], where it is found to fail in accomodating the recent experimental results.A mixed option, where both 〈Ψ〉 and 〈Φ〉 are perturbed is also possible.
7This result revises that of Eq. (40) in [133], in which a factor of 2 is missing.
47
Figure 3.3: Scatter plot of the experimentally allowed regions in the(ε1, ε2
)plane (left), where exact TBM
is seen to be excluded, and corresponding regions of the(JCP, β
)plane (right). There is no constraint
on the sign of either JCP or sinβ. Red bands denote the Planck collaboration exclusion. Perturbations
were varied in the range [−0.2, 0.2]. Uncertainty intervals were obtained from Table 3.1.
Π 9 Π
8
5 Π
411 Π
83 Π
2
1
10-1
10-2
10-3
ÈmeeÈ
Β
TBM case
Perturbed case
ÈmeeÈ upper limit from GERDA
Figure 3.4: Values for the neutrinoless double beta decay parameter∣∣mee
∣∣, in the exact TBM and
perturbed cases, as a function of β (the region [π/2, π] is redundant). Both ε1 and ε2 were varied in
the range [−0.2, 0.2]. Red bands correspond to Planck and GERDA exclusions, while the green band
indicates the (aproximate) region of survival of the model, obtained from Fig. 3.3.
Finally, one focuses on the Majorana-type phases, which impact neutrinoless double beta decay.
Since the perturbations do not affect the Mν11 matrix element at first order (cf. (3.45)), the quantity
∣∣mee
∣∣ as a function of z1, z2 and β coincides, in the perturbed model, with that of the TBM case:∣∣mee
∣∣TBM
=∣∣z1 e
iβ + 2 z2/3∣∣. However, the relationship (3.41) between the mi and the zi is modified8:
z1 ' −1
2 cosβ
√∆m2
31
2, z2 '
[1− 1
3
(ε1 + ε2
)]√
∆m231
2, (3.49)
which reduces to the TBM case when ε1,2 → 0, as expected. The allowed values for the parameter∣∣mee
∣∣
as a function of the model phase β are presented in Fig. 3.4.
This perturbed version of the model will be reconsidered later on, in Chapter 5, where the viability
of implementing leptogenesis within its context is analysed.
8This once more revises a result given in [133].
48
Baryogenesis through
Leptogenesis 4We have so far considered how to extend our current knowledge of particle physics to accomodate the
experimental observation of nonzero neutrino masses. One might now wonder if the theory is sufficient to
explain the observed imbalance between matter and antimatter. In this chapter, the ingredients needed to
produce a nonzero baryon asymmetry are presented. We will recall why the SM is insufficient to account
for the observed BAU, and how seesaw extensions can provide a natural solution to this problem through
thermal leptogenesis. With this purpose, we start by briefly surveying some topics from the domains of
cosmology and statistical physics, which present themselves as relevant to the remainder of this work.
4.1 Topics of Cosmology and Thermodynamics
4.1.1 Cosmological Inflation
In the Standard Model of Cosmology (SMC), space-time is described by the Friedmann-Robertson-
-Walker metric (FRW) [135]:
ds2 = dt2 −R(t)2
[dr2
1− k r2+ r2
(dθ2 + sin2 θ dθ dφ2
)], (4.1)
which describes an isotropic space of constant (normalized) curvature k, and where (t, r, θ, φ) are dimen-
sionless comoving coordinates. The present cosmological data is consistent with a flat Universe [136],
corresponding to k = 0. The variable R(t) denotes the cosmic scale factor and its evolution is determined
by the Friedmann equation, where H(t) ≡ R(t)/R(t) is the Hubble parameter:
H(t)2 =8πG
3ρ(t)− k
R(t)2+
Λ
3. (4.2)
In this equation, Λ represents the cosmological constant, G Newton’s gravitational constant and ρ(t) the
total energy density of matter and radiation in the Universe at cosmic time t.
Cosmological inflation [137,138] is invoked in order to explain the flatness of the present-day Universe,
as well as to solve the horizon1 and GUT monopole problems. According to the inflationary scenario, the
scale factor of the Universe underwent an exponential increase in the early Universe. During this period,
the energy density ρ(t) remained approximately constant so that H ' const., implying R(t) ∝ exp(H t).
1The horizon problem pertains to the observation of a homogeneous and isotropic Universe, in line with the cosmologicalprinciple but in contradiction with what is expected without inflation. Inflation allows for the expansion of a region whichis causally connected (and thus allowed to thermalize in the early Universe) to the size of the observable Universe.
49
Furthermore, the density of any conserved charge scaled with the inverse of the volume of the Universe,
leading to:
nB = nb − nb ∼ R(t)−3 , (4.3)
for the density of baryonic charge nB defined in Section 1.5.2. Since successful inflation demands a number
H t & 70 of exponential folds [139], primordial asymmetries are extremely diluted. One should thus have
a great imbalance in baryon number from the start in order to survive this dilution, in contradiction with
the assumption of a constant matter energy density during inflation. As this is an unfruitful solution,
one turns to mechanisms which dynamically generate the observed baryon asymmetry.
4.1.2 Equilibrium Thermodynamics
In order to describe the evolution of early Universe particle content after inflation has taken place (but
prior to EWSB), one turns to the domain of statistical physics. To each particle species i one associates a
phase-space distribution fi(~p, ~x, t), which is taken to depend solely on energy E and time t, fi ≡ fi(E, t),as a consequence of the assumed homogeneity (no dependence on position) and isotropy (no dependence
on the direction of linear momentum) of the early – hot and dense – Universe plasma. The internal
degrees of freedom gi for a certain particle species (of mass mi) appear explicitly in its expressions for the
number density ni, energy density ρi and pressure pi, which are given as a function of the phase-space
distribution by:
ni ≡gi
(2π)3
∫d3p fi
(Ei(~p)
), (4.4)
ρi ≡gi
(2π)3
∫d3pEi(~p) fi
(Ei(~p)
), (4.5)
pi ≡gi
(2π)3
∫d3p
|~p |23Ei(~p)
fi(Ei(~p)
), (4.6)
where energy depends only on the magnitude of the momentum, Ei(~p) = Ei(|~p|) =√|~p|2 +m2
i (we have
omitted time dependence). The non-trivial formula for pressure has been derived in the context of kinetic
theory (see, for instance, section 4.2 of Ref. [140]). For a species of spin si, one has gi = 2si+1 if mi > 0,
gi = 2 if mi = 0 and si > 0, and gi = 1 if2 both mi = 0 and si = 0 [141].
A situation of kinetic equilibrium, which is typically enforced through scatterings in the plasma, is
defined by the possibility of having the different fi cast into the form (units of kB = 1 are considered):
fi(E(~p)
)=
1
exp[(Ei(~p)− µi) /Ti
]± 1
, (4.7)
corresponding to Fermi-Dirac (‘+’ sign) and Bose-Einstein (‘−’ sign) quantum distribution functions,
associated with fermions and bosons, respectively. In the cases for which the exponential in the denom-
inator overshadows the ±1, the above distribution reduces to the known (classical) Maxwell-Boltzmann
one. The uncertainty in the fi(E(~p)
)is thus encoded into a single time-dependent parameter, the chem-
ical potential µi, whose value is fixed by the interactions if the system is in chemical equilibrium [142].
In such a case, the interaction i + j → k + l implies that the relation µi + µj = µk + µl holds. We will
2In the context of a chiral description, each chiral fermionic component ψR,L has gi = 1.
50
henceforth use the superscript ‘eq’ to refer to situations of (local3) thermal equilibrium, achieved when
both kinetic and chemical equilibria conditions are verified.
Neglecting quantum aspects of the distribution, the kinetic equilibrium density ni reduces to:
ni = eµi/Tigi
(2π)3
∫d3p e−Ei/Ti =
gi eµi/Ti
(2π)3
∫∫dΩ
∫e−Ei/Ti |~p|2 d|~p|, (4.8)
where∫∫
dΩ = 4π since there are no angular dependences. Omitting the index i and defining z ≡ E/T and
x ≡ m/T allows one to perform a change of integration variables, yielding (notice that |~p| d|~p| = E dE):
n e−µ/T =g
2π2
∫ ∞
m
e−E/T E |~p| dE =g T 2
2π2
∫ ∞
x
e−z z |~p| dz =g T 3
2π2
∫ ∞
x
e−z z√z2 − x2 dz. (4.9)
The above can be expressed in terms of a second-kind modified Bessel function of order 2. The modified
Bessel function of the second kind of order ν, Kν , can be found on page 376, section 9.6.23 of the
Abramowitz and Stegun Handbook of Mathematical Functions [144] and is given by (for Re (ν) > −1/2
and for |arg(y) | < π/2):
Kν(y) ≡√π(y/2)ν
Γ(ν + 1/2)
∫ ∞
1
e−yt(t2 − 1)ν−1/2 dt, (4.10)
where Γ represents the gamma function, which extends the factorial to complex arguments. We will be
interested in cases for which y ∈ R and ν ∈ N, in particular ν = 1, 2(Γ(3/2) =
√π/2, Γ(5/2) = 3
√π/4
).
For the case ν = 2, one then has:
K2(y) =y2
3
∫ ∞
1
e−yt(t2 − 1)3/2 dt =1
y2
∫ ∞
y
e−t t√t2 − y2 dt. (4.11)
This last result has been obtained through the change of variables y t→ t and integration by parts. One
then concludes that a classical equilibrium distribution for a particle of mass mi is given by:
ni =gi T
3i
2π2eµi/Ti x2
i K2(xi)mi→ 0−−−−−→ ni =
gi T3i
π2eµi/Ti , (4.12)
where the massless (or relativistic) limit, Ti mi implies ximi/Ti → 0, has been also considered.
The opposite, non-relativistic limit Ti mi for the number density can be obtained by looking at the
behaviour the Bessel function as x→∞:
K2(x)x→∞−−−−−→ e−x
[√π
2
√1
x+O
(1
x3/2
)], (4.13)
leading to, for Ti mi:
ni = gi
(mi Ti
2π
)3/2
exp[−(mi − µi
)/Ti
], (4.14)
from which one can see that the density for a non-relativistic species is Boltzmann suppressed.
Considering quantum corrections to the relativistic species density of (4.12) results in:
ni =
ζ(3)gi T
3i
π2(Bose-Einstein)
3
4ζ(3)
gi T3i
π2(Fermi-Dirac)
, (4.15)
3Thermal equilibrium will never be attained globally but only locally since there is no timelike spatially constant Killingvector in the FRW metric [143]. For a slow enough expansion, one can ignore this technicality.
51
where one has also considered the limit µi Ti (see, for example, Appendix C of Ref. [141]), valid as
long as one ignores scenarios with degenerate fermions or Bose condensation. The number density for
photons, for which gγ = 2 (two polarizations) and µ(eq)γ = 0 (photon number is not conserved), is thus
given by:
nγ =2 ζ(3)
π2T 3, (4.16)
where T denotes the photon temperature, often simply refered to as the temperature of the Universe,
since it describes the thermal equilibrium state of the early Universe plasma.
For non-relativistic particles, on the other hand, polylogarithmic corrections introduced by quantum
corrections can be ignored, as the quantum expression for the number density matches the classical
expression (4.14), at first order in mi/Ti.
We can now make Eq. (1.7) sensible, as it results from the ratio between the (thermal, Ti = T )
equilibrium densities (4.14) and nγ given in Eq. (4.16):
neqi
nγ=gi√π
25/2
(mi
T
)3/2
exp[−(mi − µeq
i )/T]' gi
√π
25/2
(mi
T
)3/2
exp(−mi/T
), (4.17)
where the approximation is valid in the limit µeqi /T 1.
4.1.3 Expansion, Entropy and Degrees of Freedom
In what follows, we will be interested in the radiation dominated era of the thermal history of the
Universe (T & 1 eV), for which the total pressure and energy density are related by the equation of state
p = ρ/3. To determine the dependence of the Hubble parameter on the temperature T , one turns to the
Friedmann equation. For a flat metric, with no cosmological constant, Eq. (4.2) reduces to:
H(T )2 =8πG
3ρ(T ) ⇒ H(T ) =
√8πG
3
√ρ(T ) . (4.18)
The dominant contribution for the total energy density comes from relativistic species, for which:
ρi(T ) =
π2
30gi T
4i (Bose-Einstein)
7
8
π2
30gi T
4i (Fermi-Dirac)
. (4.19)
Hence, the total energy density ρ(t) is given by:
ρ(T ) =∑
i
ρi(T ) =
[ ∑
bosons
gi
(TiT
)4
+7
8
∑
fermions
gi
(TiT
)4]π2
30T 4 ≡ g∗(T )
π2
30T 4. (4.20)
The dependence of the total number of relativistic degrees of freedom g∗ on temperature may be ignored
and, therefore, g∗ is taken to be constant at high enough energies. Summing over SM species, which
are all relativistic for temperatures T & 100 GeV (Ti ' T ), one obtains g∗ = 106.75. By considering√G = m−1
Pl , where mPl ' 1.22× 1019 GeV is the Planck mass, one then has:
H(T ) =
√8π3
90g
1/2∗
T 2
mPl' 1.66 g
1/2∗
T 2
mPl. (4.21)
52
As a consequence of the first law of thermodynamics and (4.18), the equation of state p = ω ρ implies
ρ ∝ R−3(1+ω) and R ∝ t2(1+ω)/3, and so, in the radiation dominated era, R ∝√t. This allows one to write
the Hubble parameter as a function of time, as well as to relate cosmic time with photon temperature:
H(t) =1
2t⇒ t(T ) ' 0.301 g
−1/2∗
mPl
T 2. (4.22)
It is useful to introduce the entropy density s, defined as the ratio between the entropy per comoving
volume S and the physical volume V = R3, i.e.
s ≡ S
R3=∑
i
pi + ρiTi
=2π2
45
[ ∑
bosons
gi
(TiT
)3
+7
8
∑
fermions
gi
(TiT
)3]T 3 ≡ 2π2
45g∗S T
3, (4.23)
where the equation of state p = ρ/3 and Eq. (4.19) have been considered. Together with Eq. (4.16), the
above relation establishes the relation s =(π4/(45 ζ(3))
)g∗S nγ ' 1.8 g∗S nγ . For the early periods of
the history of the Universe we are interested in, g∗S can be replaced by g∗. In the absence of entropy
production (S = const.), s(t) ∝ R(t)−3, which implies that the ratio Yi ≡ ni/s remains constant in
the absence of production or destruction of i-species particles, as the number of particles in a comoving
volume element, Ni ≡ niR3, is then proportional to ni/s. The ratio of number density per entropy
density, Yi, will then be a useful quantity to track the evolution of a particle species. One is free to write
ni/nγ ' 1.8 g∗S Yi. At present times, the seas of decoupled photons and neutrinos determine the value
g∗S = 3.9, implying ηi ≡ ni/nγ ' 7.04Yi. Constant entropy per comoving volume additionally implies
that g∗S T3R3 remains constant during the expansion of the Universe, and so T ∝ g
−1/3∗S R−1. In the
case of constant g∗S , the relation T ∼ R−1 follows.
We will henceforth consider that the assumption of kinetic equilibrium entails the equality of all Ti
governing the distributions (4.7) of particles in the thermal bath which, in particular, will match the
photon temperature T .
4.1.4 Brief Thermal History of the Universe
Significant events in the evolution of the Universe have been driven by departures from thermal equi-
librium. As a rule of thumb, one may consider an interaction to be out of equilibrium if the corresponding
rate Γ is not fast enough to accompany the expansion of the Universe4, i.e. Γ H(T ). If this is the case,
interactions freeze-out and the particle species decouples from the plasma. On the contrary, if interactions
are fast, Γ H(T ), one takes the corresponding particle species to be in (thermal) equilibrium. The
non-trivial case is located in between, with Γ ∼ H(T ), and demands a more careful and quantitative
treatment, which relies on the Boltzmann transport equation (to be presented in Section 4.5).
The central events in the thermal evolution of the Universe are summarized in Fig. 4.1. Following the
inflationary era, a baryon asymmetry must have developed prior to (e.g. GUT baryogenesis or leptogenesis
scenarios) or in coincidence with (electroweak baryogenesis) the electroweak phase transition (EWPT),
which occurred for temperatures of the order of T ∼ 100 GeV. As the temperature decreased, more and
more SM species became non-relativistic, contributing to the decrease of both g∗ and g∗S . After the
4The case of a massless species which has decoupled from the plasma presents an exception, as such a species will retainan equilibrium distribution (with a naturally different temperature from that of the plasma) even after the decoupling.
53
Figure 4.1: Brief thermal history of the Universe. The family symmetry breaking scale is often taken to
be of the order of the GUT scale. The location of the seesaw scale, which coincides with that of thermal
leptogenesis (Section 4.4), can vary greatly depending on the model implementation.
quantum chromodynamics (QCD) phase transition, which is believed to occur for temperatures around
200−400 MeV [145], the existing baryon asymmetry was fed into the process of Big Bang (or primordial)
nucleosynthesis (BBN), corresponding to the formation of light nuclei for 10 MeV & T & 100 keV. All
excess nucleons and antinucleons would have annihilated for temperatures below T ∼ mπ ∼ 100 MeV,
while electrons and positrons did so at a lower scale, T ∼ 2me ∼ 1 MeV. At even lower temperatures,
T ∼ 0.1 eV, nuclei and free electrons were brought together to form neutral atoms, in what is misleadingly
referred to as recombination. Atom formation was accompanied by the lowering of the number of free
electrons, allowing for the decoupling of photons, which made up the cosmic microwave background,
characterized at present times by a temperature T 0γ ' 2.73 K. One might wonder about the origin of the
excess of electrons over positrons if a mechanism focused solely on the generation of a baryon asymmetry
is considered. Such excess may arise through weak interactions (fast at high-energies), without affecting
total lepton number – part of which may be stored in yet unobserved neutrino seas – or the apparent
electric charge neutrality of the Universe.
4.2 The Sakharov Conditions
From the discussion of Sections 1.5 and 4.1.1 one concludes that, in order to explain the present
BAU, a baryon number asymmetry must be dynamically generated after inflation has taken place. In
fact, taking an initial state of zero baryon number, B = 0, a baryon asymmetry can be generated if the
following sufficient conditions, presented by Andrei Sakharov in 1967 [146], hold:
• B symmetry is violated.
• C and CP symmetries are violated.
• Some departure from thermal equilibrium occurs.
54
Although the Sakharov conditions are not necessary conditions for the BAU generation [147], trying to
get around any one of the three is generally not easy and might require troublesome assumptions, such
as CPT violation, which is associated to the breaking of Lorentz invariance (see Section 1.3.3).
B Violation
The baryon number operator is given, as a function of time, by [57]:
B(t) =1
3
∑
quarks q
∫d3x :q†(~x, t) q(~x, t) : , (4.24)
where q(~x, t) corresponds to the quark field operator and colons refer to Wick ordering. From the
discussion of Section 1.5.3, it is clear why B-violating interactions are needed for the generation of any
baryon asymmetry. Although no direct experimental proof for non-conservation of B exists yet, grand
unified theories and standard electroweak theory demand it at sufficiently high energies.
C and CP Violation
In order to understand the need for both C and CP violation, one must consider the action of these
transformations on the B operator. Adopting standard conventions for the phases, one has:
P q(~x, t) P−1 = γ0 q(−~x, t), C q(~x, t) C−1 = i γ2 q†(~x, t), T q(~x, t) T−1 = −i q(~x,−t) γ5 γ0γ2, (4.25)
where P , C and T are the operators which carry out the parity, charge conjugation and time reversal
operations, respectively. Taking into account the unitarity of P and C and the anti-unitarity of T , one
easily obtains the transformations for q†:
P q†(~x, t) P−1 = q†(−~x, t) γ0, C q†(~x, t) C−1 = i q(~x, t) γ2, T q†(~x, t) T−1 = −i γ2γ0 γ5 q†(~x,−t).(4.26)
Considering that, since quarks are fermions, :q q† : = − :q† q : holds, one obtains:
P B P−1 = B , C B C−1 = −B , CP B (CP )−1 = −B, (4.27)
CP T B(0) (CP T )−1 = −B(0). (4.28)
This shows that the B is odd under C and CP transformations and, thus, a nonzero expectation value of
B, demands the presence of both C and CP violation.
Departure from Equilibrium
The expectation value of the B(t) operator is given by:
⟨B(t)
⟩= Tr
[ρD(t) B(t)
], (4.29)
where ρD(t) denotes the density operator. In the Heisenberg representation, one has:
B(t) = eiHt B(0) e−iHt. (4.30)
55
The third Sakharov condition can be understood by considering a scenario of thermal equilibrium at a
temperature T , for which ρD = exp(−H/T )/Z, with a partition function Z = Tr[exp(−H/T )
]. Using
the above relations and considering an equilibrium scenario, one obtains:
⟨B(t)
⟩T
=1
ZTr[e−H/T B(t)
]=
1
ZTr[e−H/T eiHt B(0) e−iHt
]
=1
ZTr[e−H/T B(0)
]=⟨B(0)
⟩T,
(4.31)
where the cyclic nature of the trace and the possibility of commuting the exponential operators have been
used. Assuming a CPT invariant Hamiltonian, CP T H (CP T )−1 = H, and taking into account (4.28)
and (4.31), one sees that:
⟨B(t)
⟩T
=⟨B(0)
⟩T
=1
ZTr[e−H/T B(0)
]
=1
ZTr[(CP T )−1(CP T ) e−H/T (CP T )−1(CP T ) B(0)
]
=1
ZTr[(CP T ) e−H/T (CP T )−1(CP T ) B(0) (CP T )−1
]
=1
ZTr[e−H/T
(−B(0)
)]= −
⟨B(0)
⟩T.
(4.32)
Hence, the expectation value of the baryon number operator vanishes in thermal equilibrium and so, even
in the presence of B, C and CP violating interactions, no asymmetries can be generated.
Naıvely, one might expect that the reason the two first Sakharov conditions do not suffice is because
inverse decays would wash out the asymmetries generated by decays if thermal equilibrium is imposed.
However, this is not the case. In fact, decays and inverse decays both push the asymmetry in the same
direction. Other scattering processes must then be responsible for the washout, which is to be expected
from the unitarity of the QFT scattering matrix. A departure from equilibrium conditions is then needed,
and corresponds to the third and final requirement considered for the BAU generation.
4.3 Is the SM Enough?
The successful generation of the observed BAU depends on the nature of the model under considera-
tion. Despite its great successes, the Standard Model of particle physics seems to fall short in providing
viable conditions for baryogenesis. In fact, as we shall now verify, all necessary Sakharov ingredients are
avaliable in the SM, but cannot satisfactorily account for the value of η ∼ 10−10 given in Eq. (1.6).
B + L Violation
The SM Lagrangian is invariant under global rephasing of both quark and lepton fields. At the classical
level, one can associate to these transformations the currents JBµ (x) and JL
µ (x), obeying ∂µJB,Lµ (x) = 0.
This guarantees the conservation of the quantum numbers (baryon and lepton numbers) associated to
the charge operators B =∫d3xJB
0 (x) and L =∫d3xJL
0 (x). However, these accidental SM symmetries
are broken beyond the classical field aproximation as the JB,Lµ are no longer conserved due to the Adler-
-Bell-Jackiw chiral anomaly [148,149]. In the present case, one obtains the anomalous divergences [57]:
∂µJBµ = ∂µJL
µ = n
(g2
2
32π2Ai
µνAiµν −
g2Y
32π2Bµν Bµν
), (4.33)
56
≠`ieL
`iµL`iτL
qj1L
qk1L
ql1L
qj2L
qk2L
ql2L
qj3L
qk3L
ql3L
Figure 4.2: Schematic representation of the vacuum structure of the electroweak theory (left) and effective
diagram for the transition between vacua (right). Esphaleron corresponds to the height of the energy
barrier. Vacuum states are assigned an integer topological charge NCS. In the 12-fermion diagram,
numbers denote generations, quark line colours refer to the SU(3)c group, and roman indices pertain to
isospin (not all combinations are possible, as the amplitude is proportional to εij εkl).
where n = 3 is the number of SM fermion generations and the tilde represents the dual of a field strength
tensor, Fµν ≡ εµναβFαβ/2 (the convention ε0123 = +1 is considered). The above implies that B and
L (and consequently B + L) are violated – the SM satisfies the first Sakharov condition – whereas the
combination B − L is conserved. This is clear from (4.33) since one can construct a conserved current
JB−Lµ ≡ JB
µ − JLµ with ∂µJB−L
µ = 0. Due to cancellations, the gauge symmetry group of the SM is not
anomalously broken and the theory remains renormalizable [150].
The non-Abelian character of the gauge theory translates into a non-trivial topological structure,
which includes an infinite number of ground states to which a topological charge, the Chern-Simmons
winding number NCS ∈ Z [151], is associated. The non-conservation of baryon and lepton numbers may
be visualized as transitions between vacua (leftmost diagram of Fig.4.2), where the meaningful quantity
is the difference between charges, ∆NCS. A selection rule applies:
∆B = ∆L = n∆NCS = 3 ∆NCS . (4.34)
Such transitions may occur via the instanton field, which is associated to a tunneling scenario between
adjacent vacua: ∆B = ∆L = ±3. At present day collision energies, however, these processes are expo-
nentially suppressed by a factor of e−16π2/g22 ∼ 10−160 [152].
An alternative way of switching between vacua is due to the sphaleron field [153, 154] (see rightmost
diagram of Fig. 4.2): if the temperature of the thermal bath to which the SM fields are coupled is
high enough (T > TEW ∼ 100 GeV), (B + L)-violating processes may occur as the barrier Esphaleron
of Fig. 4.2 can be surpassed by thermal fluctuations. Comparing the rate of these (B + L)-violating
reactions, Γsphalerons, with the expansion rate of the Universe, one concludes that they are in thermal
equilibrium for temperatures up to ∼ 1012 GeV [155]. Thus, the SM presents a mechanism for baryon
number violation which in the early Universe is fast and unsuppressed.
57
≠
»
Figure 4.3: Diagram for the expansion of a ‘true vacuum’ bubble. The imbalances produced in CP
violating interactions near the wall will be converted into a baryon asymmetry by sphaleron processes
(still thermal ouside).
The Electroweak Phase Transition
Skipping ahead to the third Sakharov condition, one can make two key points regarding the departure
from thermal equilibrium in a SM baryogenesis scenario. First, that such a departure must occur during
the electroweak phase transition (T ∼ TEW ), and second that a first-order phase transition is required
for sucessful baryogenesis. The claim that baryogenesis within the SM corresponds to a particular case
of electroweak baryogenesis can be understood by considering that for T > TEW the reaction rates
for SM interactions are much larger than H(T ), and so no deviations from thermal equilibrium occur.
Additionally, since the particle content of the model is massless above the EWPT, no CP-violation effects
are observed for such temperatures (CP is conserved due to quark mass degeneracy [26]).
A first-order EWPT proceeds as follows [156]: as the Universe cools, it reaches a critical temperature
Tc at which both the formation and expansion of ‘true vacuum’ bubbles with a nonzero Higgs VEV
take place. Particles in the Universe plasma interact with the bubble walls, as illustrated in Fig. 4.3,
generating an asymmetry in some quantum number, which is then carried back to the unbroken space
(where the Higgs VEV is still zero). In this space, sphalerons will be responsible for converting the
produced asymmetry into a baryon number asymmetry. As the bubble expands, it sweeps regions where
a baryon number imbalance is present and freezes it, since the rate of sphaleron reactions inside the bubble
is highly suppressed5. In the above description, the Higgs VEV plays the role of an order parameter.
Were the EWPT second-order (or not strong enough) and sphaleron processes would still be active after
the transition, washing away any asymmetry that was potentially generated. The order of the EWPT
is highly dependent on the physical Higgs mass, mH . In particular, for mH & 80 GeV [157] (as is the
case) the phase transition is already second-order. This analysis thus rules out the SM ‘as is’ as a viable
candidate in explaining the observed BAU, independently of any CP violation mechanism which may be
present within the model.
5The condition of high suppression of sphaleron effects corresponds to a so-called strong first-order phase transition.
58
Not Enough CP
We now address the remaining Sakharov condition. Setting aside the problem of whether or not the
EWPT is first-order, one can estimate the strength of CP violation in the SM through the measure dCP ,
constructed from rendering the invariant (m2t−m2
c)(m2t−m2
u)(m2c−m2
u)(m2b−m2
s)(m2b−m2
d)(m2s−m2
d)JCP
dimensionless by dividing it by the 12th power of TEW . One obtains dCP ∼ 10−19, which should also
be an estimate for the matter-antimatter asymmetry generated in the interactions between SM particles
and the expanding bubble wall. The fact that this value is orders of magnitude away from η ∼ 10−10
constitutes an argument for why the amount of CP violation present in the SM is insufficient for successful
baryogenesis. Although there has been some debate over the validity and possible enhancement of the
presented estimate [158, 159], it is a general consensus that one has to search for extra, non-standard
sources of CP violation. Nevertheless, since one has established that the phase transition is not first-
-order, this remains a moot point in the context of electroweak baryogenesis.
4.4 Thermal Leptogenesis
The Standard Model appears to be insufficient in explaining the presently observed BAU in light of
the Sakharov conditions. However, one may consider extensions to the SM, such as supersymmetry or
the addition of a second Higgs doublet, which provide additional sources of CP violation and guarantee
that the electroweak phase transition is strongly first-order. In particular, two Higgs doublet models
are found to provide a sufficiently enlarged parameter space to accomodate the desired phase transition
strength [160]. For the case of supersymmetric models, a large enough mass splitting between the two
stops is required for the EWPT to be first-order.
Aside from electroweak baryogenesis and other exotic scenarios [147] such as the Affleck-Dine mech-
anism [161], associated with the rolling of a scalar field (possibly a superpartner of an SM fermion), an
alternative class of baryogenesis scenarios presents itself if one considers the out-of-equilibrium decays of
considerably massive particles in the very early Universe, T TEW . One such particularly interesting
scenario is that of leptogenesis [162], in which the decays of heavy particles – such as the seesaw medi-
ators of Section 2.2.2 – violate lepton number L. Lepton-number asymmetries would then be converted
into baryon-number asymmetries by electroweak sphaleron processes, whereas the required CP violation
would arise from complex couplings in the heavy particle interaction Lagrangian. We will henceforth
focus on the case of thermal leptogenesis, meaning that the heavy particles are produced thermally, fol-
lowing reheating, by scattering processes in the plasma. The additional assumption of a hierarchical mass
spectrum for the added heavy species is taken6, implying that the significant contribution to the BAU
comes from the decays of the lightest of such particles: asymmetries produced by the heavier ones are
subject to erasure through interactions involving the lightest.
Having accounted for possible deviations from thermal equilibrium due to the extreme mass of the new
states and the expansion of the Universe, one turns to the remaining Sakharov conditions. If the decays of
6Such an assumption might conflict with the requirement of low reheating temperatures included in the solution of the so--called SUSY gravitino problem. One way to avoid this incompatibility is to turn to resonant leptogenesis scenarios [163,164],which can accomodate low-scale leptogenesis with small mass splitings between the new particles.
59
Figure 4.4: Effect of electroweak sphalerons on the quantum numbers B and L. For a certain amount of
existing B−L, unaffected by sphalerons, the plasma evolves along one of the thin diagonal lines until the
condition of (4.35) is met.
the new species violate L alone, then B−L is also broken. This is a crucial aspect of baryogenesis models
operating at high scales T TEW , as electroweak sphalerons (which tend to wash out all asymmetries)
cannot erase B−L, thus classified as a non-thermalizing mode (see Fig. 4.4). For a second-order EWPT,
the following relation between lepton and baryon numbers at the electroweak scale is imposed by sphaleron
interactions in equilibrium [165,166]:
YB =12
37YB−L ⇒ YB = −12
25YL = −0.48YL . (4.35)
Next, we discuss the remaining Sakharov condition, namely CP violation.
4.4.1 CPT, Unitarity and CP Asymmetries
In the context of QFT, to the transition between asymptotic states (or sets of states) i and j, one
associates a quantum amplitude M(i → j), to be calculated following the Feynman rules of the theory.
Denoting the CP-conjugates of i and j by i and j, respectively, CPT invariance implies [141]:
CPT invariance: M(i→ j) =M(j → i), (4.36)
whereas the imposition of CP invariance translates simply to:
CP invariance: M(i→ j) =M(i→ j). (4.37)
By joining the above conditions, one obtains a relation valid under T invariance:
T invariance: M(i→ j) =M(j → i). (4.38)
Finally, the unitarity of the scattering matrix in QFT (also known as the S-matrix) implies7:
∑
j
|M(i→ j)|2 =∑
j
|M(j → i)|2 CPT−−−−→∑
j
|M(i→ j)|2 =∑
j
∣∣M(i→ j)∣∣2 , (4.39)
7 One is presently ignoring the effects of quantum statistics. A more general treatment is presented, for instance, inAppendix A.2 of [141], where Pauli blocking and Bose enhancing factors enter the above expression.
60
`
`
Mone-loop
X = X
`
`
Mtree
X
Figure 4.5: Tree-level and one-loop diagrams for the process X → ` ` whose interference generates a CP
asymmetry when compared to the conjugate process. The higher-order diagram is illustrative and may
represent either a wave or vertex correction.
where a sum over all possible states or sets of states j has been considered (such sum includes any particle j
as well as its CP-conjugate j). The above relations allow one to parametrize the CP asymmetries between
different decay modes of a heavy state, X, with mass M . Suppose X = X and X have two distinct decay
modes, namely X → ` ` and X → ` `. One can then write:
|M(X → ` `)|2 = |M(` `→ X)|2 = (1 + ε/2) |M0|2,
|M(X → ` `)|2 = |M(` `→ X)|2 = (1− ε/2) |M0|2,(4.40)
where |M0|2 is the value |M|2 would take in the absence of a CP asymmetry ε. From Eq. (4.40) and
from the fact that the two-body decay rate Γ of a particle obeys8 Γ ∝ |M|2, with the proportionality
constant depending solely on particle masses, one obtains:
ε = 2Γ(X → ` `)− Γ(X → ` `)
Γ(X → ` `) + Γ(X → ` `). (4.41)
The expressions given above can be straightforwardly generalized. An additional consequence from the
unitarity of the S-matrix is that squared absolute values of amplitudes for CP-conjugate processes may
only differ beyond the tree-level computation. In fact, the first-order contribution towards a nonzero CP
asymmetry is found in the interference between the tree-level and the one-loop correction diagrams (see
Fig. 4.5). One has, for coupling constants of order y:
Γ ∝ |Mtree +Mloop +O(y5)|2 = |Mtree|2 +M∗treeMloop +MtreeM∗loop +O(y6). (4.42)
Thus, in the numerator of Eq. (4.41) one must expand decay rates up to the order of the interference,
while for the denominator one can take the approximation Γ ' |Mtree|2. By extracting coupling constants
yi from amplitudes in the following schematic manner:
Mtree ≡ y1Atree , Mloop ≡ y2 y∗3 y4Aloop , (4.43)
8The square modulus of the amplitude is here assumed to be averaged over initial and summed over final degrees offreedom of the interacting particles.
61
one can, in practice, write the above numerator as:
Γ(X → ` `)− Γ(X → ` `) =(M`` ∗
treeM``loop +M``
treeM`` ∗loop
)−(M` ` ∗
tree M` `loop +M` `
treeM` ` ∗loop
)
= 4 Im(y1 y
∗2 y3 y
∗4
)Im(A∗treeAloop
).
(4.44)
This expression clearly shows that, in order to have a nonvanishing CP asymmetry, one must allow
complex coupling constants and Im(A∗treeAloop
)6= 0. Verifying these conditions demands for the presence
of at least twoX-type particles whose masses are such that the particles running along the loop are allowed
to be on-shell.
4.5 Boltzmann Equation(s)
The Boltzmann transport equation (BE) allows one to quantitatively describe the evolution in time
of the (non-equilibrium) phase-space distribution fψ(Eψ, t) of a particle species ψ influenced both by
interactions – which can be responsible for the creation or destruction of such particles – as well as by
the expansion of the Universe. The BE can be summarily written as:
L fψ = C fψ , (4.45)
where L corresponds to the Liouvillian operator [167]:
L fψ ≡∂fψ∂t−H(t)
|~pψ|2Eψ
∂fψ∂Eψ
, (4.46)
which codifies the effects of the expansion of the Universe on the evolution of particle distributions, while
C denotes the collision operator [140]:
C fψ ≡ −∑
ψ,a,...↔ i,j,...
1
2Eψ
∫dΠ′ (2π)4 δ4(pout − pin)
[|M|2dir fψfa . . . − |M|2inv fifj . . .
], (4.47)
which accounts for the effect of particle interactions. In the above expression, one sums over all possible
reactions involving the ψ species. The shorthand dΠ′ = dΠa . . . dΠi dΠj . . . is considered, where one has
made use of the definition:
dΠi ≡gi
(2π)3
d3pi2Ei
, (4.48)
where the 2Ei factors arise from a delta function imposing the energy-momentum relations E2i = |~pi|2 +
m2i . The argument of the delta function of (4.47) contains both the initial and final four-momenta,
defined as pin ≡ pψ + pa + . . . and pout ≡ pi + pj + . . .. The amplitudes |M|2dir and |M|2inv correspond to
the reactions ψ, a, . . . → i, j, . . . and i, j, . . . → ψ, a, . . ., respectively, and are averaged over both initial
and final spin degrees of freedom. The (global) signs with which these amplitudes enter the BE reflect
whether the process is responsible for the creation or destruction of ψ particles. Both a symmetry factor
and a multiplicity factor will have to be included in the above expression to account for identical particles
in the interaction and the possibility of producing/destroying more than one ψ-type particle, respectively.
Quantum effects have been ignored in writing (4.47), and can be included through the substitutions:
fψfa → fψfa (1± fi) (1± fj), fifj → fifj (1± fψ) (1± fa), (4.49)
62
where the ‘+’ and ‘-’ signs correspond to Bose enhancing and Pauli blocking factors, respectively (see
Footnote 7), which we henceforth neglect. To proceed, one considers the evolution of particle densities
ni, obtained from fi through Eq. (4.4). One thus integrates both sides of the BE, obtaining, for the
left-hand side (and using integration by parts):
gψ(2π)3
∫L fψ d
3p = nψ −∫∫
dΩ
∫ | ~pψ|4Eψ
∂fψ∂Eψ
d| ~pψ| = nψ + 3H nψ , (4.50)
while the right-hand side yields:
gψ(2π)3
∫C fψ d
3p = −∑
ψ,...↔...
∫dΠ (2π)4 δ4(pout − pin)
[|M|2dir fψfa . . . − |M|2inv fifj . . .
], (4.51)
where dΠ ≡ dΠψ dΠ. One can further manipulate the BE by noticing that:
ni + 3Hni =1
R3
d
dt
(niR
3)
=S
R3Yi = s Yi . (4.52)
By defining the variable z ≡ M/T , where M represents a relevant mass scale (chosen in the case of
thermal hierarchical leptogenesis to coincide with that of the lightest decaying particle) one sees that:
Yi =dYidz
dz
dt=
dYidz
Md(1/T )
dt=
dYidz
M
(− 1
T 2
)(dH
dt
/dH
dT
)= z H
dYidz
, (4.53)
where Eq. (4.22) for H(t) in the radiation dominated era has been used. The left-hand side of the BE
now reads:
ni + 3Hni = s z HdYidz
. (4.54)
Putting together what we have so far explicitly gives:
sHzdYψdz
=−∑
ψ,...↔...
(#ψ)
∫gψ d
3pψ(2π)32Eψ
ga d3pa
(2π)32Ea, . . .
gi d3pi
(2π)32Ei
gj d3pj
(2π)32Ej. . .
×(2π)4 δ4(pi + pj + . . .− pψ − pa − . . .)1
Πk(#k)!
[|M|2dir fψfa . . . − |M|2inv fifj . . .
],
(4.55)
where (#ψ) counts the number of destroyed ψ particles in the interaction ψ + a+ . . .→ i+ j + . . ., and(Πk(#k)!
)−1is the product of symmetry factors 1/(#k)! for all particles k involved in the reaction, each
with a multiplicity of (#k). The gi factors, present in the definition (4.48), are henceforth absorbed by
the amplitudes, which are no longer averaged but both summed over the degrees of freedom of the initial
and final states. dΠi will then simply read dΠi = d3pi/((2π)32Ei
).
All that remains is to simplify the right-hand side of the BE. We will be interested in doing so for
two particular cases, namely those of two-body decays and 2↔ 2 scatterings.
4.5.1 Two-body Decays and Inverse Decays
Consider the two-body decays of a particle species ψ through the reactions ψ → i+ j and i+ j → ψ,
respectively. We denote a parametrization of the CP asymmetry in amplitudes by:
|M(ψ → i+ j)|2 = α(ε)|M0|2 , |M(i+ j → ψ)|2 = β(ε)|M0|2 , (4.56)
where α(ε) and β(ε) contain the chosen parametrization (such as that of Eq. (4.40)).
63
The assumption of kinetic (but not chemical equilibrium) allows one to write phase-space distributions
in terms of number densities and equilibrium distributions/densities (cf. Eqs. (4.4) and (4.7)) as:
fψ = nψf eqψ
neqψ
=nψ
n′eqψ
f ′eqψ , (4.57)
where the primes denote that both f ′eqψ and n′eq
ψ correspond to a zero equilibrium chemical potential, µeqψ .
The equality is valid as the term eµeqψ /T cancels in the above expression. From now on, primes are dropped
and we work with zero chemical potential equilibrium densities and distributions (f ′eqi = e−Ei/T ).
We now split the BE (4.55) for the decaying particle and considered reactions into the direct (decay)
and indirect (inverse decay) parts. The direct part reads (symmetry factors are denoted by 1/S):
−∫dΠψ dΠi dΠj(2π)4δ4(pψ − pi − pj)
1
S |M(ψ → i+ j)|2 fψ
=− nψneqψ
∫d3pψ(2π)3
e−Eψ/T1
S
∫dΠi dΠj(2π)4δ4(pψ − pi − pj) |M|2dir
=− nψneqψ
∫gψ d
3pψ(2π)3
1
S1
2Eψ
∫dΠi dΠj(2π)4δ4(pψ − pi − pj)
1
gψ|M|2dir
e−Eψ/T
=− nψneqψ
∫gψ d
3pψ(2π)3
Γ(ψ → i+ j) e−Eψ/T ,
(4.58)
where in the last line we recognize the expression for the decay rate of ψ in an unspecified frame of
reference. The factor gψ has been explicitly and exceptionally included, since decay rates are typically
given in terms of an amplitude averaged over initial spins and summed over final ones. This rate can be
related to the decay rate in the rest frame ΓR by (Mψ is the mass of ψ particles):
Γ(ψ → i+ j) =Mψ
EψΓR(ψ → i+ j) . (4.59)
Expression (4.58) then reads:
− nψneqψ
∫gψ d
3pψ(2π)3
Mψ
EψΓR(ψ → i+ j) e−Eψ/T ≡ −nψ
⟨Mψ
EψΓR⟩eq
ψ
. (4.60)
In the last line we have defined what is usually called a thermal average [167]. Since in the rest frame
pψ = (Mψ,~0), one has that ΓR is independent of ~pψ and, therefore, it can be extracted from the thermal
average (we choose not to extract Mψ). On the other hand, Eψ is a function of the momentum, as it
does not necessarily refer to the rest frame. One explicitly has:
⟨Mψ
Eψ
⟩eq
ψ
=1
neqψ
∫gψ d
3pψ(2π)3
Mψ
Eψe−Eψ/T =
gψ(2π)3
∫ (Mψ
Eψ
)e−Eψ/T d3pψ
gψ(2π)3
∫e−Eψ/T d3pψ
. (4.61)
Taking into account the definition of the modified Bessel function given in (4.10) , one sees that:
K1(y) = y
∫ ∞
1
e−yt√t2 − 1 dt =
1
y
∫ ∞
y
e−t t√t2 − y2 dt, (4.62)
and thus the numerator may be written as (z ≡ Eψ/T , x ≡Mψ/T ):
gψ(2π)3
∫ (Mψ
Eψ
)e−Eψ/T d3pψ =
gψT3
2π2
∫ ∞
x
Mψ
Eψz√z2 − x2 e−z dz =
gψT3
2π2x2K1(x). (4.63)
64
In turn, the denominator neqψ is given in the left-hand side of Eq. (4.12). At the end, the thermal average
reduces to:
⟨Mψ
Eψ
⟩eq
ψ
=K1(x)
K2(x)=K1(Mψ/T )
K2(Mψ/T ). (4.64)
In conclusion, the direct part of the BE right-hand side, given in (4.58), simply reads:
−nψK1(Mψ/T )
K2(Mψ/T )α(ε) ΓR0 ≡ −
nψneqψ
α(ε) γD , γD = neqψ ΓR0
K1(Mψ/T )
K2(Mψ/T ), (4.65)
where ΓR0 corresponds to ΓR with |M0|2 instead of |M(ψ → i+j)|2, and γD is the decay reaction density.
For the inverse part of the BE, Eq. (4.58) is modified by changing the global sign, the amplitude and
fψ → fi fj . Since we are under an integral, where the delta function imposes conservation of energy, the
relation f eqi f eq
j = f eqψ is valid and the computations follow as in the direct case. A caveat arises regarding
(4.60), in which an neqψ is absorbed. To keep the calculation unchanged, one multiplies and divides the
expression by neqψ , resulting finally in
+ni njneqi n
eqj
neqψ
K1(Mψ/T )
K2(Mψ/T )β(ε) ΓR0 = +
ni njneqi n
eqj
β(ε) γD , (4.66)
for the inverse part. To summarize, the BE for decays and inverse decays reads:
sHzdYψdz
= −[nψneqψ
α(ε)− ni njneqi n
eqj
β(ε)
]γD . (4.67)
If one is interested in the evolution of Yi, the above is modified to dYi/dz = −(#i) dYψ/dz, where
(#i) = 2 if i = j and unity otherwise. Assuming no CP violation and that ni,j = neqi,j , one can write:
z
Yψ
dYψdz∼ −
(nψneqψ
− 1
)ΓR0H
, (4.68)
which means that the number of ψ particles in a comoving volume is not significantly altered for ΓR0 H
and interactions have thus frozen-out [54], making the rule of thumb presented in Section 4.1.4 sensible.
4.5.2 2↔ 2 Scatterings
Concerning 2 ↔ 2 scatterings ψ + a ↔ i + j, we consider the relation (4.57) once more, and define
the parametrization
|M(ψ + a→ i+ j)|2 = µ(ε)|M0|2 , |M(i+ j → ψ + a)|2 = ν(ε)|M0|2 . (4.69)
The BE (4.55) is taken as a starting point. Considering both the direct (ψ + a → i + j) and inverse
(i+ j → ψ + a) parts of the equation:
− (#ψ)
∫dΠψ dΠa dΠi dΠj(2π)4δ4(pψ + pa − pi − pj)
1
S
[|M|2dir fψfa − |M|2inv fifj
]
=− (#ψ)
∫dΠψ . . . dΠj(2π)4δ4(. . .)
1
S |M|20
[µ(ε)
nψ naneqψ n
eqaf eqψ f
eqa − ν(ε)
ni njneqi n
eqj
f eqi f
eqj
]
=− (#ψ)
[µ(ε)
nψ naneqψ n
eqa− ν(ε)
ni njneqi n
eqj
] ∫dΠψdΠa f
eqψ f
eqa
∫dΠidΠj(2π)4δ4(. . .)
1
S |M|20
=− (#ψ)
[. . .
] ∫d3pψ(2π)3
d3pa(2π)3
f eqψ f
eqa
F
EψEa
1
4F
∫dΠidΠj(2π)4δ4(. . .)
1
S |M|20
,
(4.70)
65
where one has used (4.57) and the relation f eqi f eq
j = f eqψ f eq
a . Inside the curly brackets one recognizes
the usual formula for a 2 ↔ 2 scattering cross section σ. We have also introduced the quantity F ≡√(pψ · pa)2 −M2
ψm2a and defined the Møller velocity in terms of particle velocities ~vi = ~pi /Ei [168]:
vMøl ≡F
EψEa=[|~vψ − ~va |2 − |~vψ × ~va |2
]1/2, (4.71)
which reduces to the relative velocity between particles when these are parallel. Its importance was first
recognized by Gondolo and Gelmini [167] for the computation of cosmological relic densities. One can
now rewrite Eq. (4.70) in the form:
−(#ψ)
[µ(ε)
nψ naneqψ n
eqa− ν(ε)
ni njneqi n
eqj
]neqψ n
eqa
1
neqψ n
eqa
∫d3pψ(2π)3
d3pa(2π)3
(σ vMøl
)f eqψ f
eqa
︸ ︷︷ ︸≡〈σvMøl〉eq
ψ,a
, (4.72)
where the thermal average 〈σvMøl〉eqψ,a has been set. Defining the reaction density γ2 ≡ neq
ψ neqa 〈σvMøl〉eq
ψ,a,
one can write the BE for the considered scatterings in a manner formally similar to that of Eq. (4.73):
sHzdYψdz
= −(#ψ)
[nψ naneqψ n
eqaµ(ε)− ni nj
neqi n
eqj
ν(ε)
]γ2 . (4.73)
To conclude, we now turn to the determination of a clearer formula for the scattering reaction density
γ2:
γ2 =
∫dp3ψ
(2π)3
dp3a
(2π)3σ(s)
F (s)
EψEaf eqψ f
eqa
=
∫dp3ψ
(2π)3
dp3a
(2π)3
[ ∫d4P δ(P0 − Eψ − Ea) δ3(~P − ~pψ − ~pa)
]F (s)σ(s)
EψEae−(Eψ+Ea)/T
=
∫d4P e−P0/T
∫dp3ψ
(2π)3Eψ
dp3a
(2π)3Eaδ4(P − pψ − pa)F (s)σ(s)
︸ ︷︷ ︸explicitly Lorentz invariant
,
(4.74)
where an integration term in P = (P 0, ~P ) has been introduced (the quantity between square brackets is
unity) and the dependency of F and σ on the square of the center-of-mass energy s = (pψ + pa)2 (not
to be mistaken with the entropy density) has been specified. Since the singled out quantity is Lorentz
invariant, we can compute it in any frame (but not the whole integral). In particular, one chooses the
frame in which pψ + pa = (√s,~0). This quantity reads:
∫dp3ψ
(2π)3Eψ
dp3a
(2π)3Eaδ(√s− Eψ − Ea) δ3(~pψ + ~pa)F (s)σ(s)
=4π
(2π)6
∫d|~pψ| |~pψ|
δ(|~pψ| − |~pψ|CM)
Eψ + EaF (s)σ(s) =
4π
(2π)6|~pψ|CM
F (s)σ(s)√s
.
(4.75)
In the center-of-mass frame, |~pψ|CM is given by:
|~pψ|CM =
√(s−M2
ψ −m2a
)2 − 4M2ψm
2a
2√s
≡
√λ(s,M2
ψ,m2a
)
2√s
=
√s
2
√λ(1,M2
ψ/s,m2a/s)
=F (s)√s, (4.76)
where one has defined the function λ(x, y, z) as in [169]. One can now write the reaction density as the
simple integral:
γ2 =
∫d4Pe−P0/T
1
4(2π)52 s λ
(1,M2
ψ/s,m2a/s)σ(s)
︸ ︷︷ ︸≡ σ(s)
=1
8π
∫d4P
(2π)4e−P0/T σ(s) , (4.77)
66
where one has also defined the so-called reduced (or dimensionless) cross section σ(s). Carrying out the
integration gives (P 2 = s):
γ2 =1
4(2π)5
∫4π dP0 d|~P ||~P |2 e−P0/T σ(s) =
1
4(2π)4
∫dP0
∫ds |~P | e−P0/T σ(s)
=1
64π4
∫ ∞
smin
ds σ(s)
[ ∫ ∞√s
dP0
√P 2
0 − s e−P0/T
],
(4.78)
where smin = max(Mψ + ma
)2,(mi + mj
)2. The term in square brackets can be evaluated in terms
of a modified Bessel function, yielding:
∫ ∞√s
dP0
√P 2
0 − s e−P0/T = T√sK1
(√s
T
). (4.79)
One then obtains the expression for the reaction density:
γ2 =T
64π4
∫ ∞
smin
ds√s σ(s) K1
(√s
T
). (4.80)
In this equation, the reduced cross section can also be given a clearer expression in terms of the Mandel-
stam variable t ≡ (pψ − pi)2. Since σ(s) is a Lorentz invariant quantity, it too can be computed in the
center-of-mass frame, namely
σ = 2 s λ(1,M2
ψ/s,m2a/s)σ(s) ≡ 2 s λ(s)σ(s)
= 2 s λ(s)
1
4F (s)
∫dp3i
(2π)3 2Ei
dp3j
(2π)3 2Ej(2π)4δ4(pψ + pa − pi − pj)
1
S |M(s, t)|20
= 2 s λ(s)1
2s√λ(s)
1
4(2π)2
∫dp3i dp
3j
EiEjδ3( . . . )δ
(√s− Ei − Ej
) 1
S |M(s, t)|20
=
√λ(s)
16π2
∫dp3i
EiEjδ(√s− Ei − Ej
) 1
S |M(s, t)|20.
(4.81)
Defining angular coordinates with respect to a particular z axis, chosen to coincide with ~pi, we get:
d3pi = sin θ dθ dϕ d|~pi| |~pi|2 = d(cos θ) dϕ dEiEi |~pi|, d(cos θ) =1
2|~pψ||~pi|dt , (4.82)
which allows us to write:
σ =
√λ(s)
16π2
∫dp3i
EiEjδ(√s− Ei − Ej
) 1
S |M(s, t)|20 (4.83)
=
√λ(s)
16π2
∫d(cos θ) dϕ dEiEi |~pi|
EiEjδ(√s− Ei − Ej
) 1
S |M(s, t)|20 (4.84)
=
√λ(s)
16π2
∫dϕ
∫dEi
∫d(cos θ)
|~pi|Ej
δ
(√s− Ei −
√m2j + |~pj |2
)1
S |M(s, t)|20 . (4.85)
After expanding the delta function and performing some straightforward manipulations, one finally arrives
at:
σ(s) =1
8πs
∫dt
1
S |M(s, t)|20 . (4.86)
The reaction dentity γ2 of Eq. (4.80), to be included in (4.73), is then given as an integral of this reduced
cross section, for which the amplitude is summed over both initial and final state degrees of freedom, as
previously mentioned.
67
In the context of leptogenesis, Boltzmann equations can be used to quantify the evolution of an
asymmetry in lepton number, generated via the out-of-equilibrium decay of heavy seesaw mediators. In
general, the result of computing the BE solution can be encoded into a single parameter 0 < η < 1, which
describes the efficiency in producing a net asymmery. In view of this, one usually writes:
YB = C YL = C ε η YX∣∣TM , (4.87)
where ε is the CP asymmetry in leptonic decays and C is the sphaleron conversion factor between baryon
and lepton numbers, which, for a second-order phase transition, is given in Eq. (4.35). The equilibrium
number density divided by the entropy density at high temperatures is given by:
YX∣∣TM =
3
4ζ(3)
gXπ2
(2π2
45g∗S
)−1
=135 ζ(3) gX
8π4 g∗S' 1.95× 10−3 gX . (4.88)
We conclude this chapter by summarizing the main approximations which have been considered so
far in the derivations presented above. These are:
• Absence of entropy production [167].
• Taking the available degrees of freedom to be independent of temperature in the early radiation
dominated era, g∗(T ) ' g∗S(T ) ' const. = 106.75.
• Neglecting quantum statistics factors in general, namely Pauli blocking and Bose enhancing
corrections to the collision operator.
• Working in zero temperature field theory, as opposed to considering finite temperature correc-
tions, belonging to the domain of thermal field theory. These effects have been considered by
Guidice et al. [169] for type I seesaw leptogenesis, resulting in corrections of order ∼ 10%, at most.
We also ignore the effect of Sommerfeld corrections to cross sections (see, for instance, [170]).
• The classical kinetic theory assumption of a weakly interacting, dilute gas of particles, which allows
one to consider only binary collisions.
• The assumption of molecular chaos (stosszahlansatz), which corresponds to saying that the mo-
menta of particles in a volume element are uncorrelated and thus simple products of phase-space
distribution functions arise in the BE [140].
• Insisting on the kinetic equlibrium parametrization of (4.7), with Ti = T and all the uncertainty
in the distributions contained in the chemical potential [142]. This is taken by Luty [171] to be
viewed as an ansatz which dictates the form of relevant out-of-equilibrium effects. Computations
where this assumption has been relaxed have been carried out for type I seesaw leptogenesis [172].
• Taking the quantum amplitude, both for scatterings or decays, to be independent of thermal
motion relative to the plasma (see appendix A of [169]).
Some of the above approximations are commonly employed to render the problem treatable, without
significantly hindering the quality of results.
Having obtained the general form of the kinetic equations which govern the number densities in a
thermal plasma, we now apply them to the case where the decaying particles generating the lepton
asymmetries are the heavy scalar triplets of the type II seesaw mechanism discussed in Section 2.2.2.
68
Type II Seesaw Leptogenesis 5In this chapter we will apply the formalism developed in Chapter 4 for type II seesaw leptogenesis,
namely that of CP asymmetries, lepton to baryon number conversion, and Boltzmann equations, including
the computation of decay rates, reduced cross sections and reaction densities. We follow the most general
approach where flavour effects are considered, taking into account processes which change the number
densities in each flavour. After obtaining the full set of flavoured Boltzmann equations, these are solved
numerically in the context of the A4 model with spontaneous CP violation discussed in Chapter 3.
5.1 Flavoured CP Asymmetries from Triplet Decays
We start by parametrizing the branching ratios for each scalar triplet decay channel beyond tree-
level, i.e. taking into account the CP asymmetries arising from the interference between tree-level and
higher-order diagrams, in a manner consistent with CPT invariance and unitarity (cf. Section 4.4.1).
Namely,
|M(∆i → `α `β)|2 = |M(`α `β → ∆∗i )|2 =(Bαβi,L − εαβi /2
)|M0|2,
|M(∆∗i → `α `β)|2 = |M(`α `β → ∆i)|2 =(Bαβi,L + εαβi /2
)|M0|2,
|M(∆i → φφ)|2 = |M(φ∗ φ∗ → ∆∗i )|2 =(Bi,H + εi/2
)|M0|2,
|M(∆∗i → φ∗ φ∗)|2 = |M(φφ→ ∆i)|2 =(Bi,H − εi/2
)|M0|2.
(5.1)
Here, (4.40) has been generalized to include the tree-level branching ratios Bαβi,L and Bi,H , as the scalar
triplets possess both a lepton doublet (for clarity, one omits the SU(2) L subscript) and a Higgs doublet
decay mode1. The CP asymmetries and branching ratios defined above obey:
∑
α≥ β
Bαβi,L ≡ Bi,L ,∑
α≥ β
εαβi ≡ εi , Bi,L +Bi,H = 1 . (5.2)
As a function of decay rates (recall (4.41)), the asymmetries are given by:
εαβi = 2Γ(∆∗i → `α `β)− Γ(∆i → `α `β)
Γ∆i + Γ∆∗i
=Γ(∆∗i → `α `β)− Γ(∆i → `α `β)
Γ∆i
, (5.3)
1 Doublet components are indistinguishable above the electroweak phase transition and one can then regard a lepton orHiggs doublet as a single species with g` = g` = 2 and gφ = gφ∗ = 2. The same applies to triplets and triplet components,and g∆i = g∆∗i
= 3.
69
∆i
`β
`α
∆i
φ
φ
Tree-Level
∆i ∆j
φ
φ
`α
`β
∆i ∆j
`α
`β
`µ
`ν
One-Loop
Figure 5.1: Tree-level diagrams for the decays of type II seesaw scalar triplets and one-loop diagrams
contributing to the decay process ∆i → `α `β . The one-loop diagrams, which correspond to the rightmost
diagram of Fig. 4.5, represent a wave-function correction to the tree-level amplitude (unlike the type I
case, where a vertex correction is also present).
where Γ∆iand Γ∆∗i
denote the total decay rates of the triplet and its conjugate, respectively, which
are equal as a consequence of CPT invariance. The interfering diagrams which are relevant for the
computation of the CP asymmetry are presented in Fig. 5.1.
Resorting to the Feynman rules presented in Fig. 2.4, one can write the amplitudes for the diagrams of
Fig. 5.1 and their CP conjugates. Since all triplet components possess the same dynamics at high-energy,
we consider amplitudes and decay rates computed for a single component, chosen for simplicity to be ∆0i
(this clearly has no effect on the asymmetry of (5.3)). Amplitudes are denoted by the superscripts [+2]
and [−2], according to the lepton number L produced in each reaction. Amplitudes for the Higgs channel
are obtained directly from Fig. 2.4. Above the EWPT, fermions are taken to be massless, while Higgs
doublets can be given a nonvanishing thermal mass mφ. For the tree-level amplitudes one has:
M[+2]tree = −2iY∆i∗
αβ uα PR vβ ,
M[−2]tree = −2iY∆i
αβ uα PL vβ .(5.4)
The one-loop amplitudes Mloop are given by the sum of contributions from the two wave-correction
diagrams of Fig. 5.1 (a topological symmetry factor of 1/(1 + δµν) is introduced for the loop):
M[+2]loop = − i
4π2
∑
j 6=i
uα PR vβM2i −M2
j
[µ∗i µj Y
∆j∗αδ R+ I+ Y
∆j∗αδ Y∆j
µν Y∆i∗µν
R′ + I ′1 + δµν
],
M[−2]loop = − i
4π2
∑
j 6=i
uα PL vβM2i −M2
j
[µi µ
∗j Y
∆j
αδ R+ I+ Y∆j
αδ Y∆j∗µν Y∆i
µν
R′ + I ′1 + δµν
],
(5.5)
where R + I and R′ + I ′ contain both a real and an imaginary part (hence the designations R and
I):
R+ I = ∆ε − ln(M2i /µ
2)
+ 2 ln 2 + iπr − ln(1− r2)− r ln
(1 + r
1− r
),
R′ + I ′ = M2i
(∆ε − ln
(M2i /µ
2)
+ 2 + iπ),
(5.6)
where r =√
1− 4m2φ /M
2i and ∆ε ≡ 2/ε− γ + ln(4π) contains the one-loop divergence (γ is the Euler-
-Mascheroni constant) which has been isolated through dimensional regularization (4D ⇔ ε → 0, and µ
represents the typical dimensionful auxiliary parameter).
70
The rate of a two-body decay is explicitly given in terms of the corresponding amplitude by:
Γ =1
S|~q|
8πM2i
|M|2 , (5.7)
where |~q| is the solution to Mi =√m2
1 + |~q|2 +√m2
2 + |~q|2 (mi denote masses of the decay products).
For leptons in the final state, |~q| = Mi/2, while for the Higgs channel |~q| =(Mi/2
)r. In the above
equation, a sum over final spin states,∑sα,sβ
|uα PL,R vβ |2 = 2 pα · pβ = M2i , is implicit in |M|2. One
therefore has, for the total tree-level decay rate:
Γ∆i=
1
16πMi
[ ∑
α≥ β
1
1 + δαβ|M(∆i → `α `β)|2 +
1
2|M(∆i → φφ)|2 r
]
=1
16πMi
[1
2
∑
α,β
(4|Y∆i
αβ |2M2i
)+
1
2
(4|µi|2
)r
]mφMi−−−−−−→ Mi
8π
(Tr[Y∆i
†Y∆i
]+ |λi|2
),
(5.8)
where λi ≡ µi/Mi, and one has taken the limit mφ Mi ⇒ r → 1. At this point, we can also give
expressions for the tree-level branching ratios:
Bαβi,L =2
1 + δαβ
∣∣Y∆i
αβ
∣∣2
Tr[Y∆i
†Y∆i
]+ |λi|2
, Bi,H =|λi|2
Tr[Y∆i
†Y∆i
]+ |λi|2
. (5.9)
By expanding the (beyond tree-level) decay rate as in (4.42), one then obtains the expression for the CP
asymmetry:
εαβi =1
1 + δαβ
1
16πMi
∑
µ,ν
8π
Mi
2 Re[M[+2]∗
tree M[+2]loop −M
[−2]∗tree M[−2]
loop
]
Tr[Y∆i
†Y∆i
]+ |λi|2
. (5.10)
Inserting now the above amplitudes into this expression and considering Mj Mi one finally arrives at:
εαβi = − 2
1 + δαβ
1
2π
∑
j 6=i
Mi
Mj
Im(λ∗i λjY
∆i
αβY∆j∗αβ + (Mi/Mj) Tr
[Y∆i
†Y∆j
]Y∆i
αβY∆j∗αβ
)
Tr[Y∆i
†Y∆i
]+ |λi|2
, (5.11)
which is split into two contributions, the second of which clearly reduces to zero in the unflavoured case,
i.e. upon summing over the flavour indices, α ≥ β. Although we could write (5.11) as a function of the
mass matrices Mν , as is done for instance in Ref. [133], we will not do so since the couplings are defined
at high scales. The connection between the low-energy Mν and the high-energy Y∆ would require a
proper treatment relying on renormalization-group effects.
5.2 Boltzmann Equations for Type II Seesaw
In order to track the evolution of a lepton asymmetry generated by the out-of-equilibrium decays of
the lightest Higgs triplet, we consider the network of Boltzmann equations which account for the relevant
out-of-equilibrium reactions, namely: decays, inverse decays, ∆L = 2 s-channel and t-channel scatterings,
and gauge scatterings, which will tend to keep triplets close to thermal equilibrium (absent in the type I
thermal leptogenesis scenario). The diagrams for these processes are depicted in Fig. 5.2.
In the following, we will be interested in the BEs which govern the evolution of the number densities
(normalized to the entropy density) of both the sum, ΣY∆i ≡ Y∆i + Y∆∗i, and the difference, δY∆i ≡
71
`β
`α
Decays and Inverse Decays
∆i ∆i
φ
φ
`α
∆i
φ
φ`β
s-Channel Scatterings
Gauge Scatterings
Fermions and Higgs
∆i
∆i
f, φ
f, φ
Ai, B
`α
∆i
φ
φ`β
t-Channel Scatterings
t-channel
∆i
∆i
∆i
Aa, B
Ab, B
u-channel
∆i
∆i
∆i
Aa, B
Ab, B
cubic
Aa, B
∆i
∆i
quartic
Ab, B
Ac, B
∆i
∆i
Aa, B
Ab, B
Figure 5.2: Scalar triplet interactions relevant to the BE out-of-equilibrium analysis, where one considers
the diagrams presented for decays, inverse decays and s- and t-channel scatterings and their charge con-
jugates (reverse all arrows), as well as gauge scattering reactions (f denotes any SM fermion). Reactions
can proceed in both the direct and inverse direction.
Y∆i− Y∆∗i
, between densities of triplets and their conjugates, and the number densities of the three
differences B/3 − Lα, denoted by YBLα. We choose to work with these three differences between (a
third of) baryon number and each lepton flavour number, as they correspond to non-thermalizing modes,
meaning they are unaffected by rapid sphaleron processes. The total B − L can be obtained from the
sum of these quantities and is directly related to the density of baryon number through (4.35). The
evolution of the YBLα will then only be influenced by the out-of-equilibrium reactions involving heavy
triplets, which break B− L. The BE for YBLα is obtained from that of δY`α ≡ Y`α − Y`α by [173]:
sHzd δY`αdz
= f(δY`α , δYφ
)⇒ sHz
dYBLα
dz= −f
((CL)αβ YBLβ , (CH)β YBLβ
), (5.12)
meaning that, aside from a global sign change, one introduces a matrix CL and a vector CH which
encode the effects of equilibrium interactions in the plasma, relating each YBLα with all three δY`α and
with δYφ ≡ Yφ − Yφ∗ , respectively. The entries of CL and CH will then depend on interactions which
are fast (Γ > H), such as gauge, heavy fermion Yukawa, and sphaleron interactions – collectively known
as spectator processes – and thus impose relations between the various chemical potentials. Solving the
linear system which contains these relations for different temperature regimes yields [174]:
1012 GeV . T . 1013 GeV : CL =
(−1 0 00 −1 0
−1/16 −1/16 −3/4
), CH = − 1
8
(3 3 4
),
1011 GeV . T . 1012 GeV : CL = 1460
(−196 34 24
34 −196 249 9 −156
), CH = − 1
115
(41 41 56
),
108 GeV . T . 1011 GeV : CL = 11074
(−906 120 120
75 −688 2875 28 −688
), CH = − 1
179
(37 52 52
),
T . 108 GeV : CL = 2711
(−221 16 16
16 −221 1616 16 −221
), CH = − 16
79
(1 1 1
).
(5.13)
The entries may vary slightly depending on the quark Yukawa interactions which are taken to be in
equilibrium [175]. Even though in some regimes some lepton flavours are indistinguishable, matrices are
72
kept 3 × 3 by convenience, as is done in [173]. Above T ∼ 1013 GeV, one takes lepton flavours to be
indistinguishable, since all lepton Yukawa interactions are out-of-equilibrium.
As seen in Section 4.5, the BE for a particle species ψ affected by the reaction ψ(+a)→ i+ j is (cf.
(4.67) and (4.73)):
sHzdYψdz
= −(#ψ)
[nψ(na)
neqψ (na
a)α(ε) +
ni njneqi n
eqj
β(ε)
]γ . (5.14)
If more than one interaction comes into play, the right-hand side is augmented by other terms of the same
kind. In order to obtain the BE for both the triplet sum ΣY∆iand difference δY∆i
, one first considers
the equations for Y∆iand Y∆∗i
individually. We work with an equation for the whole triplet population
and not a specific component, which allows a straightforward inclusion of coannihilations in the BE [176].
Expressions for α(ε) and β(ε) are to be extracted from the parametrization of (5.1). An exact result for
ΣY∆iis given, considering only triplet decays (and inverse decays), by:
sHzdΣY∆i
dz= −
ΣY∆i
Y eq∆i
−∑
α≥ β
[Y`αY`β
Y eq`
2
(Bαβi,L + εαβi /2
)+Y`αY`β
Y eq`
2
(Bαβi,L − εαβi /2
)]
−[Y 2φ
Y eqφ
2
(Bi,H − εi/2
)+
Y 2φ∗
Y eqφ
2
(Bi,H + εi/2
)]γD ,
(5.15)
where γD is given in (4.65) (with ΓR0 = Γ∆i). In order to write the BEs in a condensed form, some
useful approximations are considered. In particular, the expansion Yi∗Yj∗ ' Yi Yj − Yi δYj − Yj δYi, for
particle species i and j, presents itself as very useful. Additional approximations include neglecting terms
of the type δY ε, making Y ε ' Y eq ε and, in some cases, taking YiYj + Yi∗Yj∗ ' Y eqi Y eq
j . One also has
ΣY eq∆i
= 2Y eq∆i
and defines γΣD ≡ 2γD, thus arriving at the partial result (no gauge scatterings yet):
sHzdΣY∆i
dz= −
(ΣY∆i
Y eq∆i
− 2
)γD = −
(ΣY∆i
ΣY eq∆i
− 1
)γΣD . (5.16)
Proceeding in a similar manner for gauge triplet interactions as well as for the remaining quantities of
interest, namely δY∆i and YBLα, the full system of Boltzmann equations finally reads:
sHzdΣY∆i
dz= −
(ΣY∆i
ΣY eq∆i
− 1
)γΣD − 2
[(ΣY∆i
ΣY eq∆i
)2
− 1
]γA , (5.17)
sHzd δY∆i
dz= −
[δY∆i
ΣY eq∆i
+1
2
∑
α≥ β
Bαβi,L(CL)αµ YBLµ + (CL)βν YBLν
Y eq`
−Bi,H(CH)µ YBLµ
Y eqφ
]γΣD , (5.18)
sHzdYBLα
dz= −
∑
β
(1 + δαβ)γD
[εαβi
(ΣY∆i
ΣY eq∆i
+ 1− 2Bi,H
)
− 2Bαβi,L
(εi +
δY∆i
ΣY eq∆i
+1
2
(CL)αµ YBLµ + (CL)βν YBLν
Y eq`
)](5.19)
− 2[(1 + δαβ)(γ′s)αβ + (γt)αβ
][1
2
(CL)αµ YBLµ + (CL)βν YBLν
Y eq`
+(CH)µ YBLµ
Y eqφ
],
where a sum over µ and ν (but not over α) is implied, and equilibrium distributions are given by (nγ is
found in (4.16)):
neq∆i
=g∆i
2π2TM2
i K2
(Mi
T
), neq
` =3
4ζ(3)
g`π2
T 3, neqφ = ζ(3)
gφπ2
T 3. (5.20)
73
Here, the degrees of freedom gi are those indicated in Footnote 1 of the present chapter. Notice that
equilibrium distributions are the same for particles and antiparticles. In the above, γA and γt denote the
reaction densities for gauge and t-channel scatterings, while γ′s is defined as:
(γ′s)αβ = (γs)αβ −Bαβi,LBi,H γD , (5.21)
with γs denoting the s-channel reaction density. In fact, when one includes the BE right-hand side term for
the evolution of YBL corresponding to the s-channel process, one must use a so-called subtracted s-channel
reaction density γsubs , instead of the full γs since the latter includes a resonance – real intermediate state
(RIS) – which has already been taken into account by decays and inverse decays. The subtraction is done
in the narrow width approximation (see, for instance, [169]) and amounts to having:
(γsubs )αβ = (γs)αβ −
(Bαβi,L ±
εαβi2
)(Bi,H ±
εi2
)γD = (γ′s)αβ ∓
[εi2Bαβi,L +
εαβi2Bi,H
]γD +O(ε2) ,
(5.22)
where the ± corresponds to reactions which create/destroy lepton number, respectively. The term orig-
inating from the content of the square brackets has been moved, in the final BE expression (5.19), next
to the decay term, while γ′s appears alongside γt. The inclusion of the (subtracted) s-channel scattering
term is crucial2 for the consistency of the BEs: without it, we are limited to processes which create an
asymmetry even in thermal equilibrium [141] (cf. (4.40) and discussion at the end of Section 4.2). The
computation of the scattering reaction densities γA,s,t is addressed in the following section.
5.3 Scattering Reaction Densities
The general expression for scattering reaction densities as a function of the respective reduced cross
sections σ is given in Eq. (4.80), where – for massless fermions (above the EWPT) and under the
assumption of a negligible Higgs thermal mass – smin = 4M2i for gauge scatterings (two triplets in
the initial state), and smin = 0 for s- and t-channel processes. The reduced cross sections which enter the
above expressions are obtained from the amplitudes through Eq. (4.86). For the case of gauge scatterings3,
the amplitudes themselves are computed by considering the interactions of triplets with gauge bosons,
which can be read off from the expansion of the covariant derivative, given by (2.5) (with the generators
of (2.58)), in the seesaw Lagrangian (2.64).
A non-trivial step arises during the computation of diagrams with massless gauge bosons in the
final state (SSB is yet to happen), for which one must forcibly consider the following expression for the
polarization sums [178] (with boson four momenta p3µ and p4µ):
Pµν(pi) =∑
σ
εµ(pi, σ) ε∗ν(pi, σ) = −gµν +piµ ην + piν ηµ
pi · η− η2
piµ piν(pi · η)2
, (5.23)
where εµ(pi) are polarization four-vectors, the σ denote possible polarizations, and ηµ is a four-vector
which verifies η ·ε(pi, σ) = 0 and η ·pi 6= 0. The valid choice η = p3 +p4 is considered in our computation.
The integration limits for the Mandelstam variable t arising in (4.86) are, for a generic scattering i+ j →2Notice that what is typically ignored in the literature – due to its generally small magnitude – is γ′s, and not γsub
s .3A helpful reference concerning gauge interaction amplitudes for charged scalars is Ref. [177].
74
k + l, given in terms of masses and s (the Mandelstam variable, not the entropy density) by:
m2i +m2
k −1
2s
(s+m2
i −m2j
)(s+m2
k −m2l
)± 1
2s
√λ(s,m2
i ,m2j
)λ(s,m2
k,m2l
), (5.24)
where the definition of λ(x, y, z) has been given in Eq. (4.76).
Integrating over t results in the following reduced cross sections for the gauge scatterings (the sum of
which is denoted by σA)4:
σ(∆i∆∗i → ff , φφ∗) =
1
16π
(50 g4
2 + 41 g4Y
)r3i ,
σ(∆i∆∗i → AaAb) =
g42
π
[(5 + 34
M2i
s
)ri − 24
(M2i
s
)2(s
M2i
− 1
)ln
(1 + ri1− ri
)], (5.25)
σ(∆i∆∗i → AaB,BB) =
3
2π
(4g2
2g2Y + g4
Y
)[(1 + 4
M2i
s
)ri − 4
(M2i
s
)2(s
M2i
− 2
)ln
(1 + ri1− ri
)],
where ri ≡√
1− 4M2i /s. Similarly, for s- and t-channel scatterings, one obtains (notice that this cross
section corresponds to the full γs):
σαβs (s) =3|µi|2
∣∣Y ∆i
αβ
∣∣2
πM2i
1
1 + δαβ
[x
(x− 1)2 +(Γ∆i
/Mi
)2], (5.26)
σαβt (s) =6|µi|2
∣∣Y ∆i
αβ
∣∣2
πM2i
[− 1
1 + x+
ln(x+ 1)
x
], (5.27)
where xi ≡ s/M2i and the notation σs,t is self-evident. In the unflavoured limit, i.e. after a sum in flavour
indices (∑α≥β for σs and
∑α,β for σt), these reaction densities agree with those5 of Ref. [179] in the
absence of four-point ∆L = 2 interactions.
5.4 Leptogenesis in an A4 Model
We now turn to a particular framework – the A4-based model presented in Chapter 3 – in the context
of which leptogenesis is, in principle, possible through the out-of-equilibrium decays of type II seesaw
mediators. It is here assumed that the masses of the triplets are hierarchical M2 M1, meaning that
any asymmetry generated due to the decoupling of the heaviest triplet is erased by interactions involving
the lightest, which is in thermal equilibrium at that time. We will thus be interested in tracking the
evolution of Y∆1 . The CP asymmetries for both triplets read:
εαβ1 =1
3π
z1z2 |u1|2M21
4|u1|4M21 + 3z2
1v4
sinβ
︸ ︷︷ ︸≡ε01
−1 0 0
0 0 1
0 1 0
, (5.28)
εαβ2 =1
3π
z1z2 |u2|2M22
4|u2|4M22 + 2z2
2v4
[1 +
2
9
z22 v
4(ε1 + ε2)
2M22 |u2|4 + z2
2v4
]sinβ
︸ ︷︷ ︸≡ε02
−1 0 0
0 0 1
0 1 0
, (5.29)
where (5.11) (M2 M1) was used in deriving the expression for εαβ1 , and εαβ2 is obtained from the same
expression with a global minus sign since now Mi Mj (cf. squared-mass difference denominators in
4These results are in agreement with those of Ref. [179], with gauge reduced cross sections behaving as σ(∆i∆∗i →
ff , φφ∗) ∼ r3 and σ(∆i∆∗i → AaAb, AaB,BB) ∼ r in the high-energy limit (not low-energy, as is mistakingly remarked).
5In both eqs. (17a) and (17b) of Ref. [179] every “MT ” arising within square brackets should be replaced by an “mT ”.
75
Figure 5.3: Contours of the (magnitude of the) maximum CP asymmetries in the decays of hierarchical
scalar triplets ∆1,2. For temperatures above T ∼ 1013 GeV, lepton flavours are no longer distinguishable
and one enters the unflavoured regime.
(5.5)) and the second term in Eq. (5.11) vanishes. Notice that the total CP asymmetries vanish in this
model (purely flavoured leptogenesis) and that diagrams with leptons running in the loop produce no
contribution to the asymmetries. The correcting factor arising in square brackets for the ε2 asymmetry
contains perturbations (not to be confused with total CP asymmetries) which enter the expression solely
through the trace in the denominator of (5.11). Maximization of these asymmetries with respect to the
|ui| (the zi are fixed as functions of neutrino masses and perturbations by (3.49)) gives:
ε01,max =
z2
12√
3πv2M1 sinβ , for |u1|2 =
√3
2
z1v2
M1, (5.30)
ε02,max '
z1
12√
2πv2
[1 +
1
9(ε1 + ε2)
]M2 sinβ , for |u2|2 '
1√2
[1− 1
9(ε1 + ε2)
]z2v
2
M2. (5.31)
Expressing this in terms of the angle β and squared mass differences yields:
ε01,max '
√∆m2
31
12√
6πv2
[1− 1
3(ε1 + ε2)
]M1 sinβ , (5.32)
ε02,max ' −
√∆m2
31
48πv2
[1− 1
9(ε1 + ε2)
]M2 tanβ . (5.33)
The magnitude of the maximum CP asymmetry ε0i,max is shown as a contour plot in the (β,Mi)-plane in
Fig. 5.3.
To solve the BE network for the case at hand, a Fortran-based programme was constructed. In its
context, all quantities are made dimensionless and reaction densities are obtained from the numerical
integration of reduced cross sections. To numerically integrate the BEs – and thus arrive at entropy-
-normalized number densities YX – we resort to the Runge-Kutta-Fehlberg method.
76
90 95 100 105 110 115
10-8
10-9
10-10
10-11
10-12
10-13
10-14
245 250 255 260 265 270
10-8
10-9
10-10
10-11
10-12
10-13
10-14
M1 = 1´1012
GeV
M1 = 5´1012
GeV
YB
Β HºLFigure 5.4: Scatter plot of the baryon asymmetry generated in randomly-chosen perturbed versions of the
model, for M1 = 1012 GeV (black) and M1 = 5× 1012 GeV (cyan), which are characterized by different
values of β. The green horizontal bar spans a 30% deviation for YB from the central value of Eq. (1.6).
A set of working assumptions is considered, namely M2 = 10M1 and |u1| = u2 (equality of VEVs),
which automatically imply λ2 = 10 |λ1| through (2.72). The mass of the lightest triplet, M1, is chosen
such that the (maximum) CP asymmetry is large but still in the flavoured regime (see Fig. 5.3). The
value of |λ1| (or, equivalently, that of the VEV u1) is in turn fixed by the best-fit value of the solar
squared-mass difference ∆m221, as:
|λ1|2 =M2
1 ∆m221
4v4(eig(2)− eig(1)
) , (5.34)
where eig(i) are the numerically-determined (ordered) eigenvalues of the (perturbed) matrix (Y∆1 +
Y∆2)† (Y∆1 + Y∆2), which are proportional to neutrino squared masses.
Both z1 and z2 are randomly generated in the range [10−3, 2], while the perturbations ε1 and ε2
are varied in [−0.2, 0.2]. In Fig. 5.4 we show a scatter plot of YB for two different values of M1, for
randomly-chosen perturbed versions of the model. One expects that if the choice of parameters – which
the programme selects randomly – maximizes the CP asymmetry, more points should be accessible above
the experimental value of YB.
We now restrict ourselves to a particular point, chosen to lie in the region above the green band,
namely that which corresponds to z1 ' 0.2595, z2 ' 0.0656, ε1 ' −0.1668, ε2 ' 0.1682, and β ' 255.2o.
Reaction densities and the evolution of asymmetries for this particular perturbed version of the model
are given in Fig. 5.5. A baryon asymmetry YB ' 5.93 × 10−10 is obtained, following the decoupling of
the lightest triplet, by summing over all three YBLα and converting the resulting YB−L asymmetry into
a purely baryonic one through (4.35). For such a case, one additionally has ε01 ' −2.02 × 10−5 and the
77
Figure 5.5: Reaction densities normalized to the product H(T )nγ(T ) (left) and evolution of the various
densities considered in the BE network (right). RIS subtraction has been considered, and the given γ′s is
equal to (γ′s)ee + (γ′s)µτ , with (γ′s)µτ = 2(γ′s)ee, while γt is equal to (γt)ee + (γt)µτ + (γt)τµ, with all these
terms identical between them. Gauge scatterings keep triplets close to equilibrium.
branching ratios obey Bµτ1,L = 2Bee1,L ' 0.654.
The efficiency parameter defined in (4.87) can be generalized, in the case of flavour, by considering
flavoured efficiencies ηα which translate the strenght with which an asymmetry in each lepton flavour is
generated:
YB = C
(∑
α,β
ηα εαβ
)YX∣∣TM . (5.35)
Notice that, for scalar triplets, YX = ΣY∆i. For the case under consideration, one obtains the flavoured
efficiencies ηe ' 0.686, and ηµ ' ητ ' 0.346.
We have seen that the model is sufficient to account for the observed baryon asymmetry of the
Universe. This behaviour may appear borderline. However, it must be emphasized that some regions
of parameter space remain to be explored. One has assumed that triplet masses are hierarchical, taken
to imply that asymmetries produced prior to the decay of the lightest triplet have been washed out.
The fact that CP asymmetries in the out-of-equilibrium decays of ∆2 can overshadow those of ∆1 by as
much as an order of magnitude (see Fig. 5.3) challenges this assumption. Flavour considerations may
nevertheless come to our rescue, as a sufficiently massive ∆2 may decay during the unflavoured regime,
for which all different lepton flavours are out-of-equilibrium and indistinguishable. This fact enters in
the BEs through CL and CH and the structure of the model prevents, in such a case, the generation of
asymmetries.
78
Concluding Remarks
Particle physics operates on the most fundamental level of science: the subatomic world. At present,
our understanding of fundamental particles and their interactions relies on a powerful guiding principle:
the principle of symmetry. Its validity spans from quantum to macroscopic scales and, therefore, it is
only natural to adopt it as a guide in searching for the explanation of the unsolved mysteries of our
Universe. Driven by the observed relations between quantities, physical theories ascribe a central role
to symmetries. These arise in modern physics through the abstract language of group theory, which
generalizes spacetime symmetries to local internal ones. The latter are at the heart of the SM and, thus,
of our current understanding of elementary particles.
In the last couple of years we have witnessed important advances in particle physics. The LHC
ATLAS and CMS discovery of the long-awaited Higgs boson at CERN is gradually clarifying the structure
of electroweak symmetry breaking and providing ultimate proofs of the SM. Still, and in spite of its
numerous successes and repeated confirmations beyond the classical level, there is a plethora of questions
for which the SM offers no answer. Perhaps the best example of the standard theory limitations is the
phenomenon of neutrino oscillations, a purely quantum mechanical process which implies nonvanishing
neutrino masses and mixing. Since neutrinos are strictly massless in the SM, the observation of neutrino
oscillations provides a solid evidence for new physics. From the theoretical viewpoint, the longstanding
problem of fermion mass origin is now augmented by a bizarre fact: neutrino masses are very small when
compared with those of other fermions, and the neutrino mixing pattern is completely different from that
observed in the quark sector.
The most popular SM extensions accommodating massive neutrinos in a natural way are those in
which neutrino masses are generated through the tree-level exchange of new heavy particles (see Chapter
2). Interactions (and masses) of these extra states with the SM degrees of freedom determine the flavour
structure of the effective Majorana neutrino mass resulting from the decoupling of the heavy particles and
from electroweak symmetry breaking. Those interactions and masses are, in general, free parameters of
the theory which are consistent with the gauge and Lorentz symmetries. As it is well known, this leaves
too much room for arbitrariness. Therefore, inspired by their normative role, one often considers new
symmetries to shape the fermion mass and mixing pattern. In Chapter 3 we have identified the symmetries
of the effective (Majorana) neutrino mass matrix and analysed the phenomenological viability of a specific
model based on an A4 non-Abelian discrete symmetry. In this scenario, CP-violating effects stem from
a single complex phase associated to the VEV of a scalar singlet. The numerical analysis presented in
Section 3.3 shows that the model is compatible with all data provided by neutrino oscillation experiments
79
and predicts large CP-violating effects, which could be detected in future reactor, long-baseline and
accelerator neutrino experiments.
In our constant search for a deeper understanding of the Universe, we are often faced with the problem
of comparing different scales and distances. Intuitively, phenomena which occur at the subatomic level
are, in general, uncorrelated with those happening at cosmological distances: how can the structure of
the Universe at large scales have anything to do with interactions among fundamental particles? The
key to this enigma relies on considering the present Universe as the result of an evolutionary, dynamic
process.
Although symmetries are crucial for the explanation of particle interactions, the truth is that we owe
our existence to symmetry breaking, in line with Curie’s Principle. In the SM, the Higgs mechanism is
responsible for giving mass to the gauge bosons and (some) fermions, keeping photons massless. However,
it remains to be explained why there is more matter than antimatter in the Universe since, after all, this
is a requirement for our existence. Once more, this fact does not find a satisfactory explanation in
the SM framework. A remarkable feature of the seesaw paradigm discussed in Chapter 2 is that the
presence of heavy neutrino mass mediators provides a natural explanation for the fact that we live in a
matter-dominated Universe. The connection between neutrino phenomenology and cosmology may thus
be established within the leptogenesis scenario, where the excess of baryons in the Universe relies on a
lepton asymmetry, as discussed in Chapter 4. For the specific case of type II seesaw, the starting point to
generate that lepton asymmetry is the out-of-equilibrium decay of the heavy-triplet scalars. In Section 5.1
we have obtained the general expressions for the CP asymmetries generated by the interference between
one-loop and tree-level decay diagrams, while in Section 5.2 the relevant set of Boltzmann equations which
govern the evolution of lepton number densities in each flavour have been obtained. The viability of the
leptogenesis scenario in the framework of the A4 model analysed in Section 3.3 has been also explored
in Section 5.4. We have concluded that the model not only allows to reproduce the current neutrino
mass and mixing pattern, but also generates a sufficiently large baryon asymmetry of the Universe, in
accordance with the experimental result.
Our conclusions justify the purpose of this thesis: to relate neutrino masses and mixing, symmetries
and the origin of matter. What we have presented is just an example (among many others) of how
complementary studies may shed some light on answering open questions in fundamental physics, the
future of which hinges on the combined exploration of physical phenomena at the energy, intensity
and cosmic frontiers. Neutrino oscillations represent the most fruitful example of physics at the intensity
frontier and the next years will surely reveal something more about the elusive neutrinos, like the neutrino
mass hierarchy and leptonic CP violation. On the other hand, the synergies between observations at
the energy frontier provided by accelerators like the LHC and those at the intensity frontier can be
complemented with data from cosmological experiments in the search of a more complete description of
Nature.
Quaerendo Invenietis (“By seeking, you will discover”).
80
Bibliography
[1] M. C. Escher and J. E. Brigham, The graphic work of M.C. Escher, (2001) Taschen.
[2] J. Bosdriesz (Director) and M. Plukker (Producer), Metamorphose: M.C. Escher, 1898-1972 [Doc-
umentary], (1999) Netherlands: Radio Netherlands Television.
[3] G. Polya, Z. Kristallogr. 60 (1924) 278.
[4] A. L. Mackay, Physica A 114 (1982) 609.
[5] D. Shechtman, I. Blech, D. Gratias, and J. W. Cahn, Phys. Rev. Lett. 53 (1984) 1951.
[6] N. Wang, H. Chen, and K. H. Kuo, Phys. Rev. Lett. 59 (1987) 1010.
[7] T. Ishimasa, H.-U. Nissen, and Y. Fukano, Phys. Rev. Lett. 55 (1985) 511.
[8] G. Rhodes, Annu. Rev. Psychol. 57 (2006) 199.
[9] K. Brading and E. Castellani, Symmetries in Physics: Philosophical Reflections, (2003) Cambridge
University Press.
[10] G. Hon and B. Goldstein, From Summetria to Symmetry: The Making of a Revolutionary Scientific
Concept, (2008) Springer.
[11] P. Curie, J. Phys. Theor. Appl. 3 (1894) 393.
[12] E. Wigner, Symmetries and Reflections: Scientific Essays of Eugene P. Wigner, (1967) Indiana
University Press.
[13] E. Noether, Gott. Nachr. (1918) 235.
[14] T. D. Lee, Particle Physics and Introduction to Field Theory, (1981) Harwood Academic Publishers.
[15] E. Wigner, Group Theory and its Application to the Quantum Mechanics of Atomic Spectra, (1959)
Academic Press.
[16] G. ’t Hooft, Sci. Am. 242 (1980) 104.
[17] H. Weyl, Sitzber. Preuss. Akad. (1918) 465.
[18] H. Weyl, Z. Phys. 56 (1929) 330.
81
[19] C. N. Yang and R. Mills, Phys. Rev. 96 (1954) 191.
[20] S. Glashow, Nucl. Phys. 22 (1961) 579.
[21] S. Weinberg, Phys. Rev. Lett. 19 (1967) 1264.
[22] A. Salam, Conf. Proc. C680519 (1968) 367.
[23] G. ’t Hooft, Nucl. Phys. B 35 (1971) 167.
[24] J. Sakurai, Modern Quantum Mechanics, (1994) Addison-Wesley Publishing Company.
[25] F. Hund, Z. Phys. 43 (1927) 805.
[26] G. C. Branco, L. Lavoura, and J. P. Silva, CP Violation, (1999) Oxford University Press.
[27] T. D. Lee and C. N. Yang, Phys. Rev. 104 (1956) 254.
[28] C. S. Wu et al., Phys. Rev. 105 (1957) 1413.
[29] J. P. Lees et al. (BaBar Collaboration), Phys. Rev. Lett. 109 (2012) 211801.
[30] M. Gell-Mann and A. Pais, Phys. Rev. 97 (1955) 1387.
[31] J. Christenson, J. Cronin, V. Fitch, and R. Turlay, Phys. Rev. Lett. 13 (1964) 138.
[32] J. Schwinger, Phys. Rev. 82 (1951) 914.
[33] W. Pauli, Niels Bohr and the Development of Physics: Essays Dedicated to Niels Bohr on the
Occasion of His Seventieth Birthday, (1955) McGraw-Hill.
[34] J. S. Bell, Proc. Roy. Soc. A 231 (1955) 79.
[35] G. Luders, Ann. Phys. 2 (1957) 1.
[36] S. Hawking, Phys. Rev. D 32 (1985) 2489.
[37] J. Beringer et al. (Particle Data Group), Phys. Rev. D 86 (2012) 010001 and 2013 partial update
for the 2014 edition (URL: http://pdg.lbl.gov).
[38] M. Guidry, Gauge Field Theories: An Introduction with Applications, (1991) John Wiley & Sons.
[39] S. Weinberg, The Quantum Theory of Fields, Vol II. Modern Applications, (1996) Cambridge Uni-
versity Press.
[40] J. Goldstone, Nuovo Cimento 19 (1961) 154.
[41] M. Peskin and D. Schroeder, An Introduction to Quantum Field Theory, (1995) Westview Press.
[42] J. Goldstone, A. Salam, and S. Weinberg, Phys. Rev. 127 (1962) 965.
[43] P. Higgs, Phys. Rev. Lett. 13 (1964) 508.
82
[44] P. Higgs, Phys. Rev. 145 (1966) 1156.
[45] F. Englert and R. Brout, Phys. Rev. Lett. 13 (1964) 321.
[46] G. Guralnik, C. Hagen, and T. Kibble, Phys. Rev. Lett. 13 (1964) 585.
[47] G. Aad et al. (ATLAS Collaboration), Phys. Lett. B 716 (2012) 30.
[48] S. Chatrchyan et al. (CMS Collaboration), Phys. Lett. B 716 (2012) 1.
[49] E. Castellani, Found. Sci. 7 (2002) 181.
[50] E. Wigner, Ann. Math. 40 (1939) 149.
[51] M. Gell-Mann and Y. Ne’eman, The Eightfold Way, (1964) W. A. Benjamin.
[52] E. Wigner, Commun. Pur. Appl. Math 13 (1960) 1.
[53] A. Cohen, A. De Rujula, and S. Glashow, Astrophys. J. 495 (1998) 539.
[54] E. Kolb and M. Turner, The Early Universe, (1990) Westview Press.
[55] K. Abe et al. (BESS Collaboration), Phys. Rev. Lett. 108 (2012) 131301.
[56] S. Davidson, E. Nardi, and Y. Nir, Phys. Rep. 466 (2008) 105.
[57] W. Bernreuther, Lect. Notes Phys. 591 (2002) 237.
[58] C. Giunti and C. W. Kim, Fundamentals of Neutrino Physics and Astrophysics, (2007) Oxford
University Press.
[59] J. C. Romao and J. P. Silva, Int. J. Mod. Phys. A 27 (2012) 1230025.
[60] P. Langacker, Phys. Rept. 72 (1981) 185.
[61] S. Weinberg, Phys. Rev. D 19 (1979) 1277.
[62] L. Susskind, Phys. Rev. D 20 (1979) 2619.
[63] S. Glashow, J. Iliopoulos, and L. Maiani, Phys. Rev. D 2 (1970) 1285.
[64] N. Cabibbo, Phys. Rev. Lett. 10 (1963) 531.
[65] M. Kobayashi and T. Maskawa, Prog. Theor. Phys. 49 (1973) 652.
[66] L. Maiani, Phys. Lett. B 62 (1976) 183.
[67] L.-L. Chau and W.-Y Keung, Phys. Rev. Lett. 53 (1984) 1802.
[68] Y. Fukuda et al. (Super-Kamiokande Collaboration), Phys. Rev. Lett. 81 (1998) 1562.
[69] E. Majorana, Nuovo Cimento 14 (1937) 171.
[70] P. West, Introduction to Strings and Branes, (2012) Cambridge University Press.
83
[71] P. Pal, Am. J. Phys. 79 (2011) 485.
[72] S. Weinberg, Phys. Rev. Lett. 43 (1979) 1566.
[73] A. Broncano, M. B. Gavela, and E. Jenkins, Phys. Lett. B 552 (2003) 177.
[74] C. Itzykson and M. Nauenberg, Rev. Mod. Phys. 38 (1966) 95.
[75] P. Minkowski, Phys. Lett. B 67 (1977) 421.
[76] M. Gell-Mann, P. Ramond, and R. Slansky, in Supergravity (eds. P. van Nieuwenhuizen and D.
Freedman), (1979) North-Holland Publishing Company.
[77] T. Yanagida, Conf. Proc. C7902131 (1979) 95.
[78] S. Glashow, in Quarks and Leptons (eds. M. Levy et al.), (1980) Plenum Publishing.
[79] R. Mohapatra and G. Senjanovic, Phys. Rev. Lett. 44 (1980) 912.
[80] G. C. Branco, R. Gonzalez Felipe, and F. R. Joaquim, Rev. Mod. Phys. 84 (2012) 515.
[81] W. Konetschny, W. Kummer, Phys. Lett. B 70 (1977) 433.
[82] T. P. Cheng and L.-F. Li, Phys. Rev. D 22 (1980) 2860.
[83] J. Schechter and J. Valle, Phys. Rev. D 22 (1980) 2227.
[84] G. Lazarides, Q. Shafi, and C. Wetterich, Nucl. Phys. B 181 (1981) 287.
[85] R. Mohapatra and G. Senjanovic, Phys. Rev. D 23 (1981) 165.
[86] D. Griffiths, Introduction to Quantum Mechanics, (1995) Prentice Hall.
[87] A. Abada et al., J. High Energy Phys. 0712 (2007) 061.
[88] F. del Aguila and J.A. Aguilar-Saavedra, Nucl. Phys. B 813 (2009) 22.
[89] S. Kanemura and K. Yagyu, Phys. Rev. D 85 (2012) 115009.
[90] Z. Xing and S. Zhou, Neutrinos in Particle Physics, Astronomy and Cosmology, (2011) Springer.
[91] R. Foot, H. Lew, X. G. He, and G. C. Joshi, Z. Phys. C 44 (1989) 441.
[92] A. Abada et al., Phys. Rev. D 78 (2008) 033007.
[93] R. Mohapatra, Phys. Rev. Lett. 56 (1986) 561.
[94] R. Mohapatra and J. Valle, Phys. Rev. D 34 (1986) 1642.
[95] A. Zee, Nucl. Phys. B 246 (1986) 99.
[96] K. S. Babu, Phys. Lett. B 203 (1988) 132.
[97] F. de Campos et al., J. High Energy Phys. 0805 (2008) 048.
84
[98] X. Ji, R. Mohapatra, S. Nussinov, and Y. Zhang, Phys. Rev. D 78 (2008) 075032.
[99] M.-C. Chen and J. Huang, Mod. Phys. Lett. A 26 (2011) 1147.
[100] B. Pontecorvo, Zh. Eksp. Teor. Fiz. 33 (1957) 549.
[101] Z. Maki, M. Nakagawa, and S. Sakata, Prog. Theor. Phys. 28 (1962) 870.
[102] B. Pontecorvo, Zh. Eksp. Teor. Fiz. 53 (1967) 1717.
[103] D. N. Dinh, A. Ibarra, E. Molinaro, and S. T. Petcov, J. High Energy Phys. 1208 (2012) 125.
[104] J. Adam et al. (MEG Collaboration), Phys. Rev. Lett. 110 (2013) 201801.
[105] J. Kaulard et al. (SINDRUM II Collaboration), Phys. Lett. B 422 (1998) 334.
[106] U. Bellgardt et al. (SINDRUM Collaboration), Nucl. Phys. B 299 (1988) 1.
[107] M. Tortola, Fortschr. Phys. 61 (2013) 427.
[108] S. M. Bilenky, J. Hosek, and S. T. Petcov, Phys. Lett. B 94 (1980) 495.
[109] W. H. Furry, Phys. Rev. 56 (1939) 1184.
[110] W. Rodejohann, Int. J. Mod. Phys. E 20 (2011) 1833.
[111] M. Agostini et al. (GERDA Collaboration), Phys. Rev. Lett. 111 (2013) 122503.
[112] P. A. R. Ade et al. (Planck Collaboration), arXiv:1303.5062 [astro-ph.CO].
[113] P. A. R. Ade et al. (Planck Collaboration), arXiv:1303.5076 [astro-ph.CO].
[114] P. F. Harrison, D. H. Perkins, and W. G. Scott, Phys. Lett. B 530 (2002) 167.
[115] F. P. An et al. (DAYA-BAY Collaboration), Phys. Rev. Lett. 108 (2012) 171803.
[116] J. K. Ahn et al. (RENO Collaboration), Phys. Rev. Lett. 108 (2012) 191802.
[117] Y. Abe et al. (Double Chooz Collaboration), Phys. Rev. D 86 (2012) 052008.
[118] K. Abe et al. (T2K Collaboration), Phys. Rev. Lett. 107 (2011) 041801.
[119] K. Abe et al. (MINOS Collaboration), Phys. Rev. Lett. 107 (2011) 041801.
[120] L. J. Hall, H. Murayama, and N. Weiner, Phys. Rev. Lett. 84 (2000) 2572.
[121] C. D. Froggatt and H. B. Nielsen, Nucl. Phys. B 147 (1979) 277.
[122] G. Altarelli and F. Feruglio, Nucl. Phys. B 720 (2005) 64.
[123] S. F. King and C. Luhn, Rept. Prog. Phys. 76 (2013) 056201.
[124] S. F. King and C. Luhn, J. High Energy Phys. 0910 (2009) 093.
[125] G. Altarelli and F. Feruglio, Nucl. Phys. B 741 (2006) 215.
85
[126] I. de M. Varzielas, S. F. King, and G. G. Ross, Phys. Lett. B 648 (2007) 201.
[127] C. S. Lam, Phys. Rev. Lett. 101 (2008) 121602.
[128] E. Ma and G. Rajasekaran, Phys. Rev. D 64 (2001) 113012.
[129] K. S. Babu, E. Ma, and J. W. F. Valle, Phys. Lett. B 552 (2003) 207.
[130] M. Hirsch et al., Phys.Rev. D 69 (2004) 093006.
[131] R. Gonzalez Felipe, H. Serodio, and J. P. Silva, Phys. Rev. D 87 (2013) 055010.
[132] S. Bhattacharya, E. Ma, A. Natale, and D. Wegman, Phys. Rev. D 87 (2013) 013006.
[133] G. C. Branco, R. Gonzalez Felipe, F. R. Joaquim, and H. Serodio, Phys. Rev. D 86 (2012) 076008.
[134] C. Jarlskog, Phys. Rev. Lett. 55 (1985) 1039.
[135] R. D’Inverno, Introducing Einstein’s Relativity, (1992) Clarendon Press.
[136] C. L. Bennett et al. (WMAP Collaboration), Astrophys. J. Suppl. S. 208 (2013) 20.
[137] A. H. Guth, Phys. Rev. D 23 (1981) 347.
[138] A. L. Linde, Phys. Lett. B 108 (1982) 389.
[139] A. D. Dolgov, Phys. Atom. Nucl. 73 (2010) 815.
[140] K. Huang, Statistical Mechanics (2nd ed.), (1987) John Wiley & Sons.
[141] E. Kolb and S. Wolfram, Nucl. Phys. B 172 (1980) 224.
[142] S. Dodelson, Modern Cosmology, (2003) Academic Press.
[143] J. Bernstein, Kinetic Theory in the Expanding Universe, (2004) Cambridge University Press.
[144] M. Abramowitz and I. Stegun, Handbook of Mathematical Functions, (1964) Dover Publications.
[145] A. Bazavov et al., Phys. Rev. D 80 (2009) 014504.
[146] A. Sakharov, Pisma Zh. Eksp. Teor. Fiz. 5 (1967) 24.
[147] A. D. Dolgov, Phys. Rept. 222 (1992) 309.
[148] S. L. Adler, Phys. Rev. 177 (1969) 2426.
[149] J. S. Bell and R. Jackiw, Nuovo Cim. A 60 (1969) 47.
[150] C. Bouchiat, J. Iliopoulos, and P. Meyer, Phys. Lett. B 38 (1972) 519.
[151] S.-S. Chern and J. Simons, Ann. Math. 99 (1974) 48.
[152] G. ’t Hooft, Phys. Rev. Lett. 37 (1976) 8.
[153] F. R. Klinkhamer and N.S. Manton, Phys. Rev. D 30 (1984) 2212.
86
[154] V. A. Kuzmin, V. A. Rubakov, and M.E. Shaposhnikov, Phys. Lett. B 155 (1985) 36.
[155] D. Bodeker, G. D. Moore, and K. Rummukainen, Phys. Rev. D 61 (2000) 056003.
[156] D. E. Morrissey and M. J. Ramsey-Musolf, New J. Phys. 14 (2012) 125003.
[157] K. Kajantie, M. Laine, K. Rummukainen, and M. Shaposhnikov, Phys. Rev. Lett. 77 (1996) 2887.
[158] G. R. Farrar and M. E. Shaposhnikov, Phys. Rev. D 50 (1994) 774.
[159] M. B. Gavela et al., Nucl. Phys. B 430 (1994) 382.
[160] J. Cline and P.-A. Lemieux, Phys. Rev. D 55 (1997) 3873.
[161] I. Affleck and M. Dine, Nucl. Phys. B 249 (1985) 361.
[162] M. Fukugita and T. Yanagida, Phys. Lett. B 174 (1986) 45.
[163] A. Pilaftsis, Phys. Rev. D 56 (1997) 5431.
[164] A. Pilaftsis and T. Underwood, Phys. Rev. D 72 (2005) 113001.
[165] J. A. Harvey and M. S. Turner, Phys. Rev. D 42 (1990) 3344.
[166] M. Laine and M. Shaposhnikov, Phys. Rev. D 61 (2000) 117302.
[167] P. Gondolo and G. Gelmini, Nucl. Phys. B 360 (1991) 145.
[168] S. R. de Groot, W. A. van Leeuwen, and C. G. van Weert, Relativistic kinetic theory, (1980)
North-Holland Publishing Company.
[169] G. F. Giudice et al., Nucl. Phys. B 685 (2004) 89.
[170] A. Strumia, Nucl. Phys. B 809 (2009) 308.
[171] M. A. Luty, Phys. Rev. D 45 (1992) 455.
[172] A. Basbol and S. Hannestad, J. Cosmol. Astropart. P. 0701 (2007) 003.
[173] R. Barbieri, P. Creminelli, A. Strumia, and N. Tetradis, Nucl. Phys. B 575 (2000) 61.
[174] E. Nardi, Y. Nir, E. Roulet, and J. Racker, J. High Energy Phys. 0601 (2006) 164.
[175] A. Abada et al., J. High Energy Phys. 0609 (2006) 010.
[176] T. Hambye, New J. Phys. 14 (2012) 125014.
[177] C. F. Berger, L. Covi, S. Kraml, and F. Palorini, J. Cosmol. Astropart. P. 0810 (2008) 005.
[178] T. P. Cheng and L.-F. Li, Gauge Theory of Elementary Particle Physics, (1984) Clarendon Press.
[179] T. Hambye, M. Raidal, and A. Strumia, Phys. Lett. B 632 (2006) 667.
[180] A. Denner, H. Eck, O. Hahn, and J. Kublbeck, Nucl. Phys. B 387 (1992) 467.
87
[181] G. Goldberg, Phys. Rev. D. 32 (1985) 3331.
[182] M. Veltman, Diagrammatica: The Path to Feynman Diagrams, (1994) Cambridge University Press.
[183] W.-K. Tung, Group Theory in Physics, (1985) World Scientific Publishing Company.
88
Computing Diagrams
with Majorana Fermions ASuppose one wants to study a given scattering (or decay) process. One starts by drawing all possible
Feynman diagrams corresponding to that process, up to a certain order. Consider now one of such
diagrams. The philosophy for computing its amplitude M without resorting explicitly to the charge
conjugation matrix C is, according to Ref. [180], as follows:
• Assign to each fermion chain in the diagram an arbitrary orientation (fermion flow).
• Proceeding along each fermion flow chain, starting at its termination (for closed loops, at an arbi-
trary propagator), write down the appropriate spinors, vertex terms (see below) and propagators
in accordance with the rules of Fig. A.1.
• Include a factor of (−1) for every closed loop.
• Choose a reference order for the order of the spinors in the expression of the amplitude (the ordering
refers to the particle label of each spinor).
• After exhausting every fermion chain, include a factor of (−1)p in the amplitude, where p denotes
the permutation parity of the spinor order in this diagram with respect to the previously chosen
reference order.
• Include a combinatorial factor, related to the topology of the diagram [181,182]. Majorana fermions
are to be treated here in the same way as real scalar or vector fields.
• For diagrams involving vector bosons, include the appropriate propagators and polarization vectors.
Vertex contributions are extracted from the Lagrangian density by reading off a quantity hΓ, where Γ
includes all nontrivial matrices in Dirac space and h corresponds to a numerical factor. The vertex term
will then be either i η hΓ or i η hΓ′ depending on the fermion flow choice (see Fig. A.1), where η denotes
an identical particle factor – one has η =∏i ni! for a vertex involving ni (indistinguishable) particles of
each type i – and Γ′ is defined as Γ′ ≡ C ΓT C−1. In most cases of interest, Γ = PR,L = C PTR,L C−1 = Γ′,
and so vertex contributions will be independent from the choice of a fermion flow.
89
u(p, s)
v(p, s)
v(p, s)
u(p, s)
i η hΓ i η hΓ′
ψc
ψ
ψc
ψ
S(p) ≡ 1
/p−m + iε
i S(+p)
i S(−p)
i S(+p)
Figure A.1: Rules for writing propagator, vertex and spinor contributions obtained from Ref. [180]. In
our Feynman diagrams, time is taken to flow from the left to the right.
90
Clebsch-Gordan Coefficients for A4 BB.1 General Description of the Group
The A4 group, introduced in Section 1.2, corresponds to the group of even permutations of four
elements. The group contains nG ≡ #A4 = 12 elements, explicitly given in (1.4), and four conjugacy
classes which partition the whole set:
C1 = e, n1 = 1,
C2 = (12)(34), (13)(24), (14)(23), n2 = 3,
C3 = (123), (142), (134), (243), n3 = 4,
C4 = (132), (124), (143), (234), n4 = 4,
(B.1)
where ni is the number of elements of each class Ci. The results presented in this appendix extensively
rely on group theory definitions and results, most of which can be found in Ref. [183].
The number of irreducible representations of the group (irreps., see Footnote 4 of Chapter 1) equals
the number of its conjugacy classes, nC = 4. Since the 1D (ir)reps. of A4 coincide with those of
A4/[A4, A4], and since the commutator [A4, A4] is isomorphic to the Klein group K = Z2 × Z2, one has
#(A4/[A4, A4]) = 12/4 = 3. Hence, A4/[A4, A4] ∼= Z3 and so A4 has 3 (degenerate) 1D representations.
The dimension n of the remaining irreducible representation of A4 is given by:
∑
irreps. µ
nµ2 = nG ⇒ 12 + 12 + 12 + n2 = 12 ⇒ n = 3, (B.2)
where nµ represents the dimension of the irrep. labeled by µ.
The character (χµi ) table for this group is given in Table B.1. It can be constructed from the known 1D
irreps of Z3 and by considering the following relations of completeness and orthonormality, respectively:
∑
µ
ninG
(χµi )∗χµj = δij . (B.3)
∑
i
ninG
(χµi )∗χνi = δµν , (B.4)
From relation (B.3) one obtains the value of χ33 :
∑
µ
n3
nG(χµ3 )
∗χµ3 = 1 ⇒
∑
µ
|χµ3 |2
=12
4⇒ |1|2 + |ω|2 +
∣∣ω2∣∣2 +
∣∣χ33
∣∣2 = 3 ⇒ χ33 = 0. (B.5)
91
χ1 χ2 χ3 χ4
1 1 1 1 1
1′ 1 1 ω ω2
1′′ 1 1 ω2 ω
3 3 χ32 = −1 χ3
3 = 0 χ34 = 0
Table B.1: Character table for the group A4. The entries correspond to the value of the character
χµi ≡ Tr Uµ(g), g ∈ Ci for each of the nC irreducible representations. As in Section 1.2, UR(g) ≡ URg
corresponds, for a given group representation R, to the matrix onto which g is mapped. The definition
ω ≡ exp(i 2π/3) for one of the cube roots of unity is also considered.
e s2 = t3 (123) t (132) t2
(12)(34) s (142) sts (124) tst = st2s
(13)(24) t2st (134) ts (143) st2
(14)(23) tst2 (243) st (234) t2s
Table B.2: Explicit decomposition of the elements of A4 in terms of s and t.
By the same argument, one shows that χ34 = 0. Finally, for χ3
2 and from (B.4), one has:
∑
i
ninG
(χ1i
)∗χ3i = 0 ⇒
∑
i
ni χ3i = 0 ⇒ 1× 3 + 3χ3
2 = 0 ⇒ χ32 = −1. (B.6)
The whole group can be obtained from two generators, s ≡ (12)(34) and t ≡ (123). The explicit
decomposition of group elements in terms of products of these generators is presented in Table B.2.
B.2 Choice of an Explicit 3D Representation
For 1D representations, the character coincides with the representation. For the remaining 3D irrep.,
however, one must make a choice and select which of the non-trivial group elements corresponds to a
diagonal matrix. Here, the choice of Altarelli and Feruglio [125] is considered. In particular, U3(t), the
3× 3 matrix representing t in the 3D irrep. of A4 is chosen to be diagonal.
It is important to mention that the choice of basis for the 3D representation affects the value of the
Clebsch-Gordan coefficients (CGCs) and thus consistency is required. Although our choice of U3(t) will
differ slightly from that of Altarelli and Feruglio, the CGCs obtained here are the same up to meaningless
normalization. An alternative option for the shape of the 3D irrep. matrices comes from considering
U3(s) diagonal instead, as is done by Ma and Rajasekaran [128].
While working with discrete groups one can always (and will here) choose a basis for which the
representation matrices are unitary. Since t3 = e and U3(e) = 13×3, the elements on the diagonal of
U3(t) will be cube roots of unity. In order to obtain the right character, χ33 = 0 (t ∈ C3), these roots
92
must all be different and one may choose:
U3(t) =
1 0 0
0 ω 0
0 0 ω2
. (B.7)
Constructing the explicit form of U3(s) automatically defines all matrices U3(g) with g ∈ A4, thanks
to the possibility of decomposing every group element into products of t and s. Since s2 = e, U3(s)2 =
13×3 = U3(s)−1U3(s), and so the matrix U3(s) is its own inverse. Due to it is unitarity, U3(s) is also
Hermitian. Imposing the condition χ32 = −1, one has:
U3(s) =
a b c
b∗ d e
c∗ e∗ −1− a− d
, (B.8)
with a and d constrained to be real. Since ts ∈ C3, one must have Tr[U3(ts)] = χ33 = 0, and st2 ∈ C4, so
Tr[U3(st2)] = χ34 = 0. Thus:
Tr[U3(ts)
]= 0
Tr[U3(st2)
]= 0
⇒
a+ ωd− ω2(1 + a+ d) = 0
a+ ω2d− ω(1 + a+ d) = 0⇒
a = −1/3
d = −1/3. (B.9)
The requirement that U3(s) is unitary corresponds to having orthonormal vectors in the matrix lines:
1/9 + |b|2 + |c|2 = 1/9 + |b|2 + |e|2 = 1/9 + |c|2 + |e|2 = 1 ⇒ |b|2 = |c|2 = |e|2 = 4/9. This condition
can be fulfilled with b, c and e real and positive: b = c = e = 2/3. We have thus explicitly built the 3D
irreducible representation of A4, in a basis corresponding to the matrices:
U3(e) =
1 0 0
0 1 0
0 0 1
, U3(t) =
1 0 0
0 ω 0
0 0 ω2
, U3(s) =
1
3
−1 2 2
2 −1 2
2 2 −1
. (B.10)
B.3 The Tensor Product Representation
Every group representation R can be decomposed into a direct sum of irreducible representations
(chosen unitary). In the vector space basis where this decomposition is possible one has:
UR(g) =⊕
irreps. µ
aµ Uµ(g), ∀ g ∈ A4, (B.11)
where aµ denotes the multiplicity of the irrep. µ.
Consider in particular the representation obtained from the tensor product of two 3D irreps. of A4,
denoted 3 ⊗ 3. To obtain the multiplicities a1, a1′ , a1′′ and a3, one can use the following expression,
which is a consequence of (B.4) and (B.11):
aµ = (χµ)† · χR, where χR ≡
(——–
√ninG
χRi ——–
). (B.12)
93
Taking into account the property χµ⊗νi = χµi × χνi , one obtains, for the multiplicities:
χ1 =1√12
(1,√
3, 2, 2)
χ1′ =1√12
(1,√
3, 2ω, 2ω2)
χ1′′ =1√12
(1,√
3, 2ω2, 2ω)
χ3 =1√12
(3, −√
3, 0, 0)
χ3⊗3 =1√12
(9,√
3, 0, 0)
⇒
a1 =1× 9 +
√3×√
3
12= 1
a1′ =1× 9 +
√3×√
3
12= 1
a1′′ =1× 9 +
√3×√
3
12= 1
a3 =3× 9−
√3×√
3
12= 2
. (B.13)
As a consequence of (B.11) and (B.13), one concludes that 3⊗ 3 = 1⊕ 1′ ⊕ 1′′ ⊕ 3⊕ 3. This means
that there is a certain basis for which every U3⊗3(g) matrix will be unitary and of the form:
U3⊗3(g) =
U1(g)
U1′(g)
U1′′(g) U3(g)
U3(g)
. (B.14)
As we calculate the CGCs in the next section we will see that there is a possibility for certain symmetric
and antisymmetric choices which will allow to make sense out of writing 3⊗3 = 1⊕1′⊕1′′⊕3s⊕3a.
B.4 Computing the CGCs
We will denote the basis in which U3⊗3(g) has the form (B.14) by a tilde. Consider two A4 triplets,
~a = (a1, a2, a3) and ~b = (b1, b2, b3), with ai, bi ∈ C. Under the action of an element of the group they
will transform according to some 3D representation matrices. Assuming henceforth that they transform
according to the choice of (B.10), one has:
~at−→ ~a ′ = U3(t)~a =
1 0 0
0 ω 0
0 0 ω2
a1
a2
a3
=
a1
ωa2
ω2a3
=
a′1a′2a′3
, (B.15)
~as−→ ~a ′′ = U3(s)~a =
1
3
−1 2 2
2 −1 2
2 2 −1
a1
a2
a3
=
1
3
−a1 + 2a2 + 2a3
2a1 − a2 + 2a3
2a1 + 2a2 − a3
=
a′′1a′′2a′′3
. (B.16)
We now need to see how the tensor product(~a⊗~b
)transforms:
(~a⊗~b
)t−→(~a⊗~b
)′= U3⊗3(t)
(~a⊗~b
), (B.17)
(~a⊗~b
)≡ (a1b1, a1b2, a1b3, a2b1, a2b2, a2b3, a3b1, a3b2, a3b3), (B.18)
(~a⊗~b
)′≡ (a1b1
′, a1b2′, a1b3
′, a2b1′, a2b2
′, a2b3′, a3b1
′, a3b2′, a3b3
′). (B.19)
94
To determine the shape of U3⊗3(t), one simply considers what happens to the products aibj when
both ~a and ~b are transformed simultaneously under U3(t). Using (B.15):
(~a⊗~b
)′= U3⊗3(t)
(~a⊗~b
)=((
U3(t)~a)⊗(U3(t)~b
) )=
a1
ωa2
ω2a3
⊗
b1
ωb2
ω2b3
= (a1b1, ω a1b2, ω2 a1b3, ω a2b1, ω
2 a2b2, a2b3, ω2 a3b1, a3b2, ω a3b3).
(B.20)
One thus concludes that U3⊗3(t) = diag(1, ω, ω2, ω, ω2, 1, ω2, 1, ω). In the same manner, one can
resort to the transformation rule of (B.16) in order to obtain the explicit form of U3⊗3(s):
U3⊗3(s) =1
3
−U3
s 2 U3s 2 U3
s
2 U3s −U3
s 2 U3s
2 U3s 2 U3
s −U3s
=
1
9
1 −2 −2 −2 4 4 −2 4 4
−2 1 −2 4 −2 4 4 −2 4
−2 −2 1 4 4 −2 4 4 −2
−2 4 4 1 −2 −2 −2 4 4
4 −2 4 −2 1 −2 4 −2 4
4 4 −2 −2 −2 1 4 4 −2
−2 4 4 −2 4 4 1 −2 −2
4 −2 4 4 −2 4 −2 1 −2
4 4 −2 4 4 −2 −2 −2 1
. (B.21)
We now need to find the vectors vi that make the change of basis which leaves U3⊗3(t) and U3⊗3(s)
in the form of (B.14), i.e. transforms these matrices into U3⊗3(t) and U3⊗3(s). We denote by S the
change of basis matrix, whose columns are the vectors vi as written in the current basis (no tilde):
S =
| | |v1 v2 · · · v9
| | |
, (B.22)
such that U3⊗3(g) = S−1 U3⊗3(g)S ⇔ U3⊗3(g) = S U3⊗3(g)S−1.
In the tilde basis, the change of basis vectors, vi = S−1vi, correspond (by definition of a change of
basis) to vectors ei with zeros in every entry except for a one in the j-th position. We construct the vi
vectors by requiring that they be orthonormal, which implies the unitarity of the change of basis matrix,
S† = S−1. Before proceeding, one presents the explicit matrix representations of the generators in the
tilde basis. For t, the matrix is U3⊗3(t) = diag(1, ω, ω2, 1, ω, ω2, 1, ω, ω2), while for s, one has:
U3⊗3(s) =
13×3
1
3
−1 2 2
2 −1 2
2 2 −1
1
3
−1 2 2
2 −1 2
2 2 −1
. (B.23)
One may first determine vi with i = 1, 2, 3 by noticing that these vectors must be eigenvectors of both
U3⊗3(t) and U3⊗3(s) with eigenvalues λ′1 = 1, λ′2 = ω, λ′3 = ω2 and λ′′1 = λ′′2 = λ′′3 = 1, respectively.
Let us determine v1. One sees that the eigenvectors of U3⊗3(t) with eigenvalue 1 are clearly linear
combinations of e1, e6 and e8. At the same time, v1 must be an eigenvector of U3⊗3(s) with the same
95
eigenvalue. The space of such eigenvectors is spanned by:
p1 =(0, 1, 0, 1, 0, 0, 0, 0, 1
),
p2 =(0, 0, 1, 1, 0, 0, 0, 1, 0
),
p3 =(− 1, 1, 1, 1, 0, 0, 1, 0, 0
),
p4 =(1, 0,−1,−1, 0, 1, 0, 0, 0
),
p5 =(1,−1, 0,−1, 1, 0, 0, 0, 0
).
(B.24)
Combining the constraints, one sees that there can be no p1, p3 or p5 in v1 since these vectors can
never be obtained by linear combinations of e1, e6 and e8. The vectors p2 and p4 remain. From the
fact that their linear combination must cancel the third component, one concludes that v1 ∝ p2 + p4.
Normalizing the sum yields:
v1 =1√3
(1, 0, 0, 0, 0, 1, 0, 1, 0
). (B.25)
The vector v2 will also be a linear combination of the pi, and simultaneously, by looking at U3⊗3(t),
a combination of e2, e4 and e9 – the eigenvectors of U3⊗3(t) with eigenvalue ω. Now, p2 through p5 are
excluded, leading to the conclusion that v2 ∝ p1, and so:
v2 =1√3
(0, 1, 0, 1, 0, 0, 0, 0, 1
). (B.26)
Finally, by looking at U3⊗3(t), one sees that v3 will be a linear combination of e3, e5 and e7, which
excludes p1, p2 and p4. Once again, one must consider v1 ∝ p3 + p5 to get rid of any components aside
from the third, fifth and seventh, concluding that:
v3 =1√3
(0, 0, 1, 0, 1, 0, 1, 0, 0
). (B.27)
To find out the remaining vi, we restrict ourselves to the space spanned by the orthogonal1 vectors:
q1 = (−1, 0, 0, 0, 0, 1, 0, 0, 0),
q2 = (1, 0, 0, 0, 0, 1, 0,−2, 0),
q3 = (0,−1, 0, 1, 0, 0, 0, 0, 0),
q4 = (0, 1, 0, 1, 0, 0, 0, 0,−2),
q5 = (0, 0,−1, 0, 1, 0, 0, 0, 0),
q6 = (0, 0, 1, 0, 1, 0,−2, 0, 0).
(B.28)
Consider now the effect of U3⊗3(t) on v4. One sees that v4 and v7 are eigenvectors of U3⊗3(t) with
eigenvalue 1. Thus, v4 and v7 will be linear combinations of e1, e6 and e8 orthogonal to v1. Looking at
the available qi vectors, one realizes that v4 and v7 must be given by combinations of q1 and q2:
v4 = x q1 + y q2 =(y − x, 0, 0, 0, 0, x+ y, 0,−2y, 0
),
v7 = x q1 + y q2 =(y − x, 0, 0, 0, 0, x+ y, 0,−2y, 0
).
(B.29)
1The qi vectors are not only orthogonal between themselves but also span the space orthogonal to the one of v1, v2 andv3. Normalization of these vectors is avoided as it is cumbersome to work with.
96
The values of x, y, x and y are chosen such that v4 and v7 are orthogonal and normalized, since S is
taken to be unitary. A general vector(~a⊗~b
)will transform to the tilde basis as:
˜(~a⊗~b
)= S†
(~a⊗~b
)⇔
a1b1
a1b2...
a3b3
=
———– v∗1 ———–
———– v∗2 ———–...
———– v∗9 ———–
a1b1
a1b2...
a3b3
. (B.30)
Looking at (B.30) and the previous expressions (B.29) for v4 and v7, one sees that:
a2b1 = (y − x)∗a1b1 + (x+ y)∗a2b3 + (−2y)∗a3b2,
a3b1 = (y − x)∗a1b1 + (x+ y)∗a2b3 + (−2y)∗a3b2.(B.31)
In the tilde basis, (a2b1, a2b2, a2b3) and (a3b1, a3b2, a3b3) transform in the same way under the action
of any group element. It is therefore safe to symmetrize and antisymmetrize the sum a2b1 + a3b1 in the
indexes i, j of the aibj . One then obtains constraints on x, y, x and y by assuming that v4 leads to a
symmetric a2b1 and v7 to an antisymmetric a3b1.
Once one has obtained v4 and v7, the remaining vectors vi will follow since, in the tilde basis, the
transformations of v4, v5 and v6 are related (the same happens for v7, v8 and v9). One will later see that
the (anti)symmetrization choice will naturally extend from v4 (v7) to v5 and v6 (v8 and v9).
Consider the following definitions for the coefficients Sij and Aij :
a2b1 ≡∑
i,j
Sij aibj , a3b1 ≡∑
i,j
Aij aibj . (B.32)
One starts by expanding the sum of a2b1 and a3b1 in terms of the aibj , as implied by (B.31):
∑i,j σij aibj ≡
∑i,j Sij aibj +
∑i,j Aij aibj = a2b1 + a3b1
= (y − x)∗a1b1 + (x+ y)∗a2b3 + (−2y)∗a3b2
+ (y − x)∗a1b1 + (x+ y)∗a2b3 + (−2y)∗a3b2
= (y + y − x− x)∗a1b1 + (x+ x+ y + y)∗a2b3 + (−2y − 2y)∗a3b2.
(B.33)
To constrain v4 and v7, (anti)symmetrization conditions are imposed:
S11 = σ11 ∧ A11 = 0
S23 = S32 =σ23 + σ32
2
A23 = −A32 =σ23 − σ32
2
⇒
(y − x)∗ = (y + y − x− x)∗ ∧ (y − x)∗ = 0
(x+ y)∗ = (−2y)∗ =(x+ x− y − y)∗
2
(x+ y)∗ = (2y)∗ =(x+ x+ 3y + 3y)∗
2
. (B.34)
Solving the previous system yields x = y and x = −3y. One can then write v4 and v7, unnormalized:
v4 = x(− 4/3, 0, 0, 0, 0, 2/3, 0, 2/3, 0
), (B.35)
v7 = x(0, 0, 0, 0, 0, 2, 0,−2, 0
). (B.36)
These vectors are clearly orthogonal, as one could have anticipated from the definitions of (B.32).
The orthornormality of the remaining vi will not be spoilt by the (anti)symmetrization choices.
97
Normalization implies |x|2 = 9/24 and |x|2 = 1/8. Choosing x = −(1/2)√
3/2 and x = 1/(2√
2):
v4 =1√6
(2, 0, 0, 0, 0,−1, 0,−1, 0
), (B.37)
v7 =1√2
(0, 0, 0, 0, 0, 1, 0,−1, 0
). (B.38)
Regarding the remaining vi vectors, one sees that v5 and v8 are eigenvectors of U3⊗3(t) with eigenvalue
ω and that v6 and v9 are eigenvectors of U3⊗3(t) with eigenvalue ω2. Thus, v5 and v8 will be linear
combinations of e2, e4, and e9, orthogonal to v2, whereas v6 and v9 will be linear combinations of e3, e5,
and e7, orthogonal to v3. Scanning the available qi vectors, one realizes that v5 and v8 must be given by
combinations of q3 and q4, and that v6 and v9 must be given by combinations of q5 and q6:
v5 = γ q3 + δ q4, v6 = ε q5 + η q6, v8 = γ q3 + δ q4, v9 = ε q5 + η q6. (B.39)
In the tilde basis, by looking at the explicit form of U3⊗3(s), one obtains the following relation:
U3⊗3(s) v4 = −1
3v4 +
2
3v5 +
2
3v6 ⇒ S U3⊗3(s)S−1S v4 = −1
3S v4 +
2
3S v5 +
2
3S v6
⇒ U3⊗3(s) v4 = −1
3v4 +
2
3v5 +
2
3v6 .
(B.40)
Then, by (B.29):
3xU3⊗3(s) q1 + 3 yU3⊗3(s) q2 = −v4 + 2v5 + 2v6
⇒ 3x q′′1 + 3 y q′′2 = −x q1 − y q2 + 2 γ q3 + 2 δ q4 + 2 ε q5 + 2 η q6.(B.41)
From the explicit form of U3⊗3(s) in Eq. (B.21) one reads off q′′1 and q′′2 , which can be decomposed
as a linear combination of qi vectors as follows2:
q′′1 = −1
3
(q1 + q3 + q4 + q5 + q6
), q′′2 = −1
3
(q2 + 3q3 + q4 − 3q5 + q6
). (B.42)
Inserting this result in (B.41), and using the fact that the qi are linearly independent, gives:
−x− 3y = 2γ
x− y = 2δ
−x+ 3y = 2ε
−x− y = 2η
⇒
γ = − 12x− 3
2y
δ = 12x− 1
2y
ε = − 12x+ 3
2y
η = − 12x− 1
2y
. (B.43)
By noticing that the same procedure can be applied to v7, v8 and v9 to obtain the barred variables,
and by considering (B.34) as well as the given choices for x and x, it follows that:
γ = 0
δ = 23x = − 1√
6
ε = −x = 32
1√6
η = − 13x = 1
21√6
∧
γ = − 12 x− 3
2 y = −2x = − 1√2
δ = 12 x− 1
2 y = 0
ε = − 12 x+ 3
2 y = x = 12√
2
η = − 12 x− 1
2 y = −x = − 12√
2
. (B.44)
2Notice that the subspace spanned by v1, v2 and v3 has already been dealt with.
98
One can now write vi for i = 5, 6, 8, 9:
v5 =1√6
(0,−1, 0,−1, 0, 0, 0, 0, 2
), (B.45)
v6 =1√6
(0, 0,−1, 0, 2, 0,−1, 0, 0
), (B.46)
v8 =1√2
(0, 1, 0,−1, 0, 0, 0, 0, 0
), (B.47)
v9 =1√2
(0, 0,−1, 0, 0, 0, 1, 0, 0
). (B.48)
Using the above Eqs. (B.25)-(B.27), (B.37)-(B.38), and (B.45)-(B.48) for the change of basis vectors
vi one can determine the (unitary) change of basis matrix. It’s inverse is explicitly given by:
S−1 = S† =1√6
√2 0 0 0 0
√2 0
√2 0
0√
2 0√
2 0 0 0 0√
2
0 0√
2 0√
2 0√
2 0 0
2 0 0 0 0 −1 0 −1 0
0 −1 0 −1 0 0 0 0 2
0 0 −1 0 2 0 −1 0 0
0 0 0 0 0√
3 0 −√
3 0
0√
3 0 −√
3 0 0 0 0 0
0 0 −√
3 0 0 0√
3 0 0
. (B.49)
To obtain the CGCs, consider a general vector(~a⊗~b
)in the non-tilde basis, where ~a and ~b transform
according to (B.15) and (B.16). Describing this vector in the tilde basis produces:
˜(~a⊗~b
)= S†
(~a⊗~b
)⇒
a1b1
a1b2
a1b3
a2b1
a2b2
a2b3
a3b1
a3b2
a3b3
=
1√3(a1b1 + a2b3 + a3b2)
1√3(a1b2 + a2b1 + a3b3)
1√3(a1b3 + a2b2 + a3b1)
1√6(2a1b1 − a2b3 − a3b2)
1√6(−a1b2 − a2b1 + 2a3b3)
1√6(−a1b3 + 2a2b2 − a3b1)
1√2(a2b3 − a3b2)
1√2(a1b2 − a2b1)
1√2(−a1b3 + a3b1)
. (B.50)
This, for example, means that when one applies any group transformation, a1b1 remains invari-
ant, which means 1√3(a1b1 + a2b3 + a3b2) transforms trivially (common factors like 1/
√3 can be ig-
nored). Thus, considering (B.14), one finally extracts the Clebsch-Gordan decomposition for the 3⊗ 3 =
1⊕ 1′ ⊕ 1′′ ⊕ 3s ⊕ 3a tensor product representation from (B.50):
a1b1 + a2b3 + a3b2 ∼ 1,
a1b2 + a2b1 + a3b3 ∼ 1′,
a1b3 + a2b2 + a3b1 ∼ 1′′,
2a1b1 − a2b3 − a3b2
−a1b2 − a2b1 + 2a3b3
−a1b3 + 2a2b2 − a3b1
∼ 3s,
a2b3 − a3b2
a1b2 − a2b1
−a1b3 + a3b1
∼ 3a. (B.51)
99
100
Top Related