Post on 10-Feb-2016
description
1
Summary and ConclusionsKyle Cranmer (New York University)
Harrison B. Prosper (Florida State University)Sezen Sekmen (CERN)
LPCC Workshop: Likelihoods for LHC Searches
LPCC Workshop on Likelihoods CERN
List of TalksDay 1
h Sezen Goalsh Glen Principlesh Kyle Context/Scope
Feedbackh Marcoh Maggieh Béranger
Day 2h Kyle HistFactoryh Sven ATLAS HZZ4lHiggs Combination h Minshui CMSh Haoshuang ATLAS
Day 3h Wolfgangh Javier (thanks Maurizio!)h WouterPanelists Sünje, Mike,
Lorenzo2LPCC Workshop on Likelihoods CERN
DAY 1
LPCC Workshop on Likelihoods CERN 3
Sezen: Workshop Goals
Goalsh Educate ourselves: why are likelihoods needed?h Move towards routine publication of likelihoods
LPCC Workshop on Likelihoods CERN 4
LPCC Workshop on Likelihoods CERN 5
Glen: Basic Ideas
DistributionProbability density (or mass) function, Nature(x)x potential observations
ModelP(x | μ, θ) is a parametric model of the unknown function Nature(x) with parameters μ and θ, some of which are interesting (μ) and some not (θ).
LikelihoodL(μ, θ) = L(D | μ, θ) = P(D | μ, θ) D = observed data
LPCC Workshop on Likelihoods CERN 6
Glen: Basic Ideas
Need a way to get rid of parameters not of current interest. There are two general ways, marginalization and profiling:
Marginal Likelihood
Profile Likelihood
Profiling can be regarded as marginalization with the prior
Lm (x | μ)= L(x |μ, θ)∫ π(θ)dθ
Lp (x | μ)=L(x |μ, ˆ̂θ(μ))
π (θ ) = δ (θ − ˆ̂θ )
Kyle: Context & Scope
LPCC Workshop on Likelihoods CERN 7
Feedback
Marco: Is it the SM Higgs?
LHC Higgs Cross Section Working GroupAssumptions
h SM tensor structure (CP-even scalar)h A single zero-width resonanceh κi = σi / σSMi and κf = Γf / ΓSMi are free parameters, where
How do we best report experimental results (with the goal of allowing more detailed/accurate studies)?
LPCC Workshop on Likelihoods CERN 9
€
σ ⋅BR(ii → H → ff ) = σ SM ⋅BRSM
κ i2 ⋅κ f
2
κ H2
Maggie: Is it the SM Higgs?
LPCC Workshop on Likelihoods CERN 10
Can use an effective field theory (EFT) approach:
LPCC Workshop on Likelihoods CERN 11
Maggie: Is it the SM Higgs?
Béranger: Is it the SM Higgs?
Effective Lagrangian
Fitting procedure
LPCC Workshop on Likelihoods CERN 12
Béranger: Is it the SM Higgs?
LPCC Workshop on Likelihoods CERN 13
DAY 2
LPCC Workshop on Likelihoods CERN 14
Kyle: HistFactory
Equivalent to a multi-bin Poisson model with bins so small that the chance of > a single count per bin is negligible
n is the number of events and {xe} are the measurements (e.g., the di-photon masses)
In general, f is a mixture:
LPCC Workshop on Likelihoods CERN 15
Kyle: HistFactory
which, in this case, represents a Gaussian G(x| μ, σ).
fp(ap | αp) are the likelihoods of the auxiliary measurements ap from either real, simulated, or hypothetical experiments.
These functions provide constraints on the parameters α and hence on the parameters νc(α).
LPCC Workshop on Likelihoods CERN 16
Kyle: HistFactory
LPCC Workshop on Likelihoods CERN 17
XML representation of model
Kyle
http://www.brianlemay.com/
HistFactory
RooWorkspace
LPCC Workshop on Likelihoods CERN 18
Sven: HZZ*(4l) in ATLAS
LPCC Workshop on Likelihoods CERN 19
Sven: HZZ*(4l) in ATLAS
Cranmer, K, Kernel Estimation in High-Energy Physics Computer Physics Communications 136:198-207, 2001hep ex/0011057
Kernel density estimation+ density morphing+ HistFactory
LPCC Workshop on Likelihoods CERN 20
Sven: HZZ*(4l) in ATLAS
Editorial comment: Jack’s intuition is spot on! For discrepantresults, the combined result ought to be worse.
LPCC Workshop on Likelihoods CERN 21
Sven: HZZ*(4l) in ATLAS
Clarity Prize goes to Sven for explaining to me why a p-value computed from the background-only hypothesis depends on the alternative hypothesis!
Harrison: “Please explain this plot”Sven: “The sampling distributionof t(x) = -2 ln Lp/Lmax is independentof mH, as it should be, but the powerof the test is maximized for each mH,so the observed value of t changes with mH”
Higgs Combination
Mingshui: Higgs Combination (CMS)
Model: Marked Poisson Process (see Kyle’s HistFactory talk)LEP
No constraints for parameters θ with systematic uncertainties
TevatronUse priors π(θ|θ0) to constrain θ
LHCInterpret π(θ|θ0) as π(θ|θ0) ~ f (θ0|θ) π(θ)
Cowanscher Ur-prior!and interpret f (θ0|θ) as the likelihood for auxiliary measurements θ0
LPCC Workshop on Likelihoods CERN 23
Mingshui: Higgs Combination (CMS)
Assumptions (current measurements)h Data are disjointh Standard Model with mH and μ as free parametersh Same mH for all channels
Detailed models can be provided in RooWorkspace formLPCC Workshop on Likelihoods CERN 24
Haoshuang: Higgs Combination (ATLAS)
Basic tool is HistFactory for all channels except for H to γγA Single Channel
LPCC Workshop on Likelihoods CERN 25
Haoshuang: Higgs Combination (ATLAS)
Important point In combining channels the Greek symbol fallacy is avoided.
An explicit decision must be made about how parameters with the same name are related, if at all.
Typically done by modifying the XML representation of the model.
LPCC Workshop on Likelihoods CERN 26
DAY 3
LPCC Workshop on Likelihoods CERN 27
Wolfgang: BSM Searches
Guided by a well-motivated theory, e.g., the pMSSM, and its simplified model decomposition
pMSSM Results (non-CMS)
…but CMS pMSSM / SMs analysis in progress…
LPCC Workshop on Likelihoods CERN 28
Wolfgang: BSM Searches
LPCC Workshop on Likelihoods CERN 29
Javier: BSM Searches
LPCC Workshop on Likelihoods CERN 30
Javier: BSM Searches
LPCC Workshop on Likelihoods CERN 31
Javier: BSM Searches
LPCC Workshop on Likelihoods CERN 32
Javier: BSM Searches
LPCC Workshop on Likelihoods CERN 33
Nuisance parametersmarginalized throughMonte Carlo integration
Wouter: RooFit
RooFit is a probability modeling language:
RooStats provides high level statistical tools that use RooFit models
LPCC Workshop on Likelihoods CERN 34
Wouter: RooFit
A RooWorkspace is a mechanism to store a model + data
LPCC Workshop on Likelihoods CERN 35
Panel Discussion
Sünje, Mike, Lorenzo
HEPData on INSPIRE Make data sets searchable, findable, citableAssign Digital Object Identifier (DOI) to datah Should we track the re-use of data?h Should we have a single portal (e.g, Inspire)?h Will will have a single portal?h Will need non-web access alsoh RECAST requests that are honored could yield citationh Are there legal issues?
LPCC Workshop on Likelihoods CERN 36
CONCLUSIONS
LPCC Workshop on Likelihoods CERN 37
ICHEP 2040
LPCC Workshop on Likelihoods CERN 38
Data
pNMSSM
OTTRTA
p(Data | Theory)
SMme, mμ, mτ
mu, md, ms, mc, mb, mt
θ12, θ23, θ13, δg1, g2, g3
θQCD
μ, λ
The New Standard Model has been firmly established
p(Theory | Data)
Conclusions
We could do a better job of understanding the LHC data if more information were made public in a systematic way
A general way to do this is to publish the probability model + relevant data set
The technology exists (RooWorkspace, Inspire, HepData) to publish arbitrarily complicated models, retrieve them and use them in analyses
My sense is that our field is nearing a tipping point, for the better!
LPCC Workshop on Likelihoods CERN 39
Thanks!
h We thank the LHC Physics Centre at CERN (LPCC) for hosting this workshop and its financial support of two RooStats developers. We thank the Theory Secretariat for organizing the coffee breaks!
h We thank YOU for making this workshop both informative and enjoyable.
h We thank the World’s funding agencies and the World’s taxpayers for their generous support:
LHC cost: $1million / scientist
LPCC Workshop on Likelihoods CERN 40