Current Status of e-ID based on BDT Algorithm
description
Transcript of Current Status of e-ID based on BDT Algorithm
![Page 1: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/1.jpg)
Current Status of e-ID based on BDT Algorithm
Hai-Jun YangUniversity of Michigan
(with X. Li, T. Dai, A. Wilson, B. Zhou)
BNL Analysis JamboreeMarch 19, 2009
![Page 2: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/2.jpg)
H. Yang - Status of BDT e-ID 22
Goals• ATLAS default electron-ID (IsEM) has relatively
low efficiency (~70%) and high jet faked electron rate based on our diboson and Higgs studies. The goal of e-ID algorithm development based on Boosted Decision Trees (BDT) are:
To improve the e-ID selection efficiency
To reduce faked electron rate from dijet samples (previous works)
To reduce faked electron rate from jet samples (current works)
![Page 3: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/3.jpg)
H. Yang - Status of BDT e-ID 33
Electron ID Studies with BDTSelect electrons in two steps1) Pre-selection: an EM cluster matching a track
2) Apply electron ID based on pre-selected samples with different e-ID algorithms (IsEM, Likelihood, AdaBoost and EBoost).
BDT e-ID development at U. Michigan (Rel. v12)- H. Yang’s talk at US-ATLAS Jamboree on Sept. 10, 2008
http://indico.cern.ch/conferenceDisplay.py?confId=38991
BDT e-ID (EBoost) based on Rel. v13- H. Yang’s talk at ATLAS performance and physics workshop at
CERN on Oct. 2, 2008
http://indico.cern.ch/conferenceDisplay.py?confId=39296
Implementation of EBoost in EgammaRec (Rel. v14)- H. Yang’s talk at ATLAS egamma conference on Dec. 17, 2008
http://indico.cern.ch/conferenceDisplay.py?confId=431173
![Page 4: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/4.jpg)
H. Yang - Status of BDT e-ID 44
Comparison of e-ID Algorithms (v14)
IsEM (tight) Efficiency = 70.9% jet rejection rate=3092
Likelihood Efficiency = 71% jet rejection rate=5200
AdaBoost Efficiency = 71% jet rejection rate=7059
EBoost Efficiency = 71% jet rejection rate=47830
A technical note about BDT e-ID will be submitted shortly
![Page 5: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/5.jpg)
H. Yang - Status of BDT e-ID 5
Current Works on e-ID background from -jet and D3PD Validation
Background study using -jet Samples• H. Yang et.al., “First Look at -jet samples using BDT e-ID
algorithm”, ATLAS Egamma Phone Conference, 03/09/2009
1st Version D3PD Validation• H. Yang, “Validation of D3PD using e-ID related variables”, US-
ATLAS Egamma Meeting, 01/29/2009• H. Yang, “Update on validation of D3PD using e-ID related
variables”, US-ATLAS Egamma Meeting, 02/26/2009• H. Yang, “Update of D3PD validation using e-ID related variables”,
US-ATLAS Egamma Meeting, 03/12/2009
![Page 6: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/6.jpg)
H. Yang - Status of BDT e-ID 6
2nd Version D3PD Validation using Zee Samples
• D3PD samples (5 files)– egamma_PythiaZee_1Lepton.recon.v142203.
[1-5].root
• Five identical AOD files for validation– mc08.106050.PythiaZee_1Lepton.recon.AOD.
e347_s462_r541_tid028675/AOD.028675._0780[2-6].pool.root.1
– Use DoElectron.cxx to extract electron related variables.
![Page 7: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/7.jpg)
H. Yang - Status of BDT e-ID 7
El_Eta and El_Phi
AOD D3PD
AOD D3PD
![Page 8: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/8.jpg)
H. Yang - Status of BDT e-ID 8
E233/E277 and E237/E277
AOD D3PD
![Page 9: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/9.jpg)
H. Yang - Status of BDT e-ID 9
EtCone30 and EtCone40
AOD D3PD
![Page 10: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/10.jpg)
H. Yang - Status of BDT e-ID 10
BLayerHits and Pixel Hits
AOD D3PD
![Page 11: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/11.jpg)
H. Yang - Status of BDT e-ID 11
EL_IsEM ?
AOD
D3PD
Not same ?
![Page 12: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/12.jpg)
H. Yang - Status of BDT e-ID 12
EL_IsEM
AOD (black) and D3PD (red crosses) are identical if they are compared in same plot!
![Page 13: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/13.jpg)
H. Yang - Status of BDT e-ID 13
-jet: e-ID Background
• To estimate the jet and dijet rejection rates using various e-ID algorithms (IsEM, Likelihood and BDT).
• MC samples for test include:– DS108087, jet– DS105802, JF17 dijet– DS106050, Zee signal
![Page 14: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/14.jpg)
H. Yang - Status of BDT e-ID 14
MC jet samples for test(Pre-selection)
samples
N_electron
DS108087
jet
DS105802
JF17 dijets
DS106050
Zee
N_events 127887 237950 83690N_candidateEt>17 GeV, ||<2.5
194046 896818 108550
N_candidate (precuts)
With EM/Track match
20441 20994 94153
Rejection/Efficiency
after precutsRejection
9.5
Rejection
42.7
Acceptance
86.7%
![Page 15: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/15.jpg)
H. Yang - Status of BDT e-ID 15
Results of IsEM
• Et > 17GeV, Tight cuts
- Efficiency (Zee) = 70.9%
- Rejection (jet) = 426(±4.7%)
- Rejection (jf17) = 3092(±5.9%)
![Page 16: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/16.jpg)
H. Yang - Status of BDT e-ID 16
e-ID (Likelihood)log(ElectronWt/BgWt)
Et > 17GeV
- Efficiency = 71%- Rej(jet)=20(±1%) low!- Rej(jf17)=5200(±7.6%)
jf17 jet
Zee
![Page 17: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/17.jpg)
H. Yang - Status of BDT e-ID 17
Input Variables for BDT
-jets
electrons
dijets
![Page 18: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/18.jpg)
H. Yang - Status of BDT e-ID 18
Input Variables for BDT
-jets
electronsdijets
![Page 19: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/19.jpg)
H. Yang - Status of BDT e-ID 19
Input Variables for BDT
-jets
electrons
dijets
![Page 20: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/20.jpg)
H. Yang - Status of BDT e-ID 20
BDT e-ID trained using dijet as background (list of top 10 powerful variables)
Rank Input variable Gini index
1 Etcone45 / Et 46.08%
2 E2tsts1-Emins1(Emax2-Emin in LAr.1st)
8.60%
3 No. of TRT hits / No. of B-layer hits 6.68%
4 deta1 between track and EM cluster 5.21%
5 Number of pixel hits 4.48%
6 F1(frac. of E deposited in LAr. 1st samp)
4.32%
7 Ethad1/Et (E leakage in hcal. 1st samp) 3.94%
8 E237 / E277 3.41%
9 Eta of inner track 2.33%
10 Number of B-layer hits 2.19%
![Page 21: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/21.jpg)
H. Yang - Status of BDT e-ID 21
e-ID (BDT_dijet)
Et > 17GeV
- Efficiency = 71%- Rej(jet)=591(±5.5%)- Rej(jf17)=47830(±23%)
BDT e-ID trained using dijets as background
![Page 22: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/22.jpg)
H. Yang - Status of BDT e-ID 22
BDT peaks in jet samples?Correlations
![Page 23: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/23.jpg)
H. Yang - Status of BDT e-ID 23
Comparison of IsEM, Likelihood and BDT (trained using dijets)
• BDT
![Page 24: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/24.jpg)
H. Yang - Status of BDT e-ID 24
Comparison of IsEM, Likelihood and BDT (trained using dijets)
• BDT
![Page 25: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/25.jpg)
H. Yang - Status of BDT e-ID 25
BDT e-ID trained using jets as background(list of top 10 powerful variables)
Rank Input variable Gini index
1 Number of B-layer hits 21.49%
2 Ntrk (R=0.3) 17.94%
3 Pt (R=0.3) 11.41%
4 Number of pixel hits 11.17%
5 E233 / E277 5.79%
6 E237 / E277 4.77%
7 No. of TRT hits / No. of B-layer hits 4.52%
8 deta1 between track and EM cluster 4.49%
9 Etcone45 / Et 4.20%
10 F1(frac. of E deposited in LAr. 1st samp)
1.97%
![Page 26: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/26.jpg)
H. Yang - Status of BDT e-ID 26
e-ID (BDT_jet)
Et > 17GeV
- Efficiency = 71%- Rej(jet)=1788(±9.6%)- Rej(jf17)=19081(±15%)
BDT e-ID trained using jets as background
![Page 27: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/27.jpg)
H. Yang - Status of BDT e-ID 27
Performance ComparisonBDT e-ID trained using jets as background
• BDT
![Page 28: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/28.jpg)
H. Yang - Status of BDT e-ID 28
Performance ComparisonBDT e-ID trained using jets as background
• BDT
![Page 29: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/29.jpg)
H. Yang - Status of BDT e-ID 29
Summary samples
e-IDs
DS108087Jet
Rejection
DS105802
JF17 dijets
Rejection
DS106050
Zee
Acceptance
IsEM (tight) 426(±4.7%) 3092(±5.9%) 71%(±0.4%)
Likelihood 20(±1.0%) 5200(±7.6%) 71%(±0.4%)
BDT(using dijets)
591(±5.5%)
426(±4.7%)
47830(±23%)
27176(±17%)
71%(±0.4%)
77%(±0.3%)
BDT(using jets)
1788(±9.6%) 19081(±15%) 71%(±0.4%)
![Page 30: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/30.jpg)
H. Yang - Status of BDT e-ID 30
Backup Slides
![Page 31: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/31.jpg)
H. Yang - Status of BDT e-ID 31
BDT peaks in jet samples?
![Page 32: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/32.jpg)
H. Yang - Status of BDT e-ID 3232
Comparison of e-ID Algorithms (v14)
IsEM (tight) Efficiency = 70.9% jet rejection rate=5.5e4
Likelihood Efficiency = 80% jet rejection rate=3.2e4
AdaBoost Efficiency = 80% jet rejection rate=7.0e4
EBoost Efficiency = 80% jet rejection rate=3.9e5
Jet rejection rate based on number of MC truth jets
![Page 33: Current Status of e-ID based on BDT Algorithm](https://reader035.fdocuments.in/reader035/viewer/2022062518/56813fc3550346895daaa1b4/html5/thumbnails/33.jpg)
H. Yang - Status of BDT e-ID 3333
Implementation of BDT Trees in Egamma Package and Test
• E-ID based on BDT has been implemented into egamma reconstruction package.
• We successfully run through the reconstruction package based on v14.2.22 and v14.5.0 to test the BDT e-ID.
RDODigitizedraw data
Reconstructionwith egammaRel. V14.2.22
or Rel. V14.5.0
AOD
CBNT (*Ele_BDT)
egammaPID::EBoost