What is the CMS(UK) Data Model?
-
Upload
tucker-lee -
Category
Documents
-
view
24 -
download
2
description
Transcript of What is the CMS(UK) Data Model?
![Page 1: What is the CMS(UK) Data Model?](https://reader035.fdocuments.in/reader035/viewer/2022071808/5681321c550346895d987e1d/html5/thumbnails/1.jpg)
Glenn Patrick 31/03/00 CMS(UK)
What is the CMS(UK) Data
Model?Assume that CMS software is available at every UK institute connected by some infrastructure (ie. Grid).
The problem then reduces to:•What datasets are required?•Where are they required?•Why are they required? •Who is going to generate, distribute them?•What are the formats, sizes & access patterns?
![Page 2: What is the CMS(UK) Data Model?](https://reader035.fdocuments.in/reader035/viewer/2022071808/5681321c550346895d987e1d/html5/thumbnails/2.jpg)
Event Tag Data
Physics Objects
Reconstructed Data
Raw Data
![Page 3: What is the CMS(UK) Data Model?](https://reader035.fdocuments.in/reader035/viewer/2022071808/5681321c550346895d987e1d/html5/thumbnails/3.jpg)
DataImport
DataExport
Mass Storage & DiskServers
Database Servers
Tapes
Network from CERN
Networkfrom Tier 2 andsimulation centers
PhysicsSoftware
Development
R&D Systemsand Testbeds
Info serversCode servers
Web ServersTelepresence
Servers
TrainingConsultingHelp Desk
ProductionReconstruction
Raw/Sim-->ESD
Scheduled, predictable
experiment/physics groups
ProductionAnalysis
ESD-->AODAOD-->DPD
Scheduled
Physics groups
Individual Analysis
AOD-->DPDand plots
Chaotic
Physicists Desktops
Tier 2
Local institutes
CERN
Tapes
Support Services
![Page 4: What is the CMS(UK) Data Model?](https://reader035.fdocuments.in/reader035/viewer/2022071808/5681321c550346895d987e1d/html5/thumbnails/4.jpg)
batchphysicsanalysis
batchphysicsanalysis
detector
event summary data
rawdata
eventreconstruction
eventreconstruction
eventsimulation
eventsimulation
analysis objects(extracted by physics topic)
Offline Data andComputation for Physics Analysisevent filter
(selection &reconstruction)
event filter(selection &
reconstruction)
processeddata
![Page 5: What is the CMS(UK) Data Model?](https://reader035.fdocuments.in/reader035/viewer/2022071808/5681321c550346895d987e1d/html5/thumbnails/5.jpg)
CPU for productionMass Storage for RAW, ESD AOD, and TAG
Institute
Selected User AnalysesInstitute
Selected User Analyses
Regional Centre
User analysis
Production Centre
Generate raw dataReconstructionProduction analysis
User analysis
Regional Centre
User analysisRegional Centre
User analysis
Institute
Selected User Analyses
Regional Centre
User analysis
Institute
Selected User Analyses
CPU for analysisMass storage for AOD, TAG
CPU and data servers
AOD,TAGreal : 80TB/yrsim: 120TB/yr
AOD,TAG8-12 TB/yr
LHCb
![Page 6: What is the CMS(UK) Data Model?](https://reader035.fdocuments.in/reader035/viewer/2022071808/5681321c550346895d987e1d/html5/thumbnails/6.jpg)
ProductionCentre
(x1)
RegionalCentre(~x5)
Institute(~x50)
Real Data Simulated Data
Data collectionTriggeringReconstructionFinal State Reconstruction
CERN
WAN Output to each RC:AOD and TAG datasets20TB x 4 times/yr= 80TB/yr
User Analysis
WAN Output to each Institute:AOD and TAG for samples1TB x 10 times/yr= 10TB/yr
RAL , Lyon, ...
Event GenerationGEANT trackingReconstructionFinal State Reconstruction
WAN Output to each RC:AOD, Generator and TAG datasets30TB x 4 times/yr= 120TB/yr
User Analysis
Selected User Analysis Selected User Analysis
WAN Output to each institute:AOD and TAG for samples3TB x 10 times/yr= 30TB/yr
LHCb
![Page 7: What is the CMS(UK) Data Model?](https://reader035.fdocuments.in/reader035/viewer/2022071808/5681321c550346895d987e1d/html5/thumbnails/7.jpg)
Dataflow Model
RAW Data
DAQ system
L2/L3 Trigger
Calibration Data
Reconstruction
Event Summary Data (ESD) Reconstruction Tags
Detector
RAW Tags
L3YES, sample L2/L3NO
ESD Reconstruction Tags
Analysis Object Data (AOD) Physics Tags
First PassAnalysis
Physics Analysis
Private Data
Analysis Workstation
Physics results
ESD RAW
![Page 8: What is the CMS(UK) Data Model?](https://reader035.fdocuments.in/reader035/viewer/2022071808/5681321c550346895d987e1d/html5/thumbnails/8.jpg)
Need to answer questions like...
How will a physicist in Bristol/Brunel/IC/RAL:
• Select events for a given physics channel from a year’s worth of data taking?
• Transfer/replicate the selection for further analysis?
• Generate & process a large sample of simulated events?
• Run his/her batch job on existing samples of Monte-Carlo events (eg. at Tier1/Tier2)?
Where do you want the data?
What sort of data do you need - Tag,AOD,ESD,Raw?
![Page 9: What is the CMS(UK) Data Model?](https://reader035.fdocuments.in/reader035/viewer/2022071808/5681321c550346895d987e1d/html5/thumbnails/9.jpg)
How to Go Forward?• Need to identify critical mass of people formed from all of the institutes who will start to study, develop and exploit CMS(UK) facilities now.
• Require expert(ise) in OO databases - specifically Objectivity (BaBar estimate 1 FTE).
• Each institute needs to start to identify its data requirements for simulation/physics/trigger studies.
• Need to understand how best to distribute, replicate, and centralise database & associated resources.
• Need good organisation with regular meetings, etc.