DT Upgrade overview and Phase 2 discussion

44
C. F. Bedoya

description

DT Upgrade overview and Phase 2 discussion. C. F. Bedoya. CMS Week - April 9th 2013. C. F. Bedoya April 9th , 2013. 2. DT Upgrade Overview. LS1 projects: * Theta TRB: (talk from Fabio). * SC relocation (next slides) Phase 1 Trigger : * L1 TDR finalized and now under review. - PowerPoint PPT Presentation

Transcript of DT Upgrade overview and Phase 2 discussion

Page 1: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya

Page 2: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya April 9th, 2013 2

LS1 projects:* Theta TRB: (talk from Fabio). * SC relocation (next slides)

Phase 1 Trigger : * L1 TDR finalized and now under review.

Goal is to make a unified trigger system both in HW and

allow to combine information from different detectors at the earlier possible time

* Twin Mux board: (pre-DTTF board)A. Triossi started tests of reading trigger inputs with Avago+FPGA

* Algorithms: DT+ RPC algorithms? Manpower more or less identified. Some general approaches established. We need to start performing the actual

work.

Phase 2: * Mixing Tracker + DT (or other subdetector) info studies moved to Tracking Trigger working

group (CMS general, not subdetector based)

* New inputs from CMS regarding L1A parameters affect our electronics (this talk and next one)

Page 3: DT Upgrade overview and  Phase 2 discussion

-ROS

-TSCCOPPEROPTICAL

READOUT

TRIGGER

C. F. Bedoya April 9th, 2013 3

Page 4: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya April 9th, 2013 4

Many important milestones achieved but also a lot of work ahead us. Since last report (oct 2012):

- New CUOF produced (attenuation pad that allowed larger margins for unbalanced data)

- December tests at P5: 3 sectors in TRG+RO operated satisfactorily with CUOF+OFCU chain

- ESR (Electronics System Review) on Feb 6th 2013. Went well.- February tests at P5: 72 TRG links tested satisfactorily with different trigger

rates- Motherboard first boards production: boards in hands, firmware being

finalized- First fanouts received: good results- Final order for fibers placed: all RO fibers delivered- Aachen crate prototype received and tested. New backplane prototype

expected soon.- Slow control OCB and ECB first tests at 904 ok.- CUOF+OFCU-TRG temperature tests satisfactory results- Half of the Wiener crates ordered (delivery next week?)- Final VME_patch and TIMBUS-OFCU boards under assembly: tests ok- Alignment modules moved from X2 Near- DT-DSS relocation in USC underway- Final order of Pattern Units partly placed

Page 5: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya April 9th, 2013 5

Next steps:- Final motherboard tests (missing firmware)- CUOF cooling test (missing motherboard)- CUOF slow control tests at 904 (idem)- Final integration tests in 904 with Minicrate to ROS and TSC chain.- New OCB to be produced (waiting slow control tests). - New backplane from Aachen expected soon- OFCU-TRG slow control tests- Final purchase of OF fanouts- Fibers installation- LV rack refurbishment (waiting for CUOF consumption)- Place final order of Wiener crates - Final orders of CUOF, motherboards, Slow control, OFCU-RO, TSCrear, OFCU-TRG....- IRR (Installation Readiness Review) 3 months prior installation (oct), i.e. June....

However, there is still one point I don´t think we have yet totally solved :Would we go ahead with full trigger relocation as it is?Deciding on some pre-production tests. Partially done at Torino with doubts... what else??

Page 6: DT Upgrade overview and  Phase 2 discussion

DT LS1 upgrade project milestones• 2012 June -- new TRBs and CuOF prototypes tested for radiation tolerance Jun

2012

• 2012 June -- DT system Electronics Change Review

• 2012 Nov -- Installation plan review/ milestones reassessment

• 2013 March -- Fiber trunk cables ready for installation

• 2013 May -- new TRB theta ready for installation

• 2013 July -- CuOF-OFCu system ready for installation in one test wheel (YB-1?)

• 2013 Oct – Complete CuOF-OFCu system ready for installation in five wheels

Page 7: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya April 9th, 2013 7

Interim report from the TPSWG (Trigger Performance Strategy Working Group):https://cms-docdb.cern.ch/cgi-bin/DocDB/RetrieveFile?docid=11688&version=1&filename=TPS-Interim-Report-v20.pdf

TPSWG charges: to establish a plan for CMS triggering options during Phase 2.

Even if you think you will be retired by 2022, think of what you want to do until then... (R&D takes years)

Page 8: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya April 9th, 2013 8

TPSWG charges: to establish a plan for CMS triggering options during Phase 2.

L1 Tracking trigger

Bandwidth increase (L1A latency 20 us and rate 1 MHz)

* More latency is desirable for implementing tracking trigger algorithms

* Rate increase of hadron (and photon) triggers will not be reduced with tracking trigger (need more than 100 kHz)

ROBs may be able to stand 20 us latency, but 1 MHz trigger rate is basically impossible. It could be possible in a fraction of them, mostly if we run the TDCs in continuous mode.

What do we do? 1 MHz & 20 us => 1-2 years & 5 MCHF ? Probably more... Evaluate

Studies so far compiled in this document, you are welcomed to review it:https://twiki.cern.ch/twiki/pub/CMS/DtUpgrade/subsystemarch_DT.docx

AND

Page 9: DT Upgrade overview and  Phase 2 discussion

We are being asked this

Other intermediate points that

alleviate the cost and time

of needed intervention

should be given (if they exist)

C. F. Bedoya April 9th, 2013

Page 10: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya December 11th, 2012 10

Higher LHC luminosity will imply larger occupancies in the DT chambers, which will turn into a higher hit frequency. The consequences of this higher occupancy are the following:

-increase buffers occupancy at the different levels-slow down the time to process an event (reduce processing speed)-increase the required bandwidth for the output link

Increase of trigger rate and/or trigger latency only makes this scenario even harder.

* L1A rate and latency will impact only readout chain

* ROS will be redesigned between LS1 and LS2.

* Will try to incorporate DDU functionality to remove further bottlenecks

* i.e. impact is on the ROB

Page 11: DT Upgrade overview and  Phase 2 discussion

11

Big constraint: large time needed to perform any intervention in our detector.Very unlikely that we get access time for all of our needs:

LS1.5?: new pixelLS2: HCAL, GEMs?, GRPC? LS3: New Tracker, New ECAL FE

Whatever we invent, will likely need to coexist with present electronicsbut at the same time, the architecture could be completely different if 20 us latency is granted...

C. F. Bedoya April 9th, 2013

Page 12: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya April 9th, 2013 12

PHASE 2We need to come with a general plan for DT by the CMS Upgrade week in June (DESY)

Running until 20303000 fb-1 by 2030Inst Lumi of 7-8 e34After LS3 (2030) Level1 trigger 1 MHz rate and 20 us latency ?

Will chambers survive that? Do we want to make any special study on them?

Maybe not the full chambers, but some items? PADCs? FEBs?

Minicrates?

In reality probably the power supplies and the cooling are the point of major concern for the long reliability of the system... (though not much design we can do on that?...)

Page 13: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya April 9th, 2013 13

Page 14: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya April 9th, 2013 14

We can digitize every hit, assign it a time stamp (related to BC0?) and send it outside the Minicrate to USC

With the time digital word, one should be able to recreate the trigger primitives

Each time measurement will contain:tBX + tdrift + offsets...

Page 15: DT Upgrade overview and  Phase 2 discussion

UXC

USC

C. F. Bedoya April 9th, 2013 15

This hits will be stored in memories until a L1A arrives and an event matching is performed (such as what ROB does).

Page 16: DT Upgrade overview and  Phase 2 discussion

Minicrate mechanics, integration and interfaces:what is easier: replace full Aluminum structure or inner parts?How long it takes to perform the replacement?How can we install new fibers from MC to racks?Power dissipation

Electronics design, time digitization:R&D of time digitization based in FPGAs (ACTEL not feasible 350 MHz max, Xilinx?)New techniques for scrubbing, etc and other rad tolerance actions (reloading firmware with TTC comands, etc)Delay lines or DESER approach?Power consumption How many channels per FPGA? (cost)

Trigger algorithms and offsite electronics:Starting with time digital information:

extract correct bunch crossing, obtain trigger primitives (meantimer? others?)

Implement algorithms in FPGAs and interconnect with present trigger electronicsREADOUT: Perform event matching, etcWhich is the latency required? can we survive with a mixed scheme in which we replace only a fraction of the minicrates?

C. F. Bedoya April 9th, 2013 16

Page 17: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya April 9th, 2013 17

With our time resolution requirement, we should focus in FPGA implementations

Page 18: DT Upgrade overview and  Phase 2 discussion

* Chambers resolution aprox 4 ns

* HPTDC working in 0.781 ns time bin(Actually signal integrity degrades this resolution to aprox 1 ns in some channels)

* FPGA: There are basically two approaches to reach this resolutions:

Deserializer option (+ coarse counter)

Delay lines (+ coarse counter)

C. F. Bedoya April 9th, 2013 18

* Current consumption should also be a factor to take into account for the design, we could be talking of few Amps/board very easily (Critical point in my opinion)

* Radiation tolerance is the other constrain

Page 19: DT Upgrade overview and  Phase 2 discussion

Using FPGA gates as delay elements.

High resolution can be achieved(picoseconds)

Dependencies on placement (and thus firmware version), on temperature, on power supply...

C. F. Bedoya April 9th, 2013 19

Page 20: DT Upgrade overview and  Phase 2 discussion

* Deserializer option: encoding hit as a input serial stream. It can be done with high speed deserializers (GTX) or with lower speed ones, which depend on clock speed)

C. F. Bedoya April 9th, 2013 20

serdes

coarse counter

encoder

hit

coarse clock

deser

time digital word

coarse time

finetime

high speed clock

* GTX are for 11 Gbps and more

* Number of GTX/FPGA is somehow limited, but with SERDES one can think of achieving large number of channels per chip 128? one ROB?)

* 1 ns => 1 GHz clock speed (this is well within today´s reach)

Page 21: DT Upgrade overview and  Phase 2 discussion

Baseline approach:128 channels /FPGA (one FPGA== one ROB)=> 6-7 FPGAs/Minicrate=> 2 -3 boards max/Minicrate?=> probably one or two MTP optical fiber cords per Minicrate (24 links)

At 10 times more luminosity, required output bandwidth could be 200 Mbps/ROB(30 kHz non correlated hits, 114 kHz tracks/ROB)11.6 Gbps output link allows to transmit basically 9 time measurements/BX => no latency impact

C. F. Bedoya April 9th, 2013 21

Page 22: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya April 9th, 2013 22

size of this connectors....

Page 23: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya April 9th, 2013 23

How to obtain the trigger primitives with BC0 related digital time measurements?

to be continued.......

Page 24: DT Upgrade overview and  Phase 2 discussion

24C. F. Bedoya April 9th, 2013

Page 25: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya April 9th, 2013 25

Page 26: DT Upgrade overview and  Phase 2 discussion
Page 27: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya December 11th, 2012 27

TSWG (Trigger Strategy Working Group)

CMS is discussing the possibility of operating in Phase 2 with:

- 1 MHz L1A rate (instead of 100 kHz)- 20 us latency (instead of 6 us as previously assumed. Now 3.2 us)

We are asked to provide statements about the implications in our subsystem.

Two meetings already took place:- https://indico.cern.ch/conferenceDisplay.py?confId=195153- https://indico.cern.ch/conferenceDisplay.py?confId=200590

It appeared unlikely because it meant replace all ECAL electronics, but now this is not considered so difficult.(Would this mean that tracking trigger is not so mandatory if we can increase the trigger rate to 1 Mhz? not clear)

Page 28: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya December 11th, 2012 28

Higher LHC luminosity will imply larger occupancies in the DT chambers, which will turn into a higher hit frequency. The consequences of this higher occupancy are the following:

-increase buffers occupancy at the different levels-slow down the time to process an event (reduce processing speed)-increase the required bandwidth for the output link

Increase of trigger rate and/or trigger latency only makes this scenario even harder.

* L1A rate and latency will impact only readout chain

* ROS will be redesigned between LS1 and LS2.

* Will try to incorporate DDU functionality to remove further bottlenecks

* i.e. impact is on the ROB

Page 29: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya December 11th, 2012 29

Rate outside the trains, no muons included (basically neutrons)

Big difference between chambers,:

- YB-2 S4 MB4 27 kHz/TDC channel- YB-/+2 MB1s 12 kHz/TDC channel

Accounting inside trains may increase 3.6 kHz in MB1 and 0.3 kHz in MB4 (¿?)

From Gianni CMS week June

1035

1035

Page 30: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya December 11th, 2012 30

Gianni CMS week June

* Upper sectors MB4* Leaks between barrel and endcap in MB1s

Page 31: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya December 11th, 2012 31From I. Redondo

YB-/+2 MB1s at 1035 : 350 kHz “muon” rate (per chamber)114 kHz “muon” rate per ROB29 kHz “muon” rate per TDC

This are chamber rates (DTTF input counters)

Phi rates

Page 32: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya December 11th, 2012 32

From I. Redondo

eta

Page 33: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya December 11th, 2012 33

Hit rate 27 kHz => 37 us between hits. Since there are:128 channels/ROB, => one hit per ROB each 0.29 us (3.5 hits/ROB/event)32 channels/TDC, => one hit per TDC each 1.2 us.

Including muons, adds 8 hits per L1buffer FIFO (or more):Worst muon rate 350 kHz => 29kHz/TDC => 1 muon every 35 us/TDC

Hits stored in L1buffer until the L1A arrives.

So if the L1A latency was 20 us, this means we will have to store max 1 hit/channel, i.e. 32 hits/TDC until the L1A arrive + 8 hits/TDC from muons until the L1A arrive This represents 12 hits per L1buffer FIFO/event, while they are 256 positions in the FIFO.

It seems it should be possible to work with 20 us latency in phase 2

Page 34: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya December 11th, 2012 34

-Readout FIFO-Event processing speed-Output link bandwidth

Time window is approximately 1 us, and we will have 1 hit/TDC each 1.2 us, that is the number of hits per event that are expected to be stored in the Readout FIFO

But the limiting factor is the speed at which you output the data

byte-wise mode at 20 MHz Header,trailer, hits = 32 bit words

It takes 8 BXs to send each hit (minimum) + 16 BXs header+ trailer+ 8 BXs token sharing + seems there is more... (next slide)

Page 35: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya December 11th, 2012 35

No muons

<- 3.5 hits/ROB

Max 500 kHz... probably optimistic

If one includes the worst case muon rate (350 kHz/chamber => 1 muon/ROB every 9 us) => MAX 300 kHz

(one could reduce the time window, increase the threshold or start being inefficient...)

Test perform on a ROB at lab with fixed trigger rate:

Page 36: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya December 11th, 2012 36

• trigger matching: reduces the payload. (100 kHz L1A rate => one L1A each 10 us with a sampling window of 1 us => we read 10% of the time)

• At 1 MHz L1A rate => we will be reading 100% of the data, thus trigger matching is an overhead of processing time and

bandwidth (headers and trailers)

• In principle, it is possible to run the HPTDCs in continuous mode:• - hits sent as arrive from chambers, • - time measurement referred to BC0

• Latency and L1A rate becomes irrelevant for the ROB

• Trigger matching in the new ROS may be possible (needs study and will imply detailed calibration of the different absolute BXs ID between the ROBs and the ROS).

• It may open possibility to use TDC data in the trigger (?)... (output delay not deterministic and long...)

• Limiting factor in this mode of operation is the bandwidth of the readout link:

27 kHz * 128 ch * 32 bits/hit = 110 Mbps

• With muons (350 kHz/chamber => 114 kHz/ROB of 8 hits muons => 114 Mbps

• Max bandwidth is 160 Mbps => hit rate 39 Hz/cm2 (to be checked) MB4s are close...

• This may work depending on the uncertainty of the background. If it is large, then we start o be inefficient

Page 37: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya December 11th, 2012 37

• We don´t expect any problem for running present ROBs design with Phase 2 occupancies and 100 kHz L1A rate and 6 us L1A latency

• With the background rates expected in Phase 2, it is unlikely that we could run at a trigger rate higher than 500 kHz.

• Latencies of 20 us could likely be achieved, although proper testing should be done. • HPTDCs could be operated in continuous mode, which should allow operation with 20 us latency and 1

MHz trigger rate. However, this means no background reduction is possible playing with the time window (no filtering is done, all hits are sent). Therefore, we could start to loose efficiency for background rates larger than 39 Hz/cm2. (This number should also be checked). MB4 rates are too close to this number (27 Hz/cm2 for YB-2 S4 MB4).

• Any action to place a shielding in the MB4s of the upper sectors or in the MB1s of the external wheels

will reduce any possible efficiency drop. • Considering the large uncertainties (energy, extrapolation, target luminosity, etc), even if the MB4s can

be shielded, replacing the ROBs of the MB1s external wheels with a higher performance board, should also not be discarded

• In order to operate in continuous mode, a DC balancing is mandatory at CUOF level.

Page 38: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya April 9th, 2013 38

Page 39: DT Upgrade overview and  Phase 2 discussion

Width of the board is fixed by the Minicrate width (aprox 9.5 cm)

Length is variable, limitation are the input connectors which are 5.5 cm long

Number of boards

Otherwise, interface board with high density connectorsThis will allow higher density integration (up to 1 board/Minicrate? do we want this?)

Page 40: DT Upgrade overview and  Phase 2 discussion

• Simulations of the impact of inefficiency in the detector in the physics analysis do not exist. A muon joint effort would make sense on this.

• Probably even less if one refers to simulation of the electronics (readout), and this is one of the main points we are asked....

• In reality probably the power supplies and the cooling are the point of major concern for the long reliability of the system... (though not much design we can do on that?...)

• But for custom electronics we need time for brainstorming and for putting together the manpower and the budget (different timescales for different institutes, etc), so discussions on what can be done should take place now.

• Different L1A parameters may imply some challenge in the electronics.. it may be more interesting that to rebuilding the same wheel...

• Or maybe what we want to think is in replacing subdetectors technology.. but I doubt that.

C. F. Bedoya April 9th, 2013 40

Page 41: DT Upgrade overview and  Phase 2 discussion

(or 25 us...)

maybe not so much

Pixel trigger?

AND

C. F. Bedoya April 9th, 2013

Page 42: DT Upgrade overview and  Phase 2 discussion

RPC: Probably changed by LS2. All in USC. Any increase L1A param => 100-150 keuros (84 RMBs, 3 FEDs)

CSC: Good at increased L1A rate, bad at increased latency>300 kHz & 6.4 us => 4 months & 12 MCHF. 500 kHz possible at all? which implicationsIs it worth changing just a fraction of the CFEBs?Do you expect any longevity issue that would make worth a re-desing

independently from this?1 MHz & 20 us => 4 months & 12 MCHF ? Is that true?

DT: Opposite to CSCs... latency should be OK, rate overkilling300 kHz & 20 us => OK (to check)500 kHz & 20 us => YB-/+2 MB1 and MB4s? 6 months?, cost? 1 MCHF? EvaluatePossibility open that we need to replace Minicrates either way.. maybe not all in

one goal, but the R&D should be launched soon1 MHz & 20 us => 1-2 years & 5 MCHF ? Probably more... Evaluate

ECAL: limit is 150 kHz & 6.4 us (maybe 300 kHz?). > => 26 months shutdown, 10 MCHFTracker: very much in favor of increasing latency (20 us)

if rate increased, happy with 500 kHz, 1 MHz not impossibleHCAL: changes limited to USC, does not seem a major constraint at present.

C. F. Bedoya April 9th, 2013

Page 43: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya April 9th, 2013 43

Page 44: DT Upgrade overview and  Phase 2 discussion

C. F. Bedoya April 9th, 2013 44