Elsevier Editorial System(tm) for Nuclear Inst. and...

Elsevier Editorial System(tm) for Nuclear

Inst. and Methods in Physics Research, A

Manuscript Draft

Manuscript Number:

Title: A 15 GSa/s, 1.5 GHz Bandwidth Waveform Digitizing ASIC

Article Type: Research Paper

Section/Category: Electronics

Keywords: Waveform sampling; ASIC; Integrated Circuit; Analog-to-Digital;

Switched Capacitor Array

Corresponding Author: Mr. Eric Oberla,

Corresponding Author's Institution:

First Author: Eric Oberla

Order of Authors: Eric Oberla; Henry Frisch; Hervé Grabas; Jean-Francois

Genat; Kurtis Nishimura; Gary Varner

Abstract: The PSEC4 custom integrated circuit was designed for the

recording of fast waveforms for use in large-area time-of-flight detector

systems. The ASIC has been fabricated using the IBM-8RF 0.13 μm CMOS

process. On each of 6 analog channels, PSEC4 employs a switched

capacitor array (SCA) 256 samples deep, a ramp-compare ADC with 10.5~bits

of effective resolution, and a serial data readout with the capability of

region-of-interest windowing to reduce dead time. The sampling rate can

be adjusted between 4 and 15 Gigasamples/second [GSa/s] on all channels

and is servo-controlled on-chip with a low-jitter delay-locked loop

(DLL). The input signals are passively coupled on-chip with a -3 dB

analog bandwidth of 1.5 GHz. The power consumption in quiescent sampling

mode is less than 50 mW/chip; at a sustained trigger and readout rate of

50 kHz the chip

draws 100 mW. After fixed-pattern pedestal subtraction, the uncorrected

differential non-linearity is 0.15%

over an 800 mV dynamic range. With a linearity correction, a full 1 V

dynamic range is available. The sampling timebase has a fixed-pattern

non-linearity with an RMS of 13%, which can be calibrated for precision

waveform feature extraction and picosecond-level timing resolution.

The first experimental application to the front-end readout of large-area

Micro-Channel Plate (MCP) photodetectors is presented.

THE UNIVERSITY OF CHICAGO

THE ENRICO FERMI INSTITUTE

5640 SOUTH ELLIS AVENUE

CHICAGO · ILLINOIS 60637

phone: (773) 795-3342

March 26, 2013

Dr. W. BarlettaCoordinating EditorNuclear Instruments and Methods in Physics Research Section AMassachusetts Institute of TechnologyCambridge, MA, USA

Dear Dr. Barletta:

This is the cover letter with the items requested by Elsevier for themanuscript “A 15 GSa/s, 1.5 GHz Bandwidth Waveform Digitizing ASIC”. Wethank you in advance for your time and effort.

To our knowledge we do not require any special considerations.A brief research background: This paper covers the architecture and per-

formance of PSEC4, a custom-designed 15 Gigasample/second [GSa/s] waveformsampling and digitizing ASIC. This compact ‘oscilloscope-on-a-chip’ was fabri-cated in the 0.13 µm CMOS technology and was designed for the recording oftransient waveforms with baseband signal bandwidths up to 1.5 GHz.

The design of PSEC4 was part of the LAPPD effort to develop very fast,large-area photodetectors [1]. The LAPPD MCP-PMT is a 20×20 cm2 (8×8 in2)hermetically packaged photodetector with a 30 channel RF microstrip anodesignal pick-off [2]. PSEC4 will be used as the front-end readout ASIC for theinitial MCP-PMTs deployed in test-beam experiments.

The Corresponding Author is:Eric OberlaEnrico Fermi InstituteACC 1235640 S. Ellis AveChicago, IL [email protected]

Sincerely,

Eric Oberla

Cover Letter

References

[1] The Large-Area Picosecond Photo-Detectors Project web page:<http://psec.uchicago.edu>

[2] H. Grabas, R. Obaid, E. Oberla, H. Frisch, et. al., “RF Strip-Line Anodes forPsec Large-Area MCP-based Photodetectors”, to be published Nucl. Instr.Meth. A (2013).

A 15 GSa/s, 1.5 GHz Bandwidth Waveform Digitizing ASIC

Eric Oberlaa, Herve Grabasa,1, Jean-Francois Genata,2, Henry Frischa, Kurtis Nishimurab,3, Gary Varnerb

aEnrico Fermi Institute, University of Chicago; 5640 S. Ellis Ave., Chicago IL, 60637bUniversity of Hawai’i at Manoa; Watanabe Hall, 2505 Correa Rd., Honolulu HA

Abstract

The PSEC4 custom integrated circuit was designed for the recording of fast waveforms for use in large-

area time-of-flight detector systems. The ASIC has been fabricated using the IBM-8RF 0.13 µm CMOS

process. On each of 6 analog channels, PSEC4 employs a switched capacitor array (SCA) 256 samples deep,

a ramp-compare ADC with 10.5 bits of effective resolution, and a serial data readout with the capability

of region-of-interest windowing to reduce dead time. The sampling rate can be adjusted between 4 and

15 Gigasamples/second [GSa/s] on all channels and is servo-controlled on-chip with a low-jitter delay-locked

loop (DLL). The input signals are passively coupled on-chip with a -3 dB analog bandwidth of 1.5 GHz. The

power consumption in quiescent sampling mode is less than 50 mW/chip; at a sustained trigger and readout

rate of 50 kHz the chip draws 100 mW. After fixed-pattern pedestal subtraction, the uncorrected differential

non-linearity is 0.15% over an 800 mV dynamic range. With a linearity correction, a full 1 V dynamic

range is available. The sampling timebase has a fixed-pattern non-linearity with an RMS of 13%, which

can be calibrated for precision waveform feature extraction and picosecond-level timing resolution. The first

experimental application to the front-end readout of large-area Micro-Channel Plate (MCP) photodetectors

is presented.

Keywords:

Waveform sampling, ASIC, Integrated Circuit, Analog-to-Digital, Switched Capacitor Array

1. Introduction1

We describe the design and performance of2

PSEC4, a ≥10 Gigasample/second [GSa/s] wave-3

form sampling and digitizing Application Spe-4

cific Integrated Circuit (ASIC) fabricated in the5

IBM-8RF 0.13 µm complementary metal-oxide-6

semiconductor (CMOS) technology. This compact7

‘oscilloscope-on-a-chip’ is designed for the recording8

1Present address, CEA/IRFU/SEDI; CEN Saclay-Bat141 F-91191 Gif-sur-Yvette CEDEX, France

2Present address, LPNHE, CNRS/IN2P3, UniversitesPierre et Marie Curie and Denis Diderot, T12 RC, 4 PlaceJussieu 75252 Paris CEDEX 05, France

3Present address, SLAC National Accelerator Labora-tory, 2575 Sand Hill Road, Menlo Park, CA 94025

of radio-frequency (RF) transient waveforms with9

signal bandwidths between 100 MHz and 1.5 GHz.10

1.1. Background11

The detection of discrete photons and high-12

energy particles is the basis of a wide range of13

commercial and scientific applications. In many of14

these applications, the relative arrival time of an15

incident photon or particle is best measured by ex-16

tracting features from the full waveform at the de-17

tector output [1, 2]. Additional benefits of front-end18

waveform sampling include the detection of pile-19

up events and the ability to filter noise or poorly20

formed pulses.21

For recording ‘snapshots’ of transient waveforms,22

Preprint submitted to elsevier March 26, 2013

*ManuscriptClick here to view linked References

http://ees.elsevier.com/nima/viewRCResults.aspx?pdf=1&docID=10508&rev=0&fileID=488932&msid=3F323063-BEEC-4F51-882B-A2F802CC145A

switched capacitor array (SCA) analog memories23

can be used to sample a limited time-window at a24

relatively high rate, but with a latency-cost of a25

slower readout speed [3, 4]. These devices are well26

suited for triggered-event applications, as in many27

high energy physics experiments, in which some28

dead-time can be afforded on each channel. With29

modern CMOS integrated circuit design, these SCA30

sampling chips may be compact, low power, and31

have a relatively low cost per channel [4].32

Over the last decade, sampling rates in SCA33

waveform sampling ASICs have been pushed to sev-34

eral GSa/s with analog bandwidths of several hun-35

dred MHz [5]. As a scalable front-end readout36

option coupled with the advantages of waveform37

sampling, these ASICs have been used in a wide38

range of experiments; such as high-energy physics39

colliders [6], gamma-ray astronomy [7, 8], high-40

energy neutrino detection [9, 10], and rare decay41

searches [11, 12].42

1.2. Motivation43

A natural extension to the existing waveform44

sampling ASICs is to push design parameters that45

are inherently fabrication technology limited. Pa-46

rameters such as sampling rate and analog band-47

width are of particular interest considering the48

fast risetimes (τ r ∼ 60-500 ps) and pulse widths49

(FWHM ∼ 200 ps - 1 ns) of commercially available50

micro-channel plate (MCP) and silicon photomul-51

tipliers [13, 14]. These and other fast photo-optical52

or RF devices require electronics matched to speed53

of the signals.54

The timing resolution of discrete waveform sam-55

pling is intuitively dependent on three primary fac-56

tors as described by Ritt4 [15]:57

σt ∝τr

(SNR)√Nsamples

(1)58

where SNR is the signal-to-noise ratio of the pulse,59

τ r is the 10-90% rise-time of the pulse, and Nsamples60

is the number of independent samples on the rising61

4Assuming Shannon-Nyquist is fulfilled

edge within time τ r. The motivation for oversam-62

pling above the Nyquist limit is that errors due to63

uncorrelated noise, caused both by random time64

jitter and charge fluctuations, are reduced by in-65

creasing the rising-edge sample size. Accordingly,66

in order to preserve the timing properties of analog67

signals from a fast detector, the waveform record-68

ing electronics should 1) be low-noise, 2) match the69

signal bandwidth, and 3) have a reasonably fast70

sampling rate.71

1.3. Towards 0.13 µm CMOS72

The well-known advantages of reduced transistor73

feature size include higher clock speeds, greater cir-74

cuit density, lower parasitic capacitances, and lower75

power dissipation per circuit [16]. The sampling76

rate and analog bandwidth of waveform sampling77

ASICs, which depend on clock speeds, parasitic ca-78

pacitances, and interconnect lengths, are directly79

enhanced by moving to a smaller CMOS technology.80

Designing in a smaller technology also allows clock-81

ing of an on-chip analog-to-digital converter (ADC)82

at a faster rate, reducing the chip dead-time.83

With the advantages of reduced transistor feature84

sizes also comes increasingly challenging analog de-85

sign issues. One issue is the increase of leakage86

current. Leakage is enhanced by decreased source-87

drain channel lengths, causing subthreshold leakage88

(VGS < VTH), and decreased gate-oxide thickness,89

which promotes gate-oxide tunneling [17]. Effects90

of leakage include increased quiescent power dissi-91

pation and potential non-linear effects when storing92

analog voltages.93

Another design issue of deeper sub-micron tech-94

nologies is the reduced dynamic range [17]. The95

available voltage range is given by (VDD-VTH),96

where VDD is the supply voltage and VTH is97

the threshold, or ‘turn-on’, voltage for a given98

transistor. For technologies above 0.1 µm, the99

(VDD-VTH) range is decreased with downscaled100

feature sizes to reduce high-field effects in the gate-101

oxide [17]. In the 0.13 µm CMOS process, the sup-102

ply voltage VDD is 1.2 V and the values of VTH103

2

Figure 1: Photograph of the fabricated PSEC4 die. The chipdimensions are 4×4.2 mm2.

range from 0.42 V for a minimum-size transistor104

(gate length 120 nm) to roughly 0.2 V for a large105

transistor (5 µm) [18, 19].106

The potential of waveform sampling design in107

0.13 µm CMOS was shown with two previous108

ASICs. A waveform sampling prototype achieved a109

sampling rate of 15 GSa/s and showed the possibil-110

ity of analog bandwidths above 1 GHz [20]. Leak-111

age and dynamic range studies were also performed112

with this chip. In a separate 0.13 µm ASIC, fab-113

ricated as a test-structure chip, a 25 GSa/s sam-114

pling rate rate was achieved using low VTH transis-115

tors [21]. The performance and limitations of these116

chips led to the optimized design of the PSEC4117

waveform digitizing ASIC. The fabricated PSEC4118

die is shown in Figure 1.119

In this paper, we describe the PSEC4 architec-120

ture (§2), experimental performance (§3), and a121

first application to the front-end readout of large-122

area, picosecond resolution photodetectors (§4).123

2. Architecture124

An overview of the PSEC4 architecture and func-125

tionality is shown in Figure 2. For clarity, this block126

diagram shows one of six identical signal channels.127

A PSEC4 channel is a linear array of 256 sample128

points and a threshold-level trigger discriminator.129

Each sample point in the array is made from a130

switched capacitor sampling cell and an integrated131

ADC circuit as shown in Figure 3.132

To operate the chip, a field-programmable gate133

array (FPGA) is used to provide timing control,134

clock generation, readout addressing, data manage-135

ment, and general configurations to the ASIC. Sev-136

eral analog voltage controls are also required for137

operation, and are provided by commercial digital-138

to-analog converter (DAC) chips.139

Further details of the chip architecture, includ-140

ing timing generation (§2.1) sampling and trigger-141

ing (§2.2), analog-to-digital conversion (§2.3) , and142

data readout (§2.4), are outlined in the following143

sections.144

2.1. Timing Generation145

The sampling signals are generated with a 256-146

stage Voltage-Controlled Delay Line (VCDL), in147

which the individual stage time delay is adjustable148

by two complementary voltage controls. Each stage149

in the VCDL is an RC delay element made from a150

CMOS current-starved inverter. The inverse of the151

time delay between stages sets the sampling rate.152

Rates of up to 17 GSa/s are possible with PSEC4153

as shown in Figure 4. The stability of the sam-154

pling rate is negatively correlated with the slope155

magnitude as the VCDL becomes increasingly sen-156

sitive to noise. The slowest stable sampling rate is157

∼4 GSa/s.158

A ‘write strobe’ signal is sent from each stage159

of the VCDL to the corresponding sampling cell in160

each channel. The write strobe passes the VCDL-161

generated sampling rate to the sample-and-hold162

switch of the cell as shown in Figure 3. To allow163

the sample cell enough time to fully charge or dis-164

charge when sampling, the write strobe is extended165

to a fixed duration of 8× the individual VCDL de-166

lay stage. In sampling mode, a block of 8 adjacent167

SCA sampling cells are continuously tracking the168

3

Figure 2: A block diagram of PSEC4 functionality. The RF-input signal is AC coupled and terminated in 50Ω off-chip. Thedigital signals (listed on right) are interfaced with an FPGA for PSEC4 control. A 40 MHz write clock is fed to the chip andup-converted to ∼10 GSa/s with a 256-stage voltage-controlled delay line (VCDL). A ‘write strobe’ signal is sent from eachstage of the VCDL to the corresponding sampling cell in each channel. The write strobe passes the VCDL-generated samplingrate to the sample-and-hold switches of each SCA cell. Each cell is made from a switched capacitor sampling cell and integratedADC register, as shown in Figure 3. The trigger signal ultimately comes from the FPGA, in which sampling on every channelis halted and all analog samples are digitized. The on-chip ramp-compare ADC is run with a global analog ramp generatorand 1 GHz clock that are distributed to each cell. Once digitized, the addressed data are serially sent off-chip on a 12-bit busclocked at up to 80 MHz.

ADC Clock Read_enableV_ramp

12 bit register

C_sample

T1

T2

(~1 GHz)Write Strobe

Trigger

V_in

Data_out<12..1>

V_ped

+

−

Figure 3: Simplified schematic of the ‘vertically integrated’ PSEC4 cell structure. The sampling cell input, Vin, is tied to theon-chip 50Ω input microstrip line. Transistors T1 and T2 form a dual-CMOS write switch that facilitates the sample-and-holdof Vin on Csample, a 20 fF capacitance referenced to V ped. The switch is toggled by the VCDL write strobe while sampling(Figure 2) or a ASIC-global trigger signal when an event is to be digitized. When the ADC is initiated, a global 0.0-1.2 Vanalog voltage ramp is sent to all comparators, which digitizes the voltage on Csample using a fast ADC clock and 12-bitregister. To send the digital data off-chip, the register is addressed using Read enable.

4

input signal.169

To servo-control the VCDL at a specified sam-170

pling rate and to compensate for temperature ef-171

fects and power supply variations, the VCDL can172

be delay-locked on chip. The VCDL forms a delay-173

locked loop (DLL) when this servo-controlled feed-174

back is enabled. The servo-control circuit is made175

of a dual phase comparator and charge pump circuit176

to lock both the rising and falling edges of the write177

clock at a fixed one-cycle latency [22]. A loop-filter178

capacitor is installed externally to tune the DLL179

stability.180

With this DLL architecture, a write clock with181

frequency fin is provided to the chip, and the182

sampling is started automatically after a locking183

time of several seconds. The nominal sampling184

rate in GSa/s is set by 0.256·fin [MHz], and the185

sampling buffer depth in nanoseconds is given by186

103/fin [MHz−1]. A limitation of the PSEC4 de-187

sign is the relatively small recording depth at high188

sampling rates due to the buffer size of 256 samples.189

2.2. Sampling and Triggering190

A single-ended, 256-cell SCA was designed and191

implemented on each channel of PSEC4. Each sam-192

pling cell circuit is made from a dual CMOS write193

switch and a metal-insulator-metal sampling capac-194

itor as shown in Figure 3. With layout parasitics,195

this capacitance is effectively 20 fF. During sam-196

pling, the write switch is toggled by the write strobe197

from the VCDL. To record an event, an external198

trigger, typically from an FPGA, overrides the sam-199

pling and opens all write switches, holding the ana-200

log voltages on the capacitor for the ADC duration201

(≤4 µs).202

The PSEC4 has the capability to output a203

threshold-level trigger bit on each channel. The in-204

ternal trigger is made from a fast comparator, which205

is referenced to an external threshold level, and dig-206

ital logic to latch and reset the trigger circuit. To207

form a PSEC4 trigger, the self-trigger bits are sent208

to the FPGA, which returns a global trigger signal209

back to the chip. Triggering interrupts the sampling210

on every channel, and is held until the selected data211

are digitized and read out.212

2.3. ADC213

Digital conversion of the sampled waveforms is214

done on-chip with a single ramp-compare ADC that215

is parallelized over the entire ASIC5. Each sam-216

ple cell has a dedicated comparator and 12 bit217

register as shown in Figure 3. In this architec-218

ture, the comparison between each sampled voltage219

(Vsample) and a global ramping voltage (Vramp),220

controls the clock enable of a 12-bit register. When221

Vramp > Vsample, the register clocking is disabled,222

and the 12-bit word, which has been encoded by the223

ADC clock frequency and the ramp duration below224

Vsample, is latched and ready for readout.225

Embedded in each channel is a 5-stage ring oscil-226

lator that generates a fast digital ADC clock, ad-227

justable between 200 MHz and 1.4 GHz. The ADC228

conversion time, power consumption, and resolu-229

tion may be configured by adjusting the ramp slope230

or by tuning the ring oscillator frequency.231

2.4. Readout232

The serial data readout of the register bits is per-233

formed using a shift register ‘token’ architecture,234

in which a read enable pulse is passed sequentu-235

ally along the ADC register array. To reduce the236

chip readout latency, a limited selection of PSEC4’s237

1536 registers can be read out. Readout address-238

ing is done by selecting the channel number and a239

block of 64 cells. While not completely random ac-240

cess, this scheme permits a considerable reduction241

in dead time. At a maximum rate of 80 MHz, the242

readout time is 0.8 µs per 64-cell block.243

The readout latency is typically the largest con-244

tributor to the dead-time of the chip. The ADC245

conversion time also adds up to 4 µs of latency246

per triggered event. These two factors limit the247

sustained trigger rate to ∼200 kHz/channel or248

∼50 kHz/chip.249

5An overview of this ADC architecture can be found inreference [23].

5

2

4

6

8

10

12

14

16

18

20

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8

Sa

mp

ling

Ra

te [

GS

a/s

]

Voltage Control [V]

MeasuredFit to DataSimulated

Figure 4: Sampling rate as a function of VCDL voltage control. Good agreement is shown between post layout simulation andactual values. Rates up to 17 GSa/s are achieved with the free-running PSEC4 VCDL. When operating the VCDL withoutfeedback, the control voltage is explicitly set and the sampling rate is given by 17.7 (1 − 0.018 exp(5.91 · Vcontrol)) [GSa/s].Typically, the servo-locking will be enabled and the VCDL is run as a delay-locked loop (DLL). In this case, the sampling rateis automatically set by the input write clock frequency.

Figure 5: The PSEC4 evaluation board. The board uses a Cyclone III Altera FPGA (EP3C25Q240) and a USB 2.0 PCinterface. Custom firmware and acquisition software were developed for overall board control. The board uses +5 V power anddraws <500 mA, either from a DC supply or the USB interface.

3. Performance250

Measurements of the PSEC4 performance have251

been made with several chips on custom evalu-252

ation boards shown in Figure 5. The sampling253

rate was fixed at a nominal rate of 10.24 GSa/s.254

Here we report on bench measurements of lin-255

earity (§3.1), analog leakage (§3.2), noise (§3.3),256

power (§3.4), frequency response (§3.5), sampling257

calibrations (§3.6), and waveform timing (§3.7). A258

summary table of the PSEC4 performance is shown259

in §3.8.260

3.1. Linearity and Dynamic Range261

The input dynamic range is limited by the 1.2 V262

core voltage of the 0.13 µm CMOS process [18]. To263

enable the recording of signals with pedestal levels264

that exceed this range, the input is AC coupled and265

a DC offset is added to the 50 Ω termination. This266

is shown in the Figure 2 block diagram, in which267

6

-20

0

20

0 0.2 0.4 0.6 0.8 1 1.2Input Voltage [V]

Fit Residuals 0

500

1000

1500

2000

2500

3000

3500

4000

PS

EC

4 o

utp

ut [A

DC

counts

]

Raw Pedestal ScanLinear fit

Figure 6: DC response of the device running in 12 bit mode.The upper plot shows raw data (red points) and a linear fitover the the same dynamic range (dotted black line, slope of4 counts/mV). The fit residuals are shown in the lower plot.A differential non-linearity (DNL) of better than 0.15% isobserved for input signals between 0.2 V and 1.0 V beforeany calibrations.

the DC offset is designated by V ped. The offset268

level is tuned to match the input signal dynamic269

range to that of PSEC4.270

The PSEC4 response to a linear pedestal scan271

is shown in Figure 6. A dynamic range of 1 V272

is shown, as input signals between 100 mV and273

1.1 V are fully coded with 12 bits. A differential274

non-linearity (DNL) of better than 0.15% is shown275

for most of that range. The linearity and dynamic276

range near the voltage rails are limited due to tran-277

sistor threshold issues in the comparator circuit.278

The DNL of this response, shown by the linear fit279

residuals in Figure 6, can be corrected by creating280

an ADC count-to-voltage look-up-table (LUT) that281

maps the input voltage to the PSEC4 output code.282

The raw PSEC4 data is converted to voltage and283

‘linearized’ using this LUT.284

3.2. Sample Leakage285

When triggered, the write switch on each cell286

is opened and the sampled voltage is held at287

high impedance on the 20 fF capacitor (Fig. 3).288

Two charge leakage pathways are present: 1)289

sub-threshold conduction through the write switch290

[V]ped - VsampleV-0.6 -0.4 -0.2 0 0.2 0.4 0.6

Leak

age

Cur

rent

[fA

]

-400

-300

-200

-100

0

100

200

300

400

500

SPICE simulation:

Write Switch Leakage

NFET tunneling

Figure 7: The PSEC4 leakage as a function of input voltage.The measured leakage is shown by the histogrammed datapoints. Results from a 0.13 µm CMOS spice simulationare also included. The simulation shows the leakage currentcontributions from 1) sub-threshold conduction through thedisengaged write switch; and 2) gate-oxide tunneling fromthe NFET in the input stage of the comparator.

formed by transistors T1 and T2; and 2) gate-oxide291

tunneling through the NFET at the comparator in-292

put. The observable leakage current is the sum of293

these two effects.294

To measure the leakage current, a 300 ns wide,295

variable-level pulse was sent to a single PSEC4296

channel. Since the sampling window is 25 ns, each297

SCA cell sampled the transient level. After trigger-298

ing, the sampled transient voltage was repeatedly299

digitized at 1 ms intervals and the change in voltage300

on the capacitor was recorded over a 10 ms storage-301

time.302

The PSEC4 leakage current as a function of input303

voltage over the full 1 V dynamic range is shown in304

Figure 7. A pedestal level VDD/2 = 0.6 V was set305

at the input. The measured leakage is shown in the306

2-D histogram. A large spread (RMS ∼70 fA) is307

seen at each voltage level. Results from a 0.13 µm308

CMOS spice simulation show that the write-switch309

leakage is the dominant pathway. A small amount310

(≤100 fA) of NFET gate-oxide tunneling is also311

consistent with the data.312

In normal operation, the ADC is started imme-313

diately after a trigger is registered. In this case,314

the analog voltage hold time is limited to the ADC315

7

<Noise>Entries 89244Mean 0.02628RMS 0.7038

Readout [mV]-4 -2 0 2 4 6 8

Cou

nts/

(0.2

5 m

V)

0

2

4

6

8

10

12

310×

<Noise>Entries 89244Mean 0.02628RMS 0.7038

Fit Parameters:mean: 0.03 mVsigma: 0.68 mV

Figure 8: A PSEC4 baseline readout showing the electronicnoise. The data are recorded from single channel after offsetcorrection. The RMS value of ∼700 µV is representative ofthe electronics inherent noise on all channels.

conversion time. Assuming a constant current, the316

leakage-induced voltage change is given by317

∆V =Ileakage ∆t

Csample(2)318

where ∆t is the ADC conversion time. With the319

maximum leakage current of ±500 fA and a con-320

version time of 4 µs, ∆V is ±100 µV. This value is321

at least 5× lower than the electronics noise.322

3.3. Noise323

After fixed-pattern pedestal correction and324

event-by-event baseline subtraction, which removes325

low-frequency noise contributions, the PSEC4 elec-326

tronic noise is measured to be ∼700 µV RMS on327

all channels as shown in Figure 8. The noise figure328

is dominated by broadband thermal noise on the329

20 fF sampling capacitor, which contributes 450 µV330

(RMS 60 electrons) at 300 K. Other noise sources331

include the ADC ramp generator and comparator.332

The noise corresponds to roughly 3 least significant333

bits (LSBs), reducing the effective resolution of the334

device to 10.5 bits over the dynamic range.335

3.4. Power336

The power consumption is dominated by the337

ADC, which simultaneously clocks 1536 ripple338

150

200

250

300

350

200 400 600 800 1000 1200 1400

PS

EC

4 P

ow

er

[mW

]

ADC Clock Freq. [MHz]

0.14 mW/MHz

Figure 9: The total PSEC4 power as a function of the ADCclock rate. Clock rates between 200 MHz and 1.4 GHz can beselected based on the power budget and targeted ADC speedand resolution. When the ADC is not running, the quiescent(continuous sampling) power consumption is ∼40 mW perchip.

counters and several hundred large digital buffers at339

up to 1.4 GHz. The total power draw per chip as a340

function of ADC clock rate is shown in Figure 9. To341

reduce the steady state power consumption and to342

separate the chip’s digital processes from the ana-343

log sampling, the ADC is only run after a trigger is344

sent to the chip. Without a trigger, the quiescent345

power consumption is ∼40 mW per chip, including346

the locked VCDL sampling at 10.24 GSa/s and the347

current biases of all the comparators.348

Initiating the ADC with a clock rate of 1 GHz349

causes the power draw to increase from 40 mW to350

300 mW within a few nanoseconds. To mitigate351

high-frequency power supply fluctuations when352

switching on the ADC, several ‘large’ (2 pF) de-353

coupling capacitors were placed on-chip near the354

ADC. These capacitors, in addition with the close-355

proximity evaluation board decoupling capacitors356

(∼0.1-10 µF), prevent power supply transients from357

impairing chip performance.358

At the maximum PSEC4 sustained trigger rate359

of 50 kHz, in which the ADC is running 20% of360

the time, a maximum average power of 100 mW is361

drawn per chip.362

8

-5

-4

-3

-2

-1

0

1

2

3

4

0.1 1.0 2.0

Am

plit

ude [dB

]

Frequency [GHz]

500mVpp (-2dBm)50mVpp (-22dBm)

Figure 10: The PSEC4 frequency response. The -3 dB ana-log bandwidth is 1.5 GHz. The positive resonance above1 GHz is due to bondwire inductance of the signal wires inthe chip package. Similar responses are shown for large andsmall sinusoidal inputs.

-50

-45

-40

-35

-30

-25

-20

0.1 1.0 2.0

Cro

ssta

lk A

mplit

ude [dB

]

Frequency [GHz]

Ch. 1Ch. 2Ch. 4Ch. 5Ch. 6

Figure 11: The channel-to-channel crosstalk as a function offrequency. Channel 3 was driven with a -2 dBm sinusoudalinput. Adjacent channels see a maximum of -20 dB crosstalkat 1.1 GHz. The electronic noise floor is -50 dB for reference.

3.5. Frequency Response363

The target analog bandwidth for the PSEC4 de-364

sign was ≥1 GHz. The bandwidth is limited by the365

parasitic input capacitance (Cin), which drops the366

input impedance at high frequencies6 as367

|Zin| =Rterm√

1 + ω2 Rterm Cin

(3)368

where Rterm is an external 50 Ω termination re-369

sistor. Accordingly, the expected half-power band-370

width is given by:371

f3dB =1

2π Rterm Cin(4)372

The extracted Cin from post-layout studies was373

∼2 pF, projecting a -3 dB bandwidth of 1.5 GHz374

which corresponds to the measured value shown in375

Figure 10. The chip package-to-die bondwire in-376

ductance gives a resonance in the response above377

1 GHz that distorts signal content at these frequen-378

cies. An external filter may be added to flatten the379

response.380

The measured channel-to-channel crosstalk is381

-25 dB below 1 GHz for all channels as shown in382

Figure 11. For frequencies less then 700 MHz, this383

drops to better than -40 dB. The primary crosstalk384

mechanism is thought to be the mutual induc-385

tance between signal bondwires in the chip pack-386

age. High frequency substrate coupling on the chip387

or crosstalk between input traces on the PSEC4388

evaluation board may also contribute.389

3.6. Sampling Calibration390

For precision waveform feature extraction, both391

the overall time-base of the VCDL and the cell-to-392

cell time step variations must be calibrated. With393

the rate-locking DLL, the overall PSEC4 sampling394

time base is stably servo-controlled at a default rate395

of 10.24 GSa/s. The time-base calibration of the396

individual 256 delay stages, which vary due to cell-397

to-cell transistor size mismatches in the VCDL, is398

the next task. Since this is a fixed-pattern varia-399

tion, the time-base calibration is typically a one-400

time measurement.401

6This ignores negligible contributions to the impedancedue to the sampling cell input coupling. The write switchon-resistance (≤ 4 kΩ over the full dynamic range) and the20 fF sampling capacitance introduce a pole at ≥2 GHz.

9

Time calibrationEntries 256Mean 0.09601RMS 0.01202

Time-base calibration constants [ns]0.04 0.06 0.08 0.1 0.12 0.14 0.16 0.18 0.2

Ent

ries/

(6.5

ps)

0

10

20

30

40

50

Time calibrationEntries 256Mean 0.09601RMS 0.01202

Fit Parameters:

mean: 95.9 ps

sigma: 12.1 ps

PSEC4 sampling cell no.0 50 100 150 200 250

Tim

e [n

s]

-0.05

0

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4Time-base:

DNLINL

DLL wrap-around offset

Figure 12: LEFT: A histogram of the extracted time-base calibration constants (∆t). These values are calculated using thezero-crossing technique and are used to correct the sampling time-base of the PSEC4 chip. A 13% spread in the ∆t values isobserved. The average sampling rate over these cells is found to be 10.4 GSa/s, slightly higher than the nominal value.RIGHT: The differential (DNL) and integral non-linearity (INL) of the PSEC4 time-base. The extracted ∆t’s are comparedto an ideally linear time-base with equal time-steps per sample point. The large time-step at the first sample bin is causedby a fixed DLL latency when wrapping the sampling from the last cell to the first. With the servo-locking DLL the INL isconstrained to be zero at the last cell.

Time [ns]0 2 4 6 8 10 12 14 16 18 20

Am

plitu

de [m

V]

-200

-150

-100

-50

0

50

100

150

200

Figure 13: A 10.24 GSa/s capture of a 400 MHz sine input is shown (black dots) after linearity correction and time-basecalibration. A fit (red dotted line) is applied to the data.

The brute force ‘zero-crossing’ time-base cali-402

bration method is employed [24]. This technique403

counts the number of times a sine wave input404

crosses zero voltage at each sample cell. With405

enough statistics, the corrected time per cell is ex-406

tracted from the number of zero-crossings (Nzeros)407

using408

< ∆t >=Tinput < Nzeros >

2Nevents(5)409

where Tinput is the period of the input and Nevents410

is the number of digitized sine waveforms. A typi-411

cal PSEC4 time-base calibration uses 105 recorded412

events of 400 MHz sinusoids.413

The variation of the time-base sampling steps is414

∼13% as shown in the left plot of Figure 12. Due415

to a relatively large time step at the first cell, the416

average sampling rate over the remaining VCDL417

cells is 10.4 GSa/s, slightly higher than the nominal418

rate.419

The non-linearity of the PSEC4 time-base is420

shown in the right plot of Figure 12. Each bin in421

the plot is indicative of the time-base step between422

10

Time [ns]0 2 4 6 8 10

Am

plitu

de [V

]

0

0.05

0.1

0.15

0.2

0.25 Rising-edge Fit:

/ndf: 19.5/212χ: 5.591 ns0t

: 537 psσ

t_diffEntries 1595Mean 200.1RMS 2.579

Time Difference [ps]190 195 200 205 210 215 220 225

Ent

ries/

750

fem

tose

cond

s

0

20

40

60

80

100

120t_diff

Entries 1595Mean 200.1RMS 2.579

Fit Parameters:

mean: 200.0 ps

sigma: 2.55 ps

Figure 14: LEFT: An example PSEC4 digitized pulse and off-line fit that was used for the timing resolution measurement.A 1.25 ns FWHM Gaussian pulse was split to two channels of the chip. The digitized waveform (black dots) is captured at10.24 GSa/s and is shown after applying the time-base calibration constants. The timing was extracted using a Gaussian-functional fit to the leading edge of the waveform (red line). A voltage error of 1.5 mV, which corresponds to the RMS baselinefluctuations, is included on each sample point of the waveform to obtain the χ2 value. RIGHT: The PSEC4 2-channel timingresolution. The timing resolution is 2.6 ps RMS when running at 10.24 GSa/s. A fast pulse was split to two channels of thechip, as shown on the left. The time difference between the two channels was extracted by fitting the digitized waveforms.

the binned cell and its preceding neighbor cell. The423

relatively large DNL in the first bin, which corre-424

sponds to the delay between the last (cell 256) and425

first sample cells, is caused by a fixed DLL latency426

when wrapping the sampling from the last cell to427

the first.428

A digitized 400 MHz sine wave is shown in Fig-429

ure 13 after applying the time-base calibration con-430

stants.431

3.7. Waveform Timing432

The effective timing resolution of a single mea-433

surement is calculated by waveform feature extrac-434

tion after linearity and time-base calibration. A435

0.5 Vpp, 1.25 ns FWHM Gaussian pulse was cre-436

ated using a 10 GSa/s arbitrary waveform gener-437

ator (Tektronix AWG5104). The output of the438

AWG was sent to 2 channels of the ASIC us-439

ing a broadband-RF 50/50 splitter (Mini-Circuits440

ZFRSC-42). This pulse, as recorded by a channel441

of PSEC4, is shown on the left in Figure 14.442

A least-squares Gaussian functional fit is per-443

formed to the leading edge of the pulse. The pulse444

times from both channels are extracted from the fit445

and are subtracted on an event-by-event basis. A446

2-channel RMS timing resolution of 2.6 ps is found447

as shown on the right in Figure 14.448

3.8. Performance Summary449

The performance and key architecture parame-450

ters of PSEC4 are summarized in Table 1.451

4. Application to Large-Area Photodetec-452

tors453

The first application of PSEC4 is the front-end454

waveform digitization of large-area photodetectors455

with picosecond-level time resolution [25, 26]. The456

LAPPD MCP-PMT is a 20×20 cm2 (8×8 in2) her-457

metically packaged photodetector with a 30 chan-458

nel RF microstrip anode signal pick-off [27]. The459

1-dimensional transmission line anode design is op-460

timized for precise spatial resolution with an effe-461

cient use of electronics channels. The (x,y) position462

of the incident particle or photon is extracted by us-463

ing the differential times of waveforms at the two464

microstrip terminals (x), and the relative charge465

captured on neighboring strips (y) [27]. Waveform466

sampling, matched to the MCP bandwidth, allows467

11

Table 1: PSEC4 architecture parameters and measured performance results.

Parameter Value Comment

Channels 6 die size constraintSampling Rate 4-15 GSa/s servo-locked on-chipSamples/channel 256 25 ns recording window at 10.24 GSa/sAnalog Bandwidth 1.6 GHz ∼2.5 dB distortion at 1.3 GHzCrosstalk 7% max. over bandwidth

<1% typical for signals <800 MHzNoise 700 µV RMS (typical). RF-shielded enclosure.Effective ADC Resolution 10.5 bits 12 bits loggedADC time 4 µs max. 12 bits logged at 1 GHz clock speed

250 ns min. 8-bits logged at 1 GHzADC clock speed 1.4 GHz max.Dynamic Range 1 V after linearity correctionReadout time 0.8n µs n is number of 64-cell blocks to read (n = 24 for entire chip)Sustained Trigger Rate 50 kHz max. per chip. Limited by [ADC time + Readout time]−1

Power Consumption 100 mW max. average powerCore Voltage 1.2 V 0.13 µm CMOS standard

Figure 15: The initial PSEC4 application: a high-channel density waveform digitization of a large-area Micro-Channel Plate(MCP) RF microstip anode. The two readout boards use five PSEC4 ASICs each to digitize 30 anode strips at both terminals.The active area of the central detector is 20×20 cm2.

12

Time [ns]5 10 15 20 25

Vol

tage

[V]

-0.35

-0.3

-0.25

-0.2

-0.15

-0.1

-0.05

0

0.05

0.1

Figure 16: PSEC4 digitization of 20×20 cm2 MCP pulses.The pulses are recorded on both ends of a microstrip anodeusing the PSEC4 evaluation board (Fig. 5). The amplitudecorresponds to ∼10 photo-electrons.

for both the time and charge extraction to deter-468

mine the (x,y) position, in addition to the time-of-469

arrival and energy of the incident particle or pho-470

ton.471

A compact, detector integrated data acquisition472

(DAQ) system was designed for the LAPPD MCP-473

PMTs. The front-end microstrip anode waveform474

digitization board shown in Figure 15, in which five475

PSEC4 ASICS are used on each end to capture476

waveforms from all 30 strips. The board maintains477

a 50Ω impedance between the anode output and478

the chip input. The back-end FPGA and clock-479

distribution boards (not shown) can be mechani-480

cally mounted behind the LAPPD MCP-PMT.481

The ‘single-tile’ readout configuration is shown482

in Figure 15. Typical MCP pulses from this config-483

uration, as recorded by PSEC4, are shown in Fig-484

ure 16. Depending on the event rate of the applica-485

tion, the detector active area may be increased by486

serially connecting the microstrip anodes of adja-487

cent LAPPD MCP tiles using a common front-end488

PSEC4 digitizer board and DAQ system [27].489

5. Conclusion490

We have described the architecture and perfor-491

mance of the PSEC4 waveform digitzing ASIC.492

The advantages of implementing waveform sam-493

pling IC design in a deeper sub-micron process are494

shown, with measured sampling rates of up to 15495

GSa/s and analog bandwidths of 1.5 GHz. Poten-496

tial 0.13 µm design issues, such as leakage and dy-497

namic range, were optimized and provide a 1 V498

dynamic range with sub-mV electronics noise. A499

one-time timebase calibration is required to get pre-500

cise waveform timing with 2-3 picosecond resolu-501

tion. The first application of the PSEC4 ASIC is502

the compact, low-power front-end waveform sam-503

pling of LAPPD MCP-PMTs.504

6. Acknowledgements505

We thank Mircea Bogdan, Fukun Tang, Mark Za-506

skowski, and Mary Heintz for their strong support507

in the Electronics Development Group of the En-508

rico Fermi Institute. Stefan Ritt, Eric Delagnes,509

and Dominique Breton provided invaluable guid-510

ance and advice on SCA chips.511

This work is supported by the Department of En-512

ergy, Contract No. DE-AC02-06CH11357, and the513

National Science Foundation, Grant No. PHY-106514

601 4.515

References516

[1] J.-F. Genat, G. Varner, F. Tang, H.J. Frisch, “Signal517

Processing for Pico-second Resolution Timing Measure-518

ments”, Nucl. Instr. Meth. A 607 (2009) 387-393.519

[2] D. Breton, E. Delagnes, J. Maalmi, K. Nishimura, L.L.520

Ruckman, G. Varner, J. Va’vra, “High Resolution Pho-521

ton Timing with MCP-PMTs: A Comparison of Com-522

mercial Constant Franction Discriminator (CFD) with523

ASIC-based waveform digitizers TARGET and Wave-524

Catcher”, Nucl. Instr. Meth A 629 (2011) 123-132.525

[3] S. Kleinfelder, “Development of a Switched Capacitor526

Based Multi-Channel Transient Waveform Recording527

Integrated Circuit”, IEEE Trans. Nucl. Sci. 35, (1988)528

151-154.529

[4] G. Haller, B. Wooley, “A 700 MHz Switched Capacitor530

Analog Waveform Sampling Circuit”, SLAC-PUB-6414531

(1993).532

[5] S. Ritt, “Gigahertz Waveform Sampling: An Overview533

and Outlook”, 12th Pisa Meeting on Advanced Detec-534

tors, 23d May 2012.535

13

[6] G.S. Varner, L.L Ruckman, J.W. Nam, R.J. Nichol,536

J. Cao, P.W. Gorham, M. Wilcox, “The Large Ana-537

log Bandwidth Recorder and Digitizer With Ordered538

Readout (LABRADOR) ASIC”, Nucl. Instr. Meth. A539

583 (2007) 447-460.540

[7] E. Delagnes, Y. Degerli, P. Goret, P. Nayman, F. Tou-541

ssenel, P. Vincent, “SAM: A new GHz sampling ASIC542

for the H.E.S.S.-II Front-End Electronics”, Nucl. Instr.543

Meth. A 567 (2006) 21-26.544

[8] K. Bechtol, S. Funk, A. Okumra, L. Ruckman, A.545

simons, H. Tajima, J. Vandenbroucke, G. Varner,546

“TARGET: A Multi-Channel Digitizer Chip for Very-547

High-Energy Gamma-Ray Telescopes”, J. Astroparticle548

Physics 36 (2012) 156-165.549

[9] S. Kleinfelder, “Gigahertz Waveform Sampling and Dig-550

itization Circuit Design and Implementation”, IEEE551

Trans. Nucl. Sci. 50, (2003) 955-962.552

[10] G.S. Varner, P. Gorham, J. Cao, “Monolithic Multi-553

Channel GSa/s Transient Waveform Recorder for Mea-554

suring Radio Emissions from High Energy Particle Cas-555

cades”, Proc. SPIE Int. Soc. Opt. Eng 4858 (2003) 31.556

[11] C. Broennimann, R. Horisberger, R. Schnyder, “The557

Domino Sampling Chip: A 1.2 GHz Waveform Sam-558

pling CMOS Chip”, Nucl. Instr. Meth. A 420 (1999)559

264-269.560

[12] S. Ritt, “The DRS Chip: Cheap Waveform Digitization561

in the GHz Range”, Nucl. Instr. Meth. A 518 (2004)562

470-471.563

[13] J. Milnes, J. Howorth, “Picosecond Time Response564

Characteristics of Micro-channel Plate PMT Detec-565

tors”, SPIE USE, V. 8 5580 (2004) 89-100.566

[14] P. Eraerds, M. Legre, A. Rochas, H. Zbinden, N. Gisin,567

“SiPM for fast Photon-Counting and Multiphoton De-568

tection”, Optics Express, Vol. 15 (2007) 14539-14549.569

[15] S. Ritt, “The Role of Analog Bandwidth and570

S/N in Timing”, talk at The Factors that571

Limit Timing Resolution in Photodetectors,572

<http://psec.uchicago.edu/workshops/573

fast timing conf 2011/>, University of Chicago, Apr574

2011.575

[16] R.H. Dennard, F. H. Gaennsslen, H. N. Yu, V.L576

Rideout, E. Bassous, A.R. LeBlanc, “Design of Ion-577

Implemented MOSFETs with Very Small Physical Di-578

mensions,” IEEE J. Solid-State Circuits SC-9 (1974)579

256-268.580

[17] Y. Taur et. al., “CMOS Scaling into the Nanometer581

Regime”, Proc. IEEE. Vol. 85, No. 4, (1997) 486-504.582

[18] IBM Corporation, “CMRF8SF Model Reference583

Guide”, V.1.4.0.10 (2008).584

[19] The MOSIS Service. Wafer Electrical Test Data and585

SPICE Model Parameters. Run: V18B. Available586

on-line (accessed 4 Feb. 2013):587

<http:://www.mosis.com/pages/Technical/588

Testdata/ibm-013-prm>589

[20] E. Oberla, H. Grabas, M. Bogdan, H. Frisch, J.-F.590

Genat, K. Nishimura, G. Varner, A. Wong, “A 4-591

Channel Waveform Sampling ASIC in 0.13 µm CMOS592

for Front-End Readout of Large-Area Micro-Channel593

Plate Detectors”, Physics Procedia 37 (2012) 1690-594

1698.595

[21] M. Cooney, M. Andrew, K. Nishimura, L. Ruckman,596

G. Varner, H. Grabas, E. Oberla, J.-F. Genat, “Multi-597

purpose Test Structures and Process Characterization598

using 0.13 µm CMOS: The CHAMP ASIC”, Physics599

Procedia 37 (2012) 1699-1706.600

[22] H. Chang et al., “A Wide-Range Delay-Locked Loop601

With a Fixed Latency of One Clock Cycle”, IEEE J.602

Solid-State Circuits 37 (2002) 1021-1027.603

[23] O. Milgrome, S. Kleinfelder, M. Levi, “A 12 Bit Analog604

to Digital Converter for VLSI Applications in Nuclear605

Science”, IEEE Trans. Nucl. Sci. 39, (1992) 771-775.606

[24] K. Nishimura, A. Romero-Wolf, “A Correlation-Based607

Timing Calibration & Diagnostic Technique for Fast608

Digitization ASICs”, Physics Procedia 37 (2012) 1707-609

1714.610

[25] The Large-Area Picosecond Photo-Detectors Project611

web page: <http://psec.uchicago.edu>612

[26] M. Wetstein, B. Adams, A. Elagin, J. Elam, H. Frisch,613

Z. Insepov, V. Ivanov, S. Jokela, A. Mane, R. Obaid, I.614

Veryovkin, A. Vostrikov, R. Wagner, A. Zinovev et al.,615

to be submitted to Nucl. Instr. Meth. A (2013).616

[27] H. Grabas, R. Obaid, E. Oberla, H. Frisch, et. al., “RF617

Strip-Line Anodes for Psec Large-Area MCP-based618

Photodetectors”, to be published Nucl. Instr. Meth. A619

(2013).620

14

Elsevier Editorial System(tm) for Nuclear Inst. and...

Documents

Transcript of Elsevier Editorial System(tm) for Nuclear Inst. and...