CT Ultrasound Registration

9
Automatic CT-ultrasound registration for diagnostic imaging and image -guide d intervention Wolfga ng Wein a, * , Shelby Brunke b , Ali Khamene a , Matthew R. Callstrom c , Nassir Navab d a Imaging and Visualization Department, Siemens Corporate Research, Inc., 755 College Road East, Princeton, NJ 08540, USA b Siemens Medical Solutions, Inc., Ultrasound Division, Issaquah, WA, USA c Department of Diagnostic Radiology, Mayo Clinic, Rochester, MN, USA d Computer Aided Medical Procedures (CAMP), Technische Universität München, Germany a r t i c l e i n f o  Article history: Received 1 February 2008 Received in revised form 16 May 2008 Accepted 10 June 2008 Available online 19 June 2008 Keywords: CT Ultrasound Registration Fusion Image-guided Intervention a b s t r a c t The fusio n of tra cke d ul tra sou nd wit h CT has benets for a var iet y of clinical app lic ations, how ever ext en- sive man ual effo rt is usual ly requ ired for corr ect regis trati on. We deve loped new meth ods tha t allow one to simulate med ical ultr asou nd from CT in real- time, reprodu cing the majo rity of ultr ason ic imag ing effects. They are combined with a robust similarity measure that assesses the correlation of a combina- tion of signals extracted from CT with ultrasound, without knowing the inuence of each signal. This ser ves as the fou nd ati on of a fully aut omati c regist rat ion , that ali gnsa 3D ult ras ound sweep wit h the cor - responding tomographic modality using a rigid or an afne transformation model, without any manual interaction. These techniques were evaluated in a study involving 25 patients with indeterminate lesions in liver and kidney. The clinical setup, acquisition and registration workow is described, along with the evalu- ation of the registration accuracy with respect to physician-dened Ground Truth. Our new algorithm cor rec tly registers without any man ual int era cti on in 76% of the cases, the average RMS TREover mu lti ple targ et lesio ns throug hout the liver is 8.1 mm. Ó 2008 Elsevier B.V. All rights reserved. 1. Introduction Con vent ional 2D ultr asound systems can be equ ipp ed with position sensors to perform 3D acquisitions of whole organs, and to obtain spatial information during a procedure. This is usually denoted as ‘‘3D Freehand Ultrasound” in the technical community (wh ile ultr asou nd ven dor s ofte n decl are han d-s wep t ima ging without tracking as ‘‘Freehand”). The fusion of such 3D ultrasound imaging with tomographic modalities can be used not only to im- pro ve nav igati on and ultrasound-based guidanc e for inte rven tion al procedures, but also to improve diagnostic value, e.g. for assess- me nt of inde ter mi nate les io ns . Th is re quire s that the tar get anatomy is precisely registered between ultrasound and the pre- ope rati ve mo dali ty. Doi ng so in an auto mat ed man ner is ver y challenging, and is the main focus of this work. 1.1. Clinical context A common clinical problem is a patient with an indeterminate lesi on, contain ed in the liver or kidn ey, identied by compute d tomogr aphy (CT) or magnetic resonance imagi ng (MRI). Often, fur- ther clinical work-up involves characterization of the lesion with the use of ultrasound (US) imaging. US imaging is utilized because this modality can often determine wh ether the lesio n is like ly benign or malignant due to the characteristic appearance of many lesions. However, it is sometimes difcult to correlate CT or MRI ndings with US imaging due to inherent differences in the imag- ing me tho ds . US imag ing ha s severa l lim iti ng fac tor s tha t aff ect the use of thi s me tho do logy for lesional identicat io n an d for its us e as a guidance method. These factors include: (1) image acquisition is user dependent, (2) the eld of view is limited, (3) US images are typi call y acqu ired off true axial, sagi ttal, or coro nal planes with resu ltan t difcult y corr elat ing with thes e othe r cross-se ctio nal imaging methods, (4) lesional iden tication can be difcult due to its echogenicity relative to the organ that is interrogated, and (5) the quality of the imaging is affected by the physical character- istics of the patient and overlying structures such as ribs, subcuta- neous fat, and normal gas-containin g structures. In this context, the fusion of CT and ultrasound can improve the diagnostic value to an extent beyond the ‘‘sum” of the individual modalities, poten- tially sparing an invasive biopsy, where a tissue sample for further pathological examination is obtained. This leads to a second common clinical issue – the need to per- form percutaneous needle biopsy or ablation of an indeterminate or malignant lesion contained in the liver or kidney. In the most 1361-8415/$ - see front matter Ó 2008 Elsevier B.V. All rights reserved. doi:10.1016/j.media.2008.06.006 * Corresponding author. Tel.: +1 609 734 4477; fax: +1 609 734 3310. E-mail address: [email protected](W. Wein). Medical Image Analysis 12 (2008) 577–585 Contents lists available at ScienceDirect Medical Image Analysis journal homepage: www.elsevier.com/locate/media

Transcript of CT Ultrasound Registration

Page 1: CT Ultrasound Registration

8/8/2019 CT Ultrasound Registration

http://slidepdf.com/reader/full/ct-ultrasound-registration 1/9

Automatic CT-ultrasound registration for diagnostic imagingand image-guided intervention

Wolfgang Wein a, * , Shelby Brunke b , Ali Khamene a , Matthew R. Callstrom c, Nassir Navab d

a Imaging and Visualization Department, Siemens Corporate Research, Inc., 755 College Road East, Princeton, NJ 08540, USAb Siemens Medical Solutions, Inc., Ultrasound Division, Issaquah, WA, USAc Department of Diagnostic Radiology, Mayo Clinic, Rochester, MN, USAd Computer Aided Medical Procedures (CAMP), Technische Universität München, Germany

a r t i c l e i n f o

Article history:Received 1 February 2008Received in revised form 16 May 2008Accepted 10 June 2008Available online 19 June 2008

Keywords:CTUltrasoundRegistrationFusionImage-guided Intervention

a b s t r a c t

The fusion of tracked ultrasound with CT has benets for a variety of clinical applications, however exten-sive manual effort is usually required for correct registration. We developed new methods that allow oneto simulate medical ultrasound from CT in real-time, reproducing the majority of ultrasonic imagingeffects. They are combined with a robust similarity measure that assesses the correlation of a combina-tion of signals extracted from CT with ultrasound, without knowing the inuence of each signal. Thisserves as the foundation of a fully automatic registration, that aligns a 3D ultrasound sweep with the cor-responding tomographic modality using a rigid or an afne transformation model, without any manualinteraction.

These techniques were evaluated in a study involving 25 patients with indeterminate lesions in liverand kidney. The clinical setup, acquisition and registration workow is described, along with the evalu-ation of the registration accuracy with respect to physician-dened Ground Truth. Our new algorithmcorrectly registers without any manual interaction in 76% of the cases, the average RMS TREover multipletarget lesions throughout the liver is 8.1 mm.

Ó 2008 Elsevier B.V. All rights reserved.

1. Introduction

Conventional 2D ultrasound systems can be equipped withposition sensors to perform 3D acquisitions of whole organs, andto obtain spatial information during a procedure. This is usuallydenoted as ‘‘3D Freehand Ultrasound” in the technical community(while ultrasound vendors often declare hand-swept imagingwithout tracking as ‘‘Freehand”). The fusion of such 3D ultrasoundimaging with tomographic modalities can be used not only to im-prove navigation and ultrasound-based guidance for interventionalprocedures, but also to improve diagnostic value, e.g. for assess-ment of indeterminate lesions. This requires that the targetanatomy is precisely registered between ultrasound and the pre-operative modality. Doing so in an automated manner is verychallenging, and is the main focus of this work.

1.1. Clinical context

A common clinical problem is a patient with an indeterminatelesion, contained in the liver or kidney, identied by computedtomography (CT) or magnetic resonance imaging (MRI). Often, fur-

ther clinical work-up involves characterization of the lesion withthe use of ultrasound (US) imaging. US imaging is utilized becausethis modality can often determine whether the lesion is likelybenign or malignant due to the characteristic appearance of manylesions. However, it is sometimes difcult to correlate CT or MRIndings with US imaging due to inherent differences in the imag-ing methods. US imaging has several limiting factors that affect theuse of this methodology for lesional identication and for its use asa guidance method. These factors include: (1) image acquisition isuser dependent, (2) the eld of view is limited, (3) US images aretypically acquired off true axial, sagittal, or coronal planes withresultant difculty correlating with these other cross-sectionalimaging methods, (4) lesional identication can be difcult dueto its echogenicity relative to the organ that is interrogated, and(5) the quality of the imaging is affected by the physical character-istics of the patient and overlying structures such as ribs, subcuta-neous fat, and normal gas-containing structures. In this context,the fusion of CT and ultrasound can improve the diagnostic valueto an extent beyond the ‘‘sum” of the individual modalities, poten-tially sparing an invasive biopsy, where a tissue sample for furtherpathological examination is obtained.

This leads to a second common clinical issue – the need to per-form percutaneous needle biopsy or ablation of an indeterminateor malignant lesion contained in the liver or kidney. In the most

1361-8415/$ - see front matter Ó 2008 Elsevier B.V. All rights reserved.doi:10.1016/j.media.2008.06.006

* Corresponding author. Tel.: +1 609 734 4477; fax: +1 609 734 3310.E-mail address: [email protected] (W. Wein).

Medical Image Analysis 12 (2008) 577–585

Contents lists available at ScienceDirect

Medical Image Analysis

j ou rna l h omep a ge : www.e l s ev i e r. com/ loc a t e /med i a

Page 2: CT Ultrasound Registration

8/8/2019 CT Ultrasound Registration

http://slidepdf.com/reader/full/ct-ultrasound-registration 2/9

Page 3: CT Ultrasound Registration

8/8/2019 CT Ultrasound Registration

http://slidepdf.com/reader/full/ct-ultrasound-registration 3/9

which is reected straight back to the ultrasound transducer de-pends on the angle of incidence h (see Fig. 2 ):

D r ð Z 1 ; Z 2 ; hÞ ¼ ðcos hÞn Z 2 À Z 1 Z 2 þ Z 1

2

ð1Þ

The exponent n describes the heterogenity on the tissue interface,causing the amount of reection to be more or less narrow aroundits perpendicular. We lack detailed physical knowledge from CT,hence we use n ¼ 1, as it simplies the equations and producesgood results. Higher values would restrict the reections of non-perpendicular interfaces, possibly missing to extract some featuresfrom the CT intensities. On the other hand, the similarity measurethat will be explained later is to some extent capable of ignoringadditional information not present in ultrasound. The transmittedintensity t ð Z 1 ; Z 2 Þ does not depend on the angle of incidence, if refraction is neglected:

t ð Z 1 ; Z 2 Þ ¼ 1 ÀZ 2 À Z 1 Z 2 þ Z 1

2

¼4 Z 2 Z 1

ð Z 2 þ Z 1 Þ2 ð2Þ

The X-ray attenuation l measured by a CT scanner is approximatelyproportional to the tissue density, see Fig. 3 . Because tissue densityis in turn proportional to acoustic impedance (as c is assumed con-

stant), we can directly derive the incremental acoustic intensityreection from it

D r ð~ x;~dÞ ¼ ~dT r l ð~ xÞjr l ð~ xÞj

n jr l ð~ xÞj2l ð~ xÞ

2

ð3Þ

for n ¼ 1 : D r ð~ x;~dÞ ¼ ð~dT r l ð~ xÞÞjr l ð~ xÞjð2l ð~ xÞÞ2

ð4Þ

t ð~ xÞ ¼ 1 Àjr l ð~ xÞj2l ð~ xÞ

2

ð5Þ

where l ð~ xÞ is the CT attenuation value at position ~ x; r l ð~ xÞ its spa-tial derivative, and ~d a unit vector denoting the direction of theultrasound wave propagation. The scalar multiplication of ~d withthe normed CT gradient vector yields the angular dependencyequivalent to cos ðhÞ. The ultrasound wave intensity is reduced

according to t ð~ xÞ at each tissue interface, while D r ð~ x;~dÞ contributesto the wave intensity detected by the probe. Integrating over thisreection and transmission behavior along an ultrasonic scanlineyields:

I ð~ xÞ ¼ I 0 exp ÀZ k x

0

jr l ð~ x0 þ k~dÞj2l ð~ x0 þ k~dÞ !

2

dk0@ 1Að~dT r l ð~ xÞÞ

jr l ð~ xÞjð2l ð~ xÞÞ2

ð6Þ

where I 0 is the original intensity of the ultrasound pulse, we deneit as I 0 ¼ 1. In addition, we apply a log-compression, which ampli-es smaller reections. Its parameter a is visually determined, andresembles the dynamic range knob on the ultrasound machine. Thisyields the resulting value of the simulation:

r ð~ xÞ ¼logð1 þ aI ð~ xÞÞ

logð1 þ aÞð7Þ

For a linear array probe, the integral in Eq. 6 can be computedefciently by traversing the columns in the simulated ultrasoundimage from top to bottom while updating the transmittedintensity based on the interpolated CT intensity and gradient val-ues. For curvilinear arrays, we compute the image row-wise fromtop to bottom, while using an auxiliary channel storing theremaining transmitted ultrasound wave intensity (starting with 1in the rst row). For every pixel, this transmission value is re-

trieved by linear interpolation from two pixels in the row above,according to the ultrasound ray angle derived from the curvilineargeometry.

This provides a means to simulate large-scale ultrasonic reec-tion at tissue boundaries, and the related shadowing effects atstrong interfaces like bone. However, individual tissue types havespecic echogeneity and speckle patterns by themselves, basedon the microscopic tissue inhomogenities. There is no simple rela-tionship between tissue echogeneity and CT hounseld units,therefore we add an intensity mapping pðl ð~ xÞÞon a narrow soft-tissue range to the simulated large-scale reection r ð~ xÞ. We use asimple polynomial function, based on a number of correspon-dences (liver tissue, liver vasculature, kidney, gall bladder) be-tween CT/CTA intensities and tissue echogeneity in ultrasound,

see Fig. 4 . Fig. 5 depicts the simulation result for a transverse liverimage, computed from a native CT scan.

Table 1

CT Hounseld units l and physical properties of human tissues (density q , speed of sound c , acoustic impedance Z )

Material l q kgm 3 c m

sÀ Á Z kgðmm Þ2 s

Bone 1000 1912 4080 7.8Muscle 10 . . . 40 1080 1580 1.7Liver 40 . . . 60 1060 1550 1.64Blood 40 1057 1575 1.62

Kidney 30 1038 1560 1.62Brain 43 . . . 46 994 1560 1.55Water 0 1000 1480 1.48Fat À100 . . . À50 952 1459 1.38Air À1000 1.2 330 0.0004

specular reflection

θ

tissue interfaces

transducer

Z1 Z2

Z3

t0 t1 t2

r 1 r 2

Fig. 2. Principle of ultrasonic transmission and reection at multiple tissueinterfaces.

0 500 1000 1500 2000 25000.2

0.4

0.6

0.8

1

1.2

1.4

1.6

1.8

2

Hounsfield Units

D e n s

i t y

Fig. 3. Plot of CT Hounseld units against tissue density, values from Schneideret al. (1996) .

W. Wein et al. / Medical Image Analysis 12 (2008) 577–585 579

Page 4: CT Ultrasound Registration

8/8/2019 CT Ultrasound Registration

http://slidepdf.com/reader/full/ct-ultrasound-registration 4/9

2.2. Registration algorithm

2.2.1. Automatic frame selectionSince we simulate ultrasound imaging effects with respect to

the probe geometry, the original B-mode scan planes of the sweephave to be used rather than a 3D reconstruction. Neighboringframes of the freehand sweep contain similar information, hencewe use always the one out of n frames that has the highest imageentropy. This assures that frames which contain unique ne vascu-larity, that can be located in CT as well, are picked for registration.If two neighboring frames have the highest entropy out of theirgroup of n , only one of them (again with the highest entropy) isused. Furthermore, a threshold is used to discard frames at thebeginning and end of the sweep with little structures. In our exper-iments, n was dened to yield 20–30 frames per sweep forregistration.

2.2.2. Similarity measure considerationsIt seems appropriate to use statistical similarity metrics like

mutual information (MI) and correlation ratio (CR) for assessingthe correspondence of original CT and ultrasound intensities. Intheir general formulation, however, they do not work well forour registration problem, since there are too many possible cong-urations where the joint entropy is minimal (for MI), or the inten-sities fromone image can be predicted well from the other one (forCR). At correct alignment of CT and ultrasound, they typically pro-duce only a small local optimum. Known approaches for restrictingthe possible intensity distributions are distance metrics to jointhistograms learnt from correct registrations (see e.g. Guetteret al. (2005) , Kullback–Leibler divergence), as well as bootstrap-ping parameters for a polynomial intensity mapping in the actualregistration process itself ( Roche et al., 2001 ). In both cases, veryimportant information is disregarded, as e.g. small vascularity isessential for a correct registration within the liver, but due its

appearance on a relatively small fraction of the image content, itwould neither affect a joint histogram or a least-squares estimateof a polynomial intensity mapping. Since CT attenuation measure-ments are mostly reproducible, we will use the constant mappingfunction p dened in Section 2.1 in conjunction with a linearmodel.

2.2.3. The LC 2

similarity measureIn a correlation ratio framework, the parameters of the registra-tion transformation T are modied in order to maximize

CR ¼ 1 À P x2XðU ð xÞ À f ðl ðT ð xÞÞÞÞ2

jXjVar ðU Þð8Þ

with f denoting the mapping function which estimates the intensi-ties of the US image U from the transformed CT image l , and X theshared image domain. If a linear mapping f ðl Þ ¼ a l þ b is assumed,Eq. (8) can be directly related to the common normalized cross-cor-relation (NCC) similarity metric, see Roche et al. (1999) for thederivation.

For a pixel intensity in the US image, the relative contributionsof large-scale reections and general tissue echogeneity are

unknown. Hence both the mapped CT intensity pðl Þ and thesimulated reection r have to be integrated in a correlation frame-work with the US intensity. Using the notation p i ¼ pðl ðT ð~ xiÞÞÞ; r i ¼ r ðT ð~ xiÞÞ; u i ¼ U ð~ xiÞ for the intensity triple at acertain voxel, we dene the intensity function as

f ð~ xiÞ ¼a pi þ br i þ c ð9Þ

The unknown parameters a , b and c then have to minimize

M ab

c0B@

1CAÀ

u1

..

.

un

0BB@1CCA

2

; where M ¼

p1 r 1 1

..

....

..

.

pn r n 1

0BB@1CCA

ð10 Þ

Therefore the solution is

ab

c0B@1CA

¼ ðM T M ÞÀ1 M T u1

..

.

un

0BB@1CCA

¼ P p2i P pir i P pi

P pir i Pr 2i Pr i

P pi Pr i n0B@ 1CA

À1

P piu i

Pr iu i

Pu i0B@ 1CA

ð11 ÞDirect inversion of the symetric matrix M T M results in a closed-form solution for the parameters. They are then inserted in Eq. (8)to yield a novel registration similarity metric, which we denote Lin-ear Correlation of Linear Combination ðLC 2 Þ. It assesses the correla-tion of US intensities u i and a linear combination with unknownweights of signals p i; r i extracted from CT. The value of LC 2 is con-stant with respect to brightness and contrast changes of the US im-age (as NCC), but also independent to how much of the twodescribed physical effects contributes to the image intensities. The

1000 1050 1100 1150 1200 12500

50

100

150

200

250

μ

p

vasculature

vasculature

liver

liver kidney

gall bladder

Fig. 4. Intensity mapping p for CT (dashed) and portal-venous CTA (solid) softtissue. Note that the liver-vasculature relation is inverted in the two modalities.

Fig. 5. Simulation of ultrasonic effects from CT. The original images are depicted in (a) and (b). The simulated large-scale reection is shown in (c), the auxiliary transmission

image in (d). The nal simulation outcome is (e), where reection and echogeneity estimated from CT are combined. (a) CT, (b) ultrasound, (c) reection r , (d) transmission t and (e) simulation r þ p.

580 W. Wein et al. / Medical Image Analysis 12 (2008) 577–585

Page 5: CT Ultrasound Registration

8/8/2019 CT Ultrasound Registration

http://slidepdf.com/reader/full/ct-ultrasound-registration 5/9

latter is important, since e.g. hepatic vasculature or the gall bladderis represented mostly by p (different intensities due to echogeneityin ultrasound, no borders), while large-scale tissue interfaces corre-spond to r (strong edge in ultrasound, comparable intensities onboth sides).

This implicit computation of the parameters a , b and c duringevery poseevaluation equals a simultaneous optimization of radio-metric and geometrical registration parameters (with the radio-metric parameters providing information for simulating USimages from the CT volume). Fig. 6 illustrates this by showingthe simulated intensity according to a pi þ b r i þ c for an alignedand displaced image of the liver. For the aligned image, the param-eters a and b are higher, denoting a good least-squares matching of the structures. For the displaced image, they are very small, caus-ing both the vasculature and liver border to almost disappear. Thevalue of c is expressed in a normalized intensity range ½0 . . .1 .

Note: The fact that the weights for p and r are unknown, sug-gests that a higher-dimensional mutual information (HMI) ap-proach could be used as well ( Gan and Chung, 2005 ). Here, each p; r and u would represent one axis in a three-dimensional jointprobability distribution. We had investigated this approach, com-puting modied versions of MI and CR on a 3D joint histogram;it however resulted in an unstable similarity metric (see also Sec-tion 3.3 ). One reason is that the reection term r has an intensitydistribution containing mostly small values (no reections) andfew yet important large values (representing reections at tissueinterfaces). A non-linear histogram equalization approach wouldbe a pre-requesite to use r in such a framework. Another problem,as pointed out before in Section 2.2.2 , is the unconstrained hugenumber of intensity congurations that lead to a local optimum.

Eqs. (11) and (8) can be computed once for a set of US-CT imagepairs, individually for every frame in the set (as we did in Wein etal., 2007a ), or locally for arbitrary image regions (see Section 2.2.4below). In all but the rst case, the mean of the resulting LC 2 valuesconstitutes the cost function for optimization.

For disregarding ne speckle information in the registration(whose correspondence we cannot extract from CT), and speed-up of the computation, the US images are down-scaled to$ 128 Â 100 pixels (by averaging an integer number of pixels ineach dimension). The top 3 cm along the ultrasonic rays are ig-nored for the measure computation, since they contain only com-

pressed subcutaneous tissue (which we visually conrmed on alarge number of data sets).

2.2.4. Local LC 2 computationGenerally, it can not be assumed that Eq. (9) is valid over the

whole content of a registered image pair. One reason, as pointedout above, is that different anatomic structures yield differentweightings of the echogeneity and reection terms

pand

r . Apart

from that, every ultrasound machine features settings that locallyoptimize the image quality, such as the time gain compensation(TGC) curve or the number and location of focal zones. Last butnot least, orientation-dependant artifacts result in upper structuresinuencing the imaging of anatomy further away from the probe,which might not always be accurately reproduced by our simula-tion from CT.

We therefore compute the LC 2 similarity measure for localpatches centered around every pixel in each image pair. Thisextension is similar to the local normalized cross-correlation(LNCC) similarity metric, which has particularly proven useful for2D–3D registration ( Khamene et al., 2006 ). The patch size has tobe chosen correctly; if it is too small, Eq. (9) will always hold,therefore not decreasing the image similarity with larger misalign-ment. If it is too large, it does not hold for the correct alignment,since structures reecting different weightings for p and r shareone patch. In a robustness study (see Section 3.3 ) carried out fordifferent sizes, we obtained 11 Â 11 pixels as the optimal patchsize.

As a further advantage of the local LC 2 computation, we candrop the intensity mapping for contrasted CT scans ( Fig. 4 ). Theresulting inverse relationship between CT and US intensities of vasculature causes a negative a value for the respective patches(which would not be valid for the remainder of the image). This in-creases the local accuracy particularly of small vascular structures,as some ambiguity canbe introducedby themapping – one canseein Fig. 4 that up to three CT intensities are mapped to the same USintensity.

2.2.5. Optimization strategyAn initial estimate of the orientation is obtained fromthe track-

ing setup, if the patient is supine. Otherwise, an approximate anglearound the cranio-caudal axis is entered manually. The large-scaletranslation is determined by performing an exhaustive search of the translation space. Here, a skin surface clamping approach isused, skipping physically impossible transducer locations, as inWein et al. (2007b) . All congurations within the evaluated 3D-grid suggesting an optimum, are further locally optimized with re-spect to the translation. From the conguration which in turnyields the best result, all six parameters of the rigid transformationare rened. As an optional last step, an optimization is executed onall rigid and three selected afne transformation parameters(henceforth denoted semi-afne ). These are the two scaling param-eters and the one shearing of the sagittal plane, since respiratorymotion mainly causes deformation in that plane ( Rohlng andC.R. Maurer, 2004 ). For all local optimizations, the Amoeba Simplexalgorithm is used, as described in Press et al. (1992) , chapter 10.

3. Experiments

3.1. Setup

In order to evaluate the performance of the registration algo-rithm, a study on abdominal data of 25 patients was performed.Patients were included that had one or more indeterminate lesions

contained in either the liver or kidney, that measured P 0 :5 cmand 6 5 cm in diameter, diagnosed on a prior contrast-enhanced

Fig. 6. The effect of simultaneous simulation and registration. The left column

shows the simulation from CT using the parameters resulting from the LC 2

computation. (a) is well registered, (b) is 1 cm displaced.

W. Wein et al. / Medical Image Analysis 12 (2008) 577–585 581

Page 6: CT Ultrasound Registration

8/8/2019 CT Ultrasound Registration

http://slidepdf.com/reader/full/ct-ultrasound-registration 6/9

CT examination. Patients with cardiac pacemakers or debrillatorswere excluded from the study. No patients were excluded on thebasis of body habitus.

For focal liver lesions, a biphasic contrast-enhanced examina-tion of the abdomen (delay of 45 and 70 s) was performed. For fo-cal renal lesions, a triphasic contrast-enhanced examination (delayof 45, 70 and 180 s) was done. In either case, image reconstructionslice thickness of 6 2 mm was required.

Since the study is ultimately targeted towards the intervention-al application, the decision was made to use a magnetic trackingsystem. The small tracking sensor can be attached right to theultrasound transducer (optionally inside a sterile plastic wrap),and does not require a line of sight to the transmitter. This isadvantageous in the RFA setup with limited space (see Fig. 1 ),and allows tracking of needles as well. Our freehand ultrasoundsystem uses a 3D Guidance tracking system (Ascension TechnologyCorp., Burlington, VT USA) with the at transmitter option, whichslides below the patient mattress and avoids tracking errors in-duced by metal in the patient bed. Images from a Siemens Sequoiaultrasound machine are fed via progressive RGBS video into a PCwith frame grabber. The position sensor was afxed to the trans-ducer using hot-melt adhesive, and the method presented in Weinand Khamene (2008) was used to compute the spatial and tempo-ral calibration.

Transverse liver sweeps on 21 patients (see e.g. Fig. 7 ) and fourkidney sweeps, acquired during breath-hold on inspiration, wereused. They were co-registered with the portal-venous phase of the CT scans, due to the conspicuity of the portal and hepatic veinsin this phase of imaging. For optimal visualization, the ultrasoundexam was executed in various setups - 14x supine, 7x left posterioroblique (LPO), 1x right posterior oblique (RPO), 3x decubitus.

3.2. Registration results

After manually aligning each of the data sets, 5–9 point corre-spondences on anatomical landmarks, mostly vessel bifurcationsin the liver, were selected by an expert. In order to dene thoselandmarks truly in 3D, we visualized both original ultrasoundframes and an arbitrary number of cross-sections, compoundedusing the direct MPR technique presented in Wein et al. (2006) ,each with the respective CT plane (see Fig. 7 b). Using a linked poin-

ter and superimposition options, the physician could precisely lo-cate vessel bifurcations.

While the obtained point correspondences serve as duciallandmarks, additional target point correspondences were dened,depicting the lesions to be ablated. Typically, the center of multiplesmall lesions, or distinguished spots in larger lesions close toperipheral vasculature, were used, creating 1–5 point correspon-dences (in one atypical case 10), which indicate where the highestregistration accuracy is desired for optimal treatment. The pointlocalization errors are expected to be higher than for the ducials.

For registration based on point correspondences, the rigid mo-tion between ultrasound and CT was computed according to Walk-er et al. (1991) . Table 2 lists the mean, minimum and maximum of the root-mean-square (RMS) residual distances of the ducial andtarget points (henceforth denoted as ducial registration error FRE,target registration error TRE) for all data sets. They have been com-puted for point-based registration based on the ducials alone, andboth ducial/target points; as well as rigid & semi-afne automaticregistration using our methods.

The automatic registration completes for 19 patients with anaverage computation time of $ 28 s (C++ implementation executedon an Intel Core 2 Duo 2.2GHz notebook).Two difcult supine datasets, as well as three LPO and one decubitus data, had to be roughlymanually aligned in order for the automatic algorithm to converge.At the initial estimate, before the translation search, the FRE wasbetween 13 and 71 mm(the at transmitter of the positioning sys-tem was placed similarly below each patient).

In 59% of the cases, automatic afne registration yielded a lowerTRE than point-based registration of the ducials (55% for rigid).

Fig. 7. Typical liver data, from a patient with a large metastatis near the hepatic vein conuence. The green plane in (a) corresponds to the ultrasound reconstruction in thelower right. (a) transverse liver sweep and (b) CT MPR planes alongside ultrasound frame (top) and reconstruction (bottom).

Table 2

Average registration accuracy in mm for 25 patients, expressed as root-mean-squareducial registration error (FRE) and target registration error (TRE)

Mean Min Max

FRE TRE FRE TRE FRE TREPoint-based (ducial) 5.0 9.7 2.3 2.8 11.5 28.4Point-based (d. + target) 5.8 5.4 2.4 2.2 11.9 11.1Automatic rigid 10.4 9.0 4.5 3.0 18.7 22.1Automatic afne 9.5 8.1 2.8 3.0 15.3 21.5

582 W. Wein et al. / Medical Image Analysis 12 (2008) 577–585

Page 7: CT Ultrasound Registration

8/8/2019 CT Ultrasound Registration

http://slidepdf.com/reader/full/ct-ultrasound-registration 7/9

This can be attributed to the fact that the image-based techniqueincorporates information throughout the 3D sweep, trying to ndan alignment of all structures visible in both modalities. On theother hand, even careful denition of unique ducial landmarksdoes not necessarily guarantee a small TRE, particulary if the clin-ical targets are not in their vicinity. The FRE after point-based reg-istrationof the ducials canobviouslynot be lowered by automaticregistration, since it represents their residual error. It is composedmostly of inaccuracy in the point selection, tissue deformation be-tween the twomodalities, tracking and calibration errors. In 73%of Fig. 8. Longitudinal image of a right kidney, aligned by automatic rigid registration.

Fig. 9. Comparison of point-based, rigid and afne registration. The patient was positioned left posterior oblique (LPO), the Sequoia Clarify option was enabled, enhancingvasculature. The 1st and 3rd column depict a color overlay of CT and US, 2nd and 4th column show the original US (for better visualization, the reader is referred to the web

version of this article). Arrows point to anatomical clues that visualize the quality of alignment. (a) registration based on ducial point correspondences, FRE = 6.4 mm (b)automatic rigid registration, FRE = 9.8 mm and (c) automatic afne registration, FRE = 8.1 mm.

W. Wein et al. / Medical Image Analysis 12 (2008) 577–585 583

Page 8: CT Ultrasound Registration

8/8/2019 CT Ultrasound Registration

http://slidepdf.com/reader/full/ct-ultrasound-registration 8/9

Page 9: CT Ultrasound Registration

8/8/2019 CT Ultrasound Registration

http://slidepdf.com/reader/full/ct-ultrasound-registration 9/9

References

Arai, O., Shinomura, R., Mitake, T., Sawaki, A., Satake, H., 2006. Real-time virtualsonography (RVS) for breast imaging. Ultrasound in Medicine & Biology 32(Supplement 1), P107–P108.

Crocetti, L., Lencioni, R., DeBeni, S., See, T., Pina, C., Bartolozzi, C., 2008. Targetingliver lesions for radiofrequency ablation: an experimental feasibility studyusing a CT-US fusion imaging system. Investigative Radiology 43, 33–39.

Gan, R., Chung, A.C., 2005. Multi-dimensional mutual information based robust

imageregistration using maximum distance-gradient-magnitude. In: IPMI 2005Proceedings, vol. 3565 of Lecture Notes in Computer Science, pp. 210–221.Guetter, C., Xu, C., Sauer, F., Hornegger, J., 2005. Learning based non-rigid multi-

modal image registration using kullback–Leibler divergence. In: MICCAI 2005proceedings, vol. 3750 of Lecture Notes in Computer Science. pp. 255–262.

Khamene, A., Bloch, P., Wein, W., Sauer, F., Svatos, M., 2006. Automatic portal imagebased patient positioning for radiation therapy. Medical Image Analysis 10, 96–112.

Leroy, A., Mozer, P., Payan, Y., Troccaz, J., 2004. Rigid registration of freehand 3Dultrasound and CT-Scan kidney images. In: MICCAI Proceedings, p. 837ff.

Penney, G., Blackall, J., Hamady, M., Sabharwal, T., Adam, A., Hawkes, D., 2004.Registration of freehand 3D ultrasound and magnetic resonance liver images.Medical Image Analysis 8, 81–91.

Press, W., abd, W.T., Vetterling, S.T., Flannery, B., 1992. Numerical Recipes in C,second ed. CRC Press, Inc.

Roche, A., Malandain, G., Ayache, N., 1999. Unifying maximum likelihoodapproaches in medical image registration. Technical Report, INRIA.

Roche, A., Pennec, X., Malandain, G., Ayache, N., 2001. Rigid registration of 3Dultrasound with MR images: a new approach combining intensity and gradient

information. IEEE Transactions on Medical Imaging 20, 1038–1049.Rohlng, T., Maurer, J.C.R., 2004. Modeling liver motion and deformation during the

respiratory cycle using intensity-based nonrigid registration of gated MR images. Medical Physics 31, 427–432.

Schneider, U., Pedroni, E., Lomax, A., 1996. The calibration of CT hounseld units forradiotherapy treatment planning. Physics in Medicine and Biology 41, 111–124.

Sheafor, D.H., Paulson, E.K., Kliewer, M.A., DeLong, D.M., Nelson, R.C., 2000.Comparison of sonographic and ct guidance techniques: does ct uoroscopydecrease procedure time? American Journal of Roentgenology 174, 939–942.

Stippel, D., Böhm, S., Beckurts, K., Brochhagen, H., Hölscher, A., 2002. Experimentalevaluation of accuracy of radiofrequency ablation using conventionalultrasound or a third-dimension navigation tool. Langenbecks Archives of Surgery 387, 303–308.

vanSonnenberg, E., McMullen, W., Solbiati, L. (Eds.), 2005. Tumor Ablation:Principles and Practice. Springer.

Walker, M., Shao, L., Volz, R., 1991. Estimating 3-D location parameters using dualnumber quaternions. CVGIP: Image Understanding. 358–367.

Wein, W., Khamene, A., 2008. Image-based method for in-vivo freehand ultrasoundcalibration. In: SPIE Medical Imaging 2008, San Diego.

Wein, W., Pache, F., Röper, B., Navab, N., 2006. Backward-warping ultrasoundreconstruction for improving diagnostic value andregistration. In: MICCAI2006Proceedings, Lecture Notes in Computer Science, Springer, pp. 750–757.

Wein, W.,Khamene, A., Clevert, D., Kutter, O., Navab, N.,2007a. Simulation and fullyautomatic multimodal registration of medical ultrasound. In: MICCAI 2007Proceedings, vol. 4791 of Lecture Notes in ComputerScience, Springer, pp. 136–143.

Wein, W., Röper, B., Navab, N., 2007b. Integrating diagnostic B-modeultrasonography into CT-based radiation treatment planning. IEEETransactions on Medical Imaging 26, 866–879.

Wood, T., Rose, D., Chung, M., Allegra, D., Foshag, L., Bilchik, A., 2000.Radiofrequency ablation of 231 unresectable hepatic tumors: indications,limitations, and complications. Annals of Surgical Oncology 7, 593–600.

Zagzebski, J.A., 1996. Essentials Of Ultrasound Physics. Mosby.Zhu, Y., Magee, D., Ratnalingam, R., Kessel, D., 2007. A training system for

ultrasound-guided needle insertion procedures. In: MICCAI 2007 Proceedings,vol. 4791 of Lecture Notes in Computer Science, Springer, pp. 566–574.

W. Wein et al. / Medical Image Analysis 12 (2008) 577–585 585