Bayesian Modeling of Complex-valued fMRI Signals for Brain ... · Bayesian Modeling of...

Bayesian Modeling of Complex-valued fMRI Signals forBrain Activation

Cheng-Han Yu, Raquel Prado, Hernando Ombao, and Daniel B. Rowe

Joint Statistical Meetings, August 1, 2017

1 / 21

Motivation of using a complex-valued model

• Many kinds of data sets are complex-valued (CV), for example,imaging, radar, and sonar. fMRI data are complex-valued after FTand IFT image reconstruction. But most fMRI studies usemagnitude-only (MO) data and phase information is discarded.

2 / 21

Goal and Result

We propose a model that takes real and imaginary parts into account,then fast detect which voxels are activated. We find that the proposedBayesian complex-valued model has higher power and lower type I errorrate than traditional magnitude models.

3 / 21

Complex-valued linear regression: Rowe (2005)

• In fMRI studies, given time t = 1, . . . ,T and at voxel v = 1, . . . ,N,we have (Rowe-Logan constant phase)

y vt = ρvt cos(φv ) + iρvt sin(φv ) + εvt ,

ρvt = αv0 + αv

1x1,t + αv2x2,t + · · ·+ αv

pxp,t ,

•

yv1...y vT

=

1 x11 · · · xp1...

.... . .

...1 x1T · · · xpT

αv0 cos(φv ) + iαv

0 sin(φv )...

αvp cos(φv )︸︷︷︸

βvRe

+i αvp sin(φv )︸︷︷︸

βvIm

+

εv1...εvT

• yv = Xβv

Re + iXβvIm + εv , or yv = Xβv + εv

4 / 21

Complex-valued linear regression: yv = Xβv + εv

• X is a designed matrix formed by Blood Oxygen Level Dependentcontrasts (BOLD), which is convolution of an stimulus function froman experiment and a hemodynamic response function (HRF).

Source: Martin Lindquist (2008)

• In general, noise εvt = εvt,re + iεvt,im ∼ CN1(0, σ2, τ2)

5 / 21

Approach to identifying activations for yv = Xβv + εv

• Variable Selection: βvj 6= 0 iff voxel v at task j is activated.

• Complex normal spike-and-slab prior on βv :

βvj ∼ (1− γvj )CN(0, ω0, λ0)︸︷︷︸spike

+γvj CN(0, ω1, λ1)︸︷︷︸slab

, ω0 < ω1, λ0 < λ1.

• If γvj = 0, treat βvj as zero, and if γvj = 1, βvj is non-zero.

• CN(µ, σ2, 0): Circular normal that real and imaginary parts areindependent.

• Activation is inferred by borrowing information across voxels througha Bernoulli prior on γvj ∼ Ber(θvj = θj) with a common probability ofactivation for all voxels.

6 / 21

Model setup

We develop an complex-valued EM variable selection (C-EMVS)algorithm for fast detecting activation at the lowest voxel level.

yv = Xβv + εv , εv ∼ CNT (0, 2σ2v I, 0), v = 1, ...,N

βvj |γvjindep∼ (1− γvj )CN1(0, djσ

2vΓj , ejσ

2vCj) + γvj CN1(0, σ2vΓj , σ

2vCj),

dj << 1, ej << 1, j = 1, ..., p

γvj |θjIID∼ Ber(θj)

θjIID∼ Beta(aθ, bθ)

σ2v ∼ IG (aσ, bσ)

We determine if the real and imaginary parts of βvj are zero jointly:βvj 6= 0 if Pr(γvj = 1 | β∗, θ∗, σ∗, y) > 0.5, where ∗ means the posteriormode and 0.5 is the threshold value.

7 / 21

Simulation study: data generating

y vt,re = (β0 + βv

1 xbold,t) cos(π/4) + εvt,re , εvt,reIID∼ N(0, 0.25)

y vt,im = (β0 + βv

1 xbold,t) sin(π/4) + εvt,im, εvt,imIID∼ N(0, 0.25)

εvt ∼ CN(0, 0.5, 0)

y vt,mag =

√(y v

t,re)2 + (y vt,im)2, SNR = β0/σ, CNRv = βv

1/σ

8 / 21

Simulation study: modeling

• The model:

yv = Xβv + εv , εv ∼ CNT (0, 2σ2I , 0), v = 1, ...,V

βvj | γvjindep∼ (1− γvj )CN1(0, v02σ2, 0) + γvj CN1(0, 2σ2, 0),

σ2 ∝ 1/σ2

γvj | θjIID∼ Ber(θj)

θjIID∼ Beta(aθ = 1, bθ = 1)

• Voxel v is active if Pr(γvBOLD = 1 | β∗, θ∗, σ∗) > 0.5

SNR 0.5 1 5 10

CNR 0.5 1 1.5 0.5 1 1.5 0.5 1 1.5 0.5 1 1.5

Data No. 1 2 3 4 5 6 7 8 9 10 11 12

9 / 21

• The CV model detects more true positives than the MO when theSNR is small, leading to higher sensitivity, precision and accuracy.

• The CV model performs better in estimating strength.

10 / 21

SNR 0.5 1 5 10

CNR 0.5 1 1.5 0.5 1 1.5 0.5 1 1.5 0.5 1 1.5

Data No. 1 2 3 4 5 6 7 8 9 10 11 12

• EM performs better than lasso and adaptive lasso.• CV-EM performs consistently across different SNRs.

• CV-EM dominates when SNR is small; MO-EM is better when SNR is large.

11 / 21

SNR 0.5 1 5 10

CNR 0.5 1 1.5 0.5 1 1.5 0.5 1 1.5 0.5 1 1.5

Data No. 1 2 3 4 5 6 7 8 9 10 11 12

11 / 21

How to choose v0?

• From Rockova and George (2014) and Wang et al. (2015), choose v∗0that maximizes the Pr(γ | y) that evaluates γ containing only thosevariables for which γv = 1.

• Suggestion: Create a grid between 1/√

100Tp and 1/√

10Tp

12 / 21

Comparison to traditional GLM and different HRFs

13 / 21

Extension 1: AR(1) noises to human subject data

εvt,k = ϕv εvt−1,k + zvt,k , zvt,k ∼ N(0, σ2), k = Re, Im

p(ϕv ) ∼ Unif (−1, 1)

Figure : Data from Karaman, Bruce and Rowe (KBR, 2014) Left to right:KBR-CV, KBR-MO, DeTeCT-ING, EM-CV

• removes false positives outside the brain area, comparing to KBR-CV.• has higher detecting power than KBR-MO.• is comparable to the nonlinear sophisticated DeTeCT-ING model.

14 / 21

Extension 2: IID to Spatial dependence (Bezener (2015))

• The spatial dependence is governed by an underlying areal model.

• Parcellating the images into clusters of voxels.

• A spatial hierarchical prior that allow prior anatomical information isused to model the spatial dependence.

15 / 21

Figure : Left: Spatial MCMC. Right: Non-spatial EMVS

1. The spatial model removes false positives outside the brain and iscomparable to the nonlinear sophisticated DeTeCT-ING model.2. The spatial model further eliminates single isolated active voxels, andencourage grouping active voxels, comparing to the non-spatial models.

16 / 21

Summary

• C-EMVS fast detects activations at the voxel level.

• Bayesian modeling does not have multiple testing issues, for example,Bonferroni or FDR correction.

• Using complex-valued data detects more true positives (active voxels)and/or less false positives, especially when SNR is small, whilemagnitude data can be used if SNR is large.

• C-EMVS is based on linear model and does not use sophisticatedspatio-temporal or nonlinear models, but its detecting performance iscomparable to those models.

• The CV algorithm can be applied to any complex-valued data.

• Thank you! [email protected]

17 / 21

Complex Normal Distribution

• Z = X + iY ∼ CNn(µz ,Γ,C) iff(XY

)∼ N2n

((µx

µy

),Σ =

(ΣX ΣXY

ΣYX ΣY

)). With µz = 0,

Γ := E(ZZ̄′) = ΣX + ΣY + i(ΣXY − ΣYX ),

C := E(ZZ′) = ΣX − ΣY + i(ΣXY + ΣYX ).

• Γ: covariance matrix. C: relation matrix

• C = 0 iff ΣX = ΣY and ΣXY = −ΣYX , and Z is called proper orsecond order circular. Its real and imaginary parts have the samecovariance, and are uncorrelated.

18 / 21

19 / 21

Distribution of Magnitude and Phase

• Given a complex normal r.v. Z = X + iY(1) the amplitude R =

√X 2 + Y 2 follows (marginal) Rician

distribution,(2) the phase φ conditional on R, (φ | R), follows Tikhonovdistribution (ODonoughue and Moura (IEEE 2012)).

• The model yv = Xβv + εv , with βv = βvRe + iβv

Im is linear and easyto work with.

20 / 21

Work in process and future work

• Develop a corresponding approximate inference, say variationalinference that includes spatio-temporal structure and fast detectactivations.

• Find a way to adaptively determine the region shapes and sizes ofimages.

• Examine the effects of different shrinkage priors, for example,spike-slab lasso, horseshoe, etc.

• Build a more general model that includes connectivity issue.

• Study the noncircular complex normal behavior in detail.

21 / 21

Bayesian Modeling of Complex-valued fMRI Signals for Brain ... · Bayesian Modeling of...

Documents

Transcript of Bayesian Modeling of Complex-valued fMRI Signals for Brain ... · Bayesian Modeling of...