Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM...

50
University of British Columbia SLIM Ben Bougher, August 15th 2016 Machine learning applications to geophysical data analysis

Transcript of Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM...

Page 1: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

University  of  British  ColumbiaSLIM

Ben  Bougher,  August  15th  2016

Machine learning applications to geophysical data analysis

Page 2: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

ContributionsPredicting  stratigraphic  units  from  well  logs  using  supervised  learning  • Novel  use  of  the  scattering  transform  as  a  feature  representation  of  well  logs.  

• CSEG  2015  expanded  abstract,  honorable  mention  for  best  student  talk.  

• Published  article  in  the  CSEG  Recorder  (Jan  2015).  Reflection  seismology  as  an  unsupervised  learning  problem  • Generalized  and  automated  a  hydrocarbon  exploration  analysis  workflow.  

• Reservoir  discovery  as  convex  optimization.  • Accepted  abstract,  to  be  presented  @SEG  (Houston,  2016)

2

Page 3: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Reflection seismology as an unsupervised learning problem

3

Approaches: Physics  driven  (conventional)  Data  driven  (thesis  contributions)  

Motivation: Inability  to  discover  hydrocarbons  directly  from  seismic  data.  

Problem: Automatically  segment  potential  hydrocarbon  reserves  from  seismic  images.  

Page 4: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Scattering physics

4

Ubiquitous  in  experimental  physics.  Measure  the  scattering  pattern  from  a  known  source  incident  on  a  material.  Performed  in  highly  controlled  and  calibrated  laboratories  (laser  sources,  temperature  controlled,  vacuums,  etc...).  Reflection  seismology  is  a  scattering  experiment  in  an  uncontrolled  environment.  

Page 5: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Seismic experiment

5

Page 6: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Seismic experiment

5

Page 7: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

receiver position

time

x

dept

h

Migration

6

Migration  maps    shot  records  of  reflections  recorded  at  the  surface  into    images  of  the  subsurface.

Shot  Record Migrated  Image

Page 8: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Migration

7

Page 9: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Migration

7

Page 10: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Angle domain common image gathers

8

θProblem:  Need  angle  dependent  reflectivity  responses  

Solution:  Angle  domain  common  image  gather  migration

x1

x

z1

dept

h

refle

ctiv

ity

Earth  Model R(𝜃)  at  (x1,z1)

Page 11: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Scattering theory (Zoeppritz)

9

θ1θ1

θ2

φ1

φ2

Vp1, Vs1, ρ1

Vp2, Vs2, ρ2

Rpp Rps

Tpp

Tps

2

664

RPP

RPS

TPP

TPS

3

775 =

2

6664

� sin ✓1 � cos�1 sin ✓2 cos�2

cos ✓1 � sin�1 cos ✓2 � sin�2

sin 2✓

VP1VS1

cos2�1⇢2VS22VP1

⇢1V 2S1VP 2

cos 2✓1⇢2VS2VP1

⇢1V 2S1

cos 2�2

�cos�2VS1VP1

sin 2�1⇢2VP2

⇢1VP1

⇢2VS2

⇢1VP1sin 2�2

3

7775

�1 2

664

sin ✓1

cos ✓1

sin 2✓1

cos 2�1

3

775

Problem: Relate angle dependent reflectivity to rock physics.

Assumption: Ray theory approximation.

Solution:

Problem: Non-linear, not useful for inversion.

R(✓) / VP , VS , ⇢

Page 12: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Scattering theory (Shuey)

10

Limitations: Small  perturbations  over  a  background  trend  valid  <  30  degrees  

Benefits:  linear  for  i  and  g  invert  using  simple  least  squares  

0 10 20 30 40 50 60 70 80 90

theta [deg]

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

Rpp

Zoeppritz2-term Shuey

Rpp(✓) = i(�VP ,�⇢) + g(�VP ,�VS ,�⇢) sin2 ✓

R(𝜃)  at  (x1,z1)

Shuey  approximation

Page 13: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Shuey term inversion as a projection

11

Rpp

Rpp(✓) at x1, z1

ADCIG at x1

θ

ADCIGs

Image  for  every  scattering  angle.  An  angle  gather  for  every  slice.  A  reflectivity  curve  for  every  point.

Page 14: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Shuey term inversion as a projection

11

Rpp

Rpp(✓) at x1, z1

Fit  the  reflectivity  curve  using  a  linear  combination  of  the  Shuey  vectors.  Each  reflectivity  curve  is  projected  down  to  two  coefficients.

Page 15: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Shuey term inversion as a projection

11

i g

i,g at x1

i

g

Reduced  the  dimensionality  of  the  angle  gathers  to  two  coefficients.  Projection  coefficients  can  be  plotted  to  analyze  the  multivariate  relationships.

Page 16: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Relation to hydrocarbons

12

�0.4 �0.2 0.0 0.2 0.4

I

�0.4

�0.2

0.0

0.2

0.4

G

mudrock linesand - shalegas sand - shale

g =i

1 + k

h1� 4

hVSihVP i

⇣ 2

m+ k

hVSihVP i

⌘i

*m,  c,  k  are  geological  parameters  determined    empirically  from  well  logs/laboratory  measurements

Hydrocarbon  reserves  are  found  from  outliers  of  a  crossplot!

Brine-­‐saturated  sands  and  shales  follow  a    mudrock  line:  

Hydrocarbon  saturated  sands  deviate  from  this  trend.

Page 17: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Reality bites

13

5 10 15 20 25 30✓ [deg]

�1.6

�1.4

�1.2

�1.0

�0.8

�0.6

�0.4

�0.2

Rpp

⇥104

0 5 10 15 20 25 30 35 40✓ [deg]

0.00

0.05

0.10

0.15

0.20

0.25

Rpp

0 5 10 15 20 25 30✓ [deg]

0

1

2

3

4

5

6

Rpp

0 5 10 15 20 25 30✓ [deg]

0.0

0.1

0.2

0.3

0.4

0.5I

G

Problem:    Shuey  components  can’t  explain  the  features  in  real  data.

Solution:    Use  unsupervised  machine  learning  to  find  better  projections.

BG  dataset Penobscot  dataset Mobil  Viking  dataset

Page 18: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Unsupervised learning problem

14

X 2 Rn⇥d

n  is  number  of  samples  in  the  image,    d  is  the  number  of  angles

Page 19: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Principle component analysis (PCA)

Eigendecomposition  of  the  covariance  matrix:  

Project  onto  the  eigenvectors  with  the  two  largest  eigenvalues.  Maximizes  the  variance  (a  measure  of  information).

15

C = XTX =1

n

nX

i

xixTi

Page 20: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

DEMO

16

http://ec2-­‐54-­‐224-­‐182-­‐64.compute-­‐1.amazonaws.com/#/pca

Page 21: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Marmousi II Earth model

17

0 1000 2000 3000 4000x [m]

0

500

1000

1500

2000

z[m

]

vp

12001400160018002000220024002600

0 1000 2000 3000 4000x [m]

0

500

1000

1500

2000z

[m]

vs

0

150

300

450

600

750

900

1050

0 1000 2000 3000 4000x [m]

0

500

1000

1500

2000

z[m

]

rho

105012001350150016501800195021002250

0 1000 2000 3000 4000x[m]

0

500

1000

1500

2000

z[m

]

zero-o↵set rpp

�0.32�0.24�0.16�0.080.000.080.160.240.32

Specifically  made  for  testing  amplitude  vs.  offset  analysis  

Contains  gas-­‐saturated  sand  embedded  in  shales  and    brine-­‐sands.  

Page 22: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Seismic modeling

18

0 500 1000 1500 2000 2500 3000 3500 4000x [m]

0

500

1000

1500

2000

z[m

]

0 5 10 15 20 25 30✓ [deg]

500 1000 1500 2000 2500 3000 3500 4000 4500 5000x [m]

0

500

1000

1500

2000

2500

z[m

]

�5 0 5 10 15 20 25 30✓ [deg]

Physically  consistent  with  the  Zoeppritz  equations

Migrated  visco-­‐acoustic  survey

True  reflectivity Migrated  seismic

Page 23: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

DEMO

19

http://ec2-­‐54-­‐224-­‐182-­‐64.compute-­‐1.amazonaws.com/#/avo

Page 24: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Kernel PCA

Problem:  Find  a  non-­‐linear  projection  that  provides  better  discrimination  of  trends  and  outliers.  

Solution:  Use  the  “kernel-­‐trick”  to  compute  PCA  in  a  high-­‐dimensional  non-­‐linear  feature  space  

20

Page 25: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Kernel trickPCA  can  be  calculated  from  the  Gramian  inner  product  matrix:  

Replace                          with  a  kernel    

Example  c=2,  b=1

21

hxi,xji (xi,xj)

�(x) = [1,p2x1,

p2x2, x

21, x

22,p2x1x2]

XXT =

0

BBB@

hx1,x1i hx1,x2i . . . hx1,xnihx2,x1i hx2,x2i . . . hx2,xni

......

. . ....

hxn,x1i hxn,x2i . . . hxn,xni

1

CCCA

(xi,xj) = (xTi xj + b)c = h�(xi),�(xj)i

Page 26: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

DEMO

22

http://ec2-­‐54-­‐224-­‐182-­‐64.compute-­‐1.amazonaws.com/#/avo

Page 27: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

RecapSituation:  Exploring  seismic  data  for  anomalous  responses.  Problem:  Physical  model  can  not  explain  real  world  data.  Solution:  Learn  useful  projections  directly  from  the  data.  Assessment:  

• PCA  is  equivalent  for  physically  consistent  data,  but  more  robust  to  processing/acquisition  artifacts.  

• Kernel  PCA  makes  outliers  linearly  separable  from  the  background.  

Next:  Automatic  segmentation  (clustering)  

23

Page 28: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Each  point  is  initially  considered  a  cluster.  Each  iteration  merges  the  closest  clusters  into  a  larger  cluster.  Builds  a  hierarchy  until  a  defined  number  of  classes  is  reached.

Hierarchical clustering

24

0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9�0.1

0.0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0

1

2

3 4

5

6

7

8

9

5 3 1 2 47 96 80

A

BB

C

Unclassified  data Classification  hierarchy

Page 29: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Results - PCA

25

Clustered  physically  consistent  data Clustered  migrated  data

Page 30: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Results - Kernel PCA

26

Clustered  physically  consistent  data Clustered  migrated  data

Page 31: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Summary

Successes:  Each  projection  could  segment  the  reservoir.  Kernel  PCA  provided  advantageous  multivariate  geometries  (linearly  separable).  

Challenges:  Clustering  is  highly  sensitive  to  user  chosen  parameters.  Kernel  PCA  is  computationally  expensive  and  lacks  interpretation.  

27

Page 32: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Robust PCA

Problem:  Find  a  sparse  set  of  outlying  reflectivity  responses  against  a  background  trend.  Assumption:  The  background  trend  of  similar  curves  is  highly  redundant,  which  forms  a  low  rank  matrix.  Solution:  

28

minL,S

kLk⇤ + �kSk1,1 s.t. L+ S = X

Page 33: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Results - physically consistent data

29

0 1000 2000 3000 4000x [m]

0

500

1000

1500

z[m

]

Page 34: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Results - migrated seismic

30

Page 35: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Summary

Successes:  Segmentation  of  reservoir  in  both  images.  Physically  interpretable  segmentation  without  clustering.  

Challenges:  Requires  tuning  of  one  optimization  trade  off  parameter.  Convergence  sensitive  to  the  rank  of  outliers,  not  well  understood.

31

Page 36: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Comparison on field data

32

Data  provided  by  BG  group.  Interpreted  to  contain  a  potential  gas  reserve.  

Compare  unsupervised  methods  to  BG  group’s    approach:  Dynamic  Intercept  Gradient  Inversion  (DIGI).  

Note:  Clustering  approaches  were  not  directly  useful.

Migrated  angle  gathers

potential  gas  target

Page 37: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

DIGI-inverse problem

33

2

4d0

0

3

5=

2

4W W sin

2 ✓�r �r

W (✓me) cos(�me) W (✓me) sin(�me)

3

5ig

d(x, t, ✓) = w(x, t, ✓) ⇤ r(x, t, ✓)Convolution  forward  model:•ill-­‐posed,                term  forces  a  smooth  answer�r

Further  augmented  by  extended  elastic  reflectivity  (EER)  term:EER(�me) = i cos(�me) + g sin(�me)

•  promotes  correlation  between  i  and  g  •                    is  related  a  priori  geological  information

System  is  solved  using  the  conjugate  gradient  based  algorithm  LSQR.

�me

Page 38: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

PCA extended DIGI

34

2

4d0

0

3

5=

2

4Wc1 Wc2�r �r

W (✓me) cos(�me) W (✓me) sin(�me)

3

5ig

5 10 15 20 25 30✓ [deg]

�0.4

�0.2

0.0

0.2

0.4

0.6

0.8Component 1

Component 2

Same  inverse  problem,  but  use  the  two  largest  principle  components  instead  of  the  Shuey  terms.

2

4d0

0

3

5=

2

4W W sin

2 ✓�r �r

W (✓me) cos(�me) W (✓me) sin(�me)

3

5ig

Page 39: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Minimum energy projection

forms  an  image  where  large  values  correspond  to  uncorrelated  i  and  g  terms.  

Thresholding  this  image  will  therefore  segment  outliers.

35

EER(�me) = i cos(�me) + g sin(�me)

Page 40: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Comparison method

Manually  threshold  the  image  to  segment  the  potential  hydrocarbon  reserve  while  maintaining  the  least  amount  of  spurious  segmentation.  

Compare:  • Robust  PCA  • DIGI  • PCA  extended

36

Page 41: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Results - robust PCA

37

0 100 200 300xl

0

200

400

sam

ples

Page 42: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Results - DIGI

38

Page 43: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Results - PCA extend DIGI

39

Page 44: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Results - EER using PCA extended DIGI

40

Page 45: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Results - EER using DIGI

41

Page 46: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Why the difference?

42

5 10 15 20 25 30✓ [deg]

�0.4

�0.2

0.0

0.2

0.4

0.6

0.8Component 1

Component 2

Principle  components Shuey  terms

Principle  components  can  explain  more  features  in  the  data.

Page 47: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Summary

43

• Robust  PCA  provided  the  best  image  segmentation.  • PCA  extended  DIGI  better  separated  the  potential  reservoir  from  the  background  trend.  

• The  extracted  principle  components  showed  significantly  different  shapes  than  the  Shuey  vectors.

Page 48: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Epilogue

Outcome:  Generalized  a  conventional  analysis  approach  using  unsupervised  learning  models.  Successful  in  segmented  potential  hydrocarbon  reserves  from  seismic  data  Future:  More  data,  standardized  datasets  Quantitative  benchmarks

44

Page 49: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

Thanks

45

Page 50: Machine learning applications to geophysical data analysis · Universityof*British*Columbia SLIM BenBougher,August15th2016 Machine learning applications to geophysical data analysis

References[1]Zoeppritz, K., 1919, VII b. Über Reflexion und Durchgang seismischer Wellen durch Unstetigkeitsflächen: Nachrichten von der Gesellschaft der Wissenschaften zu Göttingen, Mathematisch-Physikalische Klasse, 1919, 66–84 [2]Shuey, R. T., 1985, A simplification of the Zoeppritz equations: Geophysics, 50, 609–614. [3]Gardner, G. H. F., L. W. Gardner, and A. R. Gregory, 1974, Formation velocity and density; the diagnostic basics for stratigraphic traps: Geophysics, 39, 770–780. [4]Castagna, J. P., M. L. Batzle, and R. L. Eastwood, 1985, Relationships between compressional-wave and shear-wave velocities in clastic silicate rocks: Geophysics, 50, 571–581. [5]Castagna, J. P., H. W. Swan, and D. J. Foster, 1998, Framework for AVO gradient and intercept interpretation: Geophysics, 63, 948–956. [6]Edgar, J., and J. Selvage, 2013, Dynamic Intercept-gradient Inversion

46