arXiv:1704.02446v1 [cs.CV] 8 Apr 2017Seismic facies recognition based on prestack data using deep...

Seismic facies recognition based on prestack data using deep convolutional autoencoderFeng Qian1, Miao Yin2, Ming-Jun Su3, Yaojun Wang1, Guangmin Hu1

1Center of Information Geoscience, University of Electronic Science and Technology of China,2School of Resource and Environment, University of Electronic Science and Technology of China,3PetroChina Research Institute of Petroleum Exploration and Development (RIPED) - Northwest, Lanzhou, China

SUMMARY

Prestack seismic data carries much useful information that canhelp us find more complex atypical reservoirs. Therefore, weare increasingly inclined to use prestack seismic data for seis-mic facies recognition. However, due to the inclusion of ex-cessive redundancy, effective feature extraction from prestackseismic data becomes critical. In this paper, we consider seis-mic facies recognition based on prestack data as an image clus-tering problem in computer vision (CV) by thinking of eachprestack seismic gather as a picture. We propose a convo-lutional autoencoder (CAE) network for deep feature learn-ing from prestack seismic data, which is more effective thanprincipal component analysis (PCA) in redundancy removingand valid information extraction. Then, using conventionalclassification or clustering techniques (e.g. K-means or self-organizing maps) on the extracted features, we can achieveseismic facies recognition. We applied our method to theprestack data from physical model and LZB region. The re-sult shows that our approach is superior to the conventionals.

INTRODUCTION

Seismic facies recognition is an auxiliary means that we usemachine learning to generate facies map which reveals thedetails of the underlying geological features. With the im-provement of computer’s computing ability, pattern recogni-tion techniques can be used to handle larger seismic data andget more accurate results. As a result, seismic facies recogni-tion becomes increasingly significant.

So far, almost all supervised and unsupervised classificationalgorithms including K-means, self-organizing maps (SOM),generative topographic mapping (GTM), support vector ma-chines (SVM) and artificial neural networks (ANN) etc. havebeen successfully used in seismic facies recognition. In fact,the most important parameter in the analysis is not classifica-tion method but the correct input attributes (Zhao et al., 2015).It means that effective feature extraction from seismic datais the key to seismic facies recognition. Balz et al. (2000)developed a methodology to generate a so-called AVO-tracefrom prestack seismic data which can be classified by con-ventional clustering techniques. Coleou et al. (2003) extractedfeatures via constructing vectors of dip magnitude, coherenceand reflector parallelism. de Matos et al. (2006) used wavelettransforms to identify seismic trace singularities in each ge-ologically oriented segment. Gao (2007) applied gray levelco-occurrence matrix (GLCM) to generate facies map usingSOM. Roy et al. (2013) proposed an SOM classification work-flow based on multiple seismic attributes. Of the seismic facies

recognition methods, only a few extract features from prestackseismic data.

Obviously, prestack seismic data contains richer information,with which we can obtain more details of atypical reservoirsin facies map. Nevertheless, there is also a large amount of re-dundancy in the prestack data that may effect the performanceof seismic facies recognition negatively. Principal componentsanalysis (PCA) and independent components analysis (ICA)are commonly used to reduce redundancy of attributes to low-dimensional independent meta features (Gao, 2007). Theseapproaches just map the original prestack seismic data to asubspace in view of mathematics. However, our motivationis to develop a model, which can learn profound features fromseismic data automatically.

In order to construct the model, referring to the latest technolo-gies in the fields of artificial intelligence (AI) and computervision (CV), we applied convolutional neural network (CNN)in seismic facies recognition. Convolutional neural network isone framework of deep learning which has been used effec-tively in many machine learning tasks. With the excellence offeature learning, CNN has achieved particular success in thedomain of image classification (Krizhevsky et al., 2012), ac-tion recognition (Ji et al., 2013), video classification (Karpathyet al., 2014), speech recognition (Abdel-Hamid et al., 2014)and face recognition (Farfade et al., 2015). Considering theseismic prestack data as images, we convert seismic faciesrecognition to image classification problem resolved by usingCNN to extract features.

In this paper, we develop a convolutional autoencoder (CAE)network with multiple layers. In each layer, higher-level fea-ture maps are generated by convolving features of the lower-level layer with converlutional filters which are trained by min-imizing a loss function (explained in the next section). Firstly,we input the seismic prestack data to train the network and ob-tain extracted features. Then we utilize unsupervised patternrecognition techniques to generate the facies map. We havecompared the facies maps depicted by our method and othertwo methods (PCA and using postack data) with two types ofprestack seismic data. The result demonstrates the superiorityof our approach.

METHOD

The workflow of our method mainly consists of the followingfour steps:(1) Select a proper window of data which contains at least onecrest and one trough per trace in prestack seismic gathers alongan interpreted horizon;(2) Input these two-dimensional data to the convolutional au-

arX

iv:1

704.

0244

6v1

[cs

.CV

] 8

Apr

201

7

Seismic facies recognition based on prestack data using deep convolutional autoencoder

Patch representation

(encoding)

Reconstruction

(decoding)

A window of prestack seismic data

(input)

Feature maps

Reconstructed prestack seismic data

(output)

Convolution Max pooling Unpooling Deconvolution

Figure 1: Architecture of convolutional neural network we proposed to learn deep features from prestack seismic data.

toencoder network and train it;(3) Re-input these two-dimensional data to the trained net-work, obtain features extracted by it;(4) Use unsupervised pattern recognition algorithm to clusterthese extracted features and generate facies map.

Convolutional Neural Network For Feature Learning

An overview of our convolutional neural network is given byFigure 1. The input X is a window of prestack seismic data,which can be considered as an image. Our goal is to learnsome features that can best represent X. That means we wish tofind a mapping Φ, which makes the reconstruction Z as similaras possible to X. Feature learning from seismic prestack datainvolves the following three aspects.

Patch representation. This operation extracts patches fromthe seismic prestack window data X and represents each patchas a high-dimensional tensor. These tensors comprise a set offeature maps, of which the number is equal to the dimension-ality of the tensors. At the beginning the input data is con-volved, and then a max pooling step is used to shrink the con-volutional layers — taking the maximum of each 2×2 matrixwith a strides of 2 along both width and height. Max poolingreduces the spatial size of the representation and hence to alsocontrol overfitting. Formally, for a window of prestack data asan input X, the latent representation of the kth feature map isgiven by

Φk(X) = σ (Ψ(X∗Wk +bbbk)) , (1)

where W represents the filter, Ψ donates max pooling, bbb is thebias which is broadcasted to the whole map. σ is an activationfunction that is considered as a non-linear mapping (we useleaky ReLu function), and ∗ denotes the 2D convolution. Here

W is of a size 1×n×n×k, where 1 is the number of channelsin the input (for prestack seicmic data, the value of is always1), n is the spatial size of a filter, and k is the number of filters.Intuitively, W applies k convolutions on the input, and eachconvolution has a kernel size n× n. The output is composedof k feature maps. bbb is an k-dimensional vector, whose eachelement is associated with a filter. The convolution of an m×mmatrix with an n× n matrix may in fact result in an (m+ n−1)× (m+ n− 1) matrix (full convolution) or in an (m− n+1)× (m−n+1) (valid convolution).

Reconstruction. In the network, the back two layers are sym-metrical with the front two layers, which are called unpoolinglayer and deconvolution layer. The purpose of this design is toreconstruct the unlabeled input. The reconstruction is obtainedusing

Z = σ

(∑k∈H

Ψ(Yk)∗Wk +ccck

), (2)

where again there is one bias ccc per input channel. H identifiesthe group of latent feature maps, W identifies the flip operationover both dimensions of the weights (Masci et al., 2011), Ψ

donates unpooling operation. Generally, max pooling is non-revertible, so instead we formulate a approximate solution thatcan be used in this case. When reconstructing, a single indi-vidual unpooling location is picked randomly and the pooledvalue is placed there, setting the other unpooling location tozero. This is a good approximation for the max unpooling pro-cess, although it may results in some noise in the reconstructedmaps.

Training. We use denoising autoencoding to train the network.The input X is transformed to some new value Xc by randomlysetting values zero with a very small probability and the net-work is trained to reconstruct the original input. Training is


(a) (b) (c)

Figure 2: (a) The original waveform of a window of prestack seismic data. (b) The reconstructed prestack seismic data. (c) Theextracted features using the convolutional autoencoder network.

done individually layer by layer and posed as an optimizationproblem. We find weights W and biases bbb such that the fol-lowing loss function is minimized. We use mean squared error(MSE) as the loss function:

L =1

2n

n∑i=1

(Xi−Zi)2. (3)

The weights W and the biases bbb are initialized to small ran-dom values found using the fan-in and fan-out criteria (Hinton,2012). As for standard networks the back-propagation algo-rithm is applied to compute the gradient of the error functionwith respect to the parameters.

We make use of automatic derivatives calculated by Tensor-Flow with GPUs (Abadi et al., 2016) and iteratively input eachdata taken from prestack seismic gathers while updating thefilters in the direction that minimizes the loss function. Anoriginal input and the reconstruction are shown in Figure 2-aand 2-b. Figure 2-c gives the extracted features from the gather.

Unsupervised Pattern Recognition

This is the final stage of our method for generating facies map.The goal is to cluster the one-dimensional vectors, which isderived from the middle feature maps of the convolutional au-toencoder network. Clustering can be accomplished by severalunsupervised strategies such as K-means clustering, fuzzy c-means, SOM, etc. Without losing generality, we use K-meansalgorithm to generate facies map. It is based on minimizingthe following function

Jm =

N∑i=1

c∑j=1

Umi j∥∥xi−C j

∥∥2, (4)

where m is any real number greater than one, Ui j is the de-gree of membership of Xi in the cluster j, Xi is the ith of d-dimensional measured data, and C j is the d-dimension centerof cluster. The cluster centroids are calculated by using thefollowing equation:

C j =

∑Ni=1 um

i jXi∑Ni=1 um

i j

. (5)

EXAMPLE

To verify the performance of our method, we applied it to twokinds of prestack data. One is from artificial physical model,another is from LZB region which is a real work area. We se-lected a interpreted horizon includes known fractures and tooka window of 48ms along the horizon to obtain data matrices.The learning rate is assigned to 0.02. We set the number oflayers of the feature maps to 10, which are obtained by theconvolutional filters of size 3×3×10. On the other hand, wealso used poststack data and PCA (retain 90% of the principalcomponent) based on prestack data to generate facies maps, towhich we compared the result derived by the proposed method.All the results are shown in Figure 3 and Figure 4.

Physical Model

As shown in Figure 3-a, the physical model consists of threewater tanks and many caves. In Figure 3-b, we can clearlysee the smallest caves and the river (marked with red circle).However, the small caves are not presented in Figure 3-c. Theriver in Figure 3-d, is also not as clear as in Figure 3-b.

LZB Region

The facies maps of the three methods are shown in Figure 4.Similarly, Figure 4-a is generated by our method, of which theeffect is better than the other two.

CONCLUSION

We have presented a novel deep learning approach for seismicfacies recognition based on prestack data. We show that withthe representation and reconstruction architecture of convolu-tional neural network and layer-by-layer unsupervised trainingstrategy, reliable features can be learned directly from prestack


(a)

(b)

(c)

(d)

Figure 3: Physical model and its facies maps. (a) The proto-type of physical model. (b) The result using deep CAE basedon prestack data. (c) The result using poststack data. (d) Theresult using PCA based on prestack data.

(a)

(b)

(c)

Figure 4: Facies maps of LZB region. (a) The result using deepCAE based on prestack data. (b) The result using poststackdata. (c) The result using PCA based on prestack data.

seismic data. With the learned features we achieved superiorperformance compared to the current method which is basedon poststack data or superficial features learned from prestackdata. In future work, we will focus on discovering how the pa-rameters, such as the number of levels, the number of hiddenunits, and the sparsity of active hidden units of the convolu-tional autoencoders affect the performance.

ACKNOWLEDGMENTS

This work was supported by the Natural Science Foundationof China (No. U1562218). We thank the CNPC Key Labo-ratory of Geophysical Prospecting under China University ofPetroleum (Beijing) for their cooperation in providing data andsupport.


REFERENCES

Abadi, M., A. Agarwal, P. Barham, E. Brevdo, Z. Chen, C. Citro, G. S. Corrado, A. Davis, J. Dean, M. Devin, et al., 2016,Tensorflow: Large-scale machine learning on heterogeneous distributed systems: arXiv preprint arXiv:1603.04467.

Abdel-Hamid, O., A.-r. Mohamed, H. Jiang, L. Deng, G. Penn, and D. Yu, 2014, Convolutional neural networks for speech recog-nition: IEEE/ACM Transactions on audio, speech, and language processing, 22, 1533–1545.

Balz, O., F. Pivot, and D. Badolato, 2000, Fast identification of avo-anomalies using classification of prestack waveforms, in SEGTechnical Program Expanded Abstracts 2000: Society of Exploration Geophysicists, 106–109.

Coleou, T., M. Poupon, and K. Azbel, 2003, Unsupervised seismic facies classification: A review and comparison of techniquesand implementation: The Leading Edge, 22, 942–953.

de Matos, M. C., P. L. Osorio, and P. R. Johann, 2006, Unsupervised seismic facies analysis using wavelet transform and self-organizing maps: Geophysics, 72, P9–P21.

Farfade, S. S., M. J. Saberian, and L.-J. Li, 2015, Multi-view face detection using deep convolutional neural networks: Proceedingsof the 5th ACM on International Conference on Multimedia Retrieval, ACM, 643–650.

Gao, D., 2007, Application of three-dimensional seismic texture analysis with special reference to deep-marine facies discriminationand interpretation: Offshore angola, west africa: AAPG bulletin, 91, 1665–1683.

Hinton, G. E., 2012, A practical guide to training restricted boltzmann machines, in Neural networks: Tricks of the trade: Springer,599–619.

Ji, S., W. Xu, M. Yang, and K. Yu, 2013, 3d convolutional neural networks for human action recognition: IEEE transactions onpattern analysis and machine intelligence, 35, 221–231.

Karpathy, A., G. Toderici, S. Shetty, T. Leung, R. Sukthankar, and L. Fei-Fei, 2014, Large-scale video classification with convolu-tional neural networks: Proceedings of the IEEE conference on Computer Vision and Pattern Recognition, 1725–1732.

Krizhevsky, A., I. Sutskever, and G. E. Hinton, 2012, Imagenet classification with deep convolutional neural networks: Advancesin neural information processing systems, 1097–1105.

Masci, J., U. Meier, D. Ciresan, and J. Schmidhuber, 2011, Stacked convolutional auto-encoders for hierarchical feature extraction:International Conference on Artificial Neural Networks, Springer, 52–59.

Roy, A., B. L. Dowdell, and K. J. Marfurt, 2013, Characterizing a mississippian tripolitic chert reservoir using 3d unsupervised andsupervised multiattribute seismic facies analysis: An example from osage county, oklahoma: Interpretation, 1, SB109–SB124.

Zhao, T., V. Jayaram, A. Roy, and K. J. Marfurt, 2015, A comparison of classification techniques for seismic facies recognition:Interpretation, 3, SAE29–SAE58.

http://arxiv.org/abs/1603.04467

arXiv:1704.02446v1 [cs.CV] 8 Apr 2017Seismic facies recognition based on prestack data using deep...

Documents

Transcript of arXiv:1704.02446v1 [cs.CV] 8 Apr 2017Seismic facies recognition based on prestack data using deep...