Optical wavelet matched filters for shift-invariant pattern recognition

3
February 15, 1993 / Vol. 18, No. 4 / OPTICS LETTERS Optical wavelet matched filters for shift-invariant pattern recognition Yunlong Sheng and Danny Roberge D6partement de Physique, Centre d'Optique, Photonique et Laser, Universit6Laval, Sainte-Foy GiK 7P4, Canada Harold Szu Information Processing Group,Naval Surface Warfare Center-White Oak, 19001 New Hampshire Avenue, Silver Spring, Maryland 20903-5000 Taiwei Lu Physical Optics Corporation,20600 Gramercy Place, Suite 103, Torrance, California 90501 Received October 1, 1992 We introduce optical wavelet matched filters that perform the wavelet transforms for edge enhancement and perform correlations between the wavelet coefficients for shift-invariant pattern recognition. These new bandpass matched filters show improved discrimination capability with respect to the conventional matched spatial filter and improved signal-to-noise ratio with respect to the phase-only matched filter. The wavelet transform (WT) is efficient for mul- tiresolution local analysis of signals. It has been successfully used for image compression.' This Let- ter describes a new application of the WT to optical pattern recognition. The WT can be used to detect speech, seismic, biomedical, and other signals by the so-called cocktail party effect. 23 Shapes, scales, or positions of those signals may not be exactly known. For these applications, the WT is more efficient than the Fourier spectrum analysis. When the signals to be detected are known, a priori, automatic tar- get recognition usually involves two major opera- tions, namely, preprocessing for feature extraction and noise suppression followed by correlation with a library of templates. A major difficulty for ap- plications of the orthogonal discrete WIT to pattern recognition is the lack of shift invariance. Optical real-time WIT based on the explicit WT correlation has been demonstrated. 3 - 5 The optical WITis inherently shift invariant. We use the WI' to enhance significant features of patterns and use correlations between the WIT coefficients for pattern recognition. Then we introduce the wavelet matched filter that performs the WIT and the correlation of the WIT coefficients in a single step. Preliminary results are shown to support the usefulness of this approach. The VVI is a correlation between a two-dimensional (2-D) signal f(x,y) and a set of the dilated wavelets h.(x,y): Wf(sXsyxy) = f(x,y) * h.(x,y). (1) The wavelets are generated from a mother wavelet h(x,y) by hs(xY)= y h(s s ) (2) s-S S xSy where (sr, sy) are the dilation factors. 6 ' 7 The wavelets must satisfy an admissible condition that h(x,y) os- cillates to have a zero mean. The wavelets should also have some regularity so that they are local in both space and frequency domains. For example, the Gaussian function is local in space and is in- definitely derivable. A derivative of any order n of the Gaussian function may be used as the wavelet. The Fourier transform of this wavelet is also lo- cal, indefinitely derivable, and has zero of order n about the zero frequency. The wavelet admissible condition is satisfied. The Mexican-hat wavelet is the second-order derivative of the Gaussian function h(x,y) = V 2 g(xy). As a result of the linearity of the correlation and the differentiation operations, we can write V 2 (g 8 * f)(x,y) = [V 2 g 8 (Xy)] * f (x,y) = Wf(sX, sy,x,y). (3) The WT is then equal to the second-order derivative of the smoothed image f * g,. The smoothing func- tion is the dilated Gaussian function gs(x,y). The WT coefficient Wf(sx, syxy) is zero when f * g, is constant or changes linearly but exhibits a positive and a negative peak in two sides of the edge. A zero crossing occurs at the point of inflection of f * g 8 . Thus the WT enhances edges of the image. Chang- ing the scales (sr,sy) of the wavelets may change the size and orientation of the smoothing function. This property of the WITis useful because irradiance in- tensity changes in a 2-D image can occur at different scales and directions. By choosing the scale factors, the WT may enhance the selected edges in a 2-D image, as we show in the following for the optical WT of the letter E. 0146-9592/93/040299-03$5.00/0 © 1993 Optical Society of America 299

Transcript of Optical wavelet matched filters for shift-invariant pattern recognition

Page 1: Optical wavelet matched filters for shift-invariant pattern recognition

February 15, 1993 / Vol. 18, No. 4 / OPTICS LETTERS

Optical wavelet matched filters for shift-invariantpattern recognition

Yunlong Sheng and Danny Roberge

D6partement de Physique, Centre d'Optique, Photonique et Laser, Universit6 Laval, Sainte-Foy GiK 7P4, Canada

Harold Szu

Information Processing Group, Naval Surface Warfare Center-White Oak, 19001 New Hampshire Avenue,Silver Spring, Maryland 20903-5000

Taiwei Lu

Physical Optics Corporation, 20600 Gramercy Place, Suite 103, Torrance, California 90501

Received October 1, 1992

We introduce optical wavelet matched filters that perform the wavelet transforms for edge enhancementand perform correlations between the wavelet coefficients for shift-invariant pattern recognition. These newbandpass matched filters show improved discrimination capability with respect to the conventional matchedspatial filter and improved signal-to-noise ratio with respect to the phase-only matched filter.

The wavelet transform (WT) is efficient for mul-tiresolution local analysis of signals. It has beensuccessfully used for image compression.' This Let-ter describes a new application of the WT to opticalpattern recognition. The WT can be used to detectspeech, seismic, biomedical, and other signals by theso-called cocktail party effect.23 Shapes, scales, orpositions of those signals may not be exactly known.For these applications, the WT is more efficient thanthe Fourier spectrum analysis. When the signalsto be detected are known, a priori, automatic tar-get recognition usually involves two major opera-tions, namely, preprocessing for feature extractionand noise suppression followed by correlation witha library of templates. A major difficulty for ap-plications of the orthogonal discrete WIT to patternrecognition is the lack of shift invariance.

Optical real-time WIT based on the explicit WTcorrelation has been demonstrated. 3-5 The opticalWIT is inherently shift invariant. We use the WI'to enhance significant features of patterns and usecorrelations between the WIT coefficients for patternrecognition. Then we introduce the wavelet matchedfilter that performs the WIT and the correlation of theWIT coefficients in a single step. Preliminary resultsare shown to support the usefulness of this approach.

The VVI is a correlation between a two-dimensional(2-D) signal f(x,y) and a set of the dilated waveletsh.(x,y):

Wf(sXsyxy) = f(x,y) * h.(x,y). (1)

The wavelets are generated from a mother waveleth(x,y) by

hs(xY)= y h(s s ) (2)s-S S xSy

where (sr, sy) are the dilation factors.6'7 The waveletsmust satisfy an admissible condition that h(x,y) os-cillates to have a zero mean. The wavelets shouldalso have some regularity so that they are local inboth space and frequency domains. For example,the Gaussian function is local in space and is in-definitely derivable. A derivative of any order n ofthe Gaussian function may be used as the wavelet.The Fourier transform of this wavelet is also lo-cal, indefinitely derivable, and has zero of order nabout the zero frequency. The wavelet admissiblecondition is satisfied. The Mexican-hat wavelet isthe second-order derivative of the Gaussian functionh(x,y) = V2g(xy). As a result of the linearity of thecorrelation and the differentiation operations, we canwrite

V2(g8 * f)(x,y) = [V2g8(Xy)] * f (x,y) = Wf(sX, sy,x,y).(3)

The WT is then equal to the second-order derivativeof the smoothed image f * g,. The smoothing func-tion is the dilated Gaussian function gs(x,y). TheWT coefficient Wf(sx, syxy) is zero when f * g, isconstant or changes linearly but exhibits a positiveand a negative peak in two sides of the edge. Azero crossing occurs at the point of inflection of f * g8.Thus the WT enhances edges of the image. Chang-ing the scales (sr, sy) of the wavelets may change thesize and orientation of the smoothing function. Thisproperty of the WIT is useful because irradiance in-tensity changes in a 2-D image can occur at differentscales and directions. By choosing the scale factors,the WT may enhance the selected edges in a 2-Dimage, as we show in the following for the opticalWT of the letter E.

0146-9592/93/040299-03$5.00/0 © 1993 Optical Society of America

299

Page 2: Optical wavelet matched filters for shift-invariant pattern recognition

300 OPTICS LETTERS / Vol. 18, No. 4 / February 15, 1993

Fig. 1. Optical wavelet transform. Clockwise from topleft: input image; output with the 1-D Mexican-hatwavelet with s, = 2; output with the wavelet of Eq. (6)with s = 2; output with the wavelet of Eq. (5) withsx = sy = 2.

'Y lWT filter| tsxy

valued and were made as photographic transmittancemasks. Optical WIT outputs obtained in a conven-tional optical correlator are shown in Fig. 1, whereenhancement of the selected edges can be easily seen.

Edges are the most significant features of images.The VWT coefficient of a 2-D image is the edge-enhanced pattern. Pattern recognition by corre-lations between the WI coefficients will havegood discrimination capability. For this purpose,we could holographically record the complex-valuedoptical WIT coefficients Wf(sXsy,x,y) of the librarypatterns f(x,y) as the conventional matched spatialfilters and use a second optical correlator to performthe correlations between Wf(sx,,sy,x,y) and the WITcoefficient Wt(s.,,syxy) for an unknown input t(x,y),as illustrated in Fig. 2(a). However, this approachwould involve two cascaded optical correlators. Thistwo-step correlation would result in more systemcomplexity and low light efficiency.

According to Eq. (4), the correlation Wf * Wt maybe expressed in the Fourier plane as

f J Wt(s., sy,x',y)Wf*(sx, sy,x' - x,y' - y)dx'dy'(a) -

t(x,y) Wavelet matched filters WfrWt_ F(uvj H(sxusyv)j2

(b)

Fig. 2. Two approaches to obtain correlations betweentwo wavelet transform coefficients for pattern recognition:(a) Two steps, (b) one step, where ftx,y) is known, a priori,and t(x,y) is a unknown input pattern.

In the Fourier plane the VVT is expressed as

= ff|T(u, v)H(sxu, syv)F*(u, v)H*(sxu, syv)

x exp[i2 iT(xu + yv)]dudv, (7)

where T(u, v) is the Fourier transform of t(x,y) andthe constants S"

2Sy

2 are omitted. Thus the correla-tion Wf * W, can be directly obtained with a filterequal to F(u, v)IH(sxu, syv)I2 , as described in Fig. 2(b).We call this filter the wavelet matched filters. It isthe product of the conventional matched filter F(u, v)and the square modulus IH(sxu, syv)12 of the WIT filter.

Wf(SX, sy,x,y) = s'sy |f F(u, v)H*(su, syv)

x exp[i2ir(xu + yv)]dudv, (4)

where F(u, v) is the Fourier transform of the signalf(x,y), and H(sxu, syv) is the Fourier transform of thedilated wavelets hs(x,y). Optical WT is based onEq. (4) with H(s.u,syv) as the WT filters.8

In an optical experiment we demonstrated the WIof a capital letter E. The image has step edgesmainly along the x and y directions. We chose,therefore, for the wavelet a one-dimensional (1-D)Mexican-hat function Vy2g(y) that enhanced thehorizontal edges of the image and a separable productof two Laplacians of two 1-D Gaussian functionsVX2g(x)Vy2g(y), expressed in the Fourier plane as

H(u, v) = sX2 u 2 exp(-27r 2sX2 u 2 )sy 2 v 2 exp(-2 7r2sY2 v 2 )

(5)

with sx =ssy, which enhanced the corners8 of theimage. We also used the Laplacian of the isotropic2-D Gaussian function with sx = sy = s, expressed inthe Fourier plane as

H(u,v) = s2(U2 + v2 )exp[-2ir2S 2 (U2 + v2)], (6)

which enhanced the edges of all directions. The WTfilters described by Eqs. (5) and (6) were all positive

PEE E

(a) (b)

(c) (d)Fig. 3. Optical wavelet matched filters. (a) Input, (b)output of the conventional matched filter, (c) output of thewavelet matched filter with the 1-D Mexican-hat waveletwith s, = 5, (d) output with the wavelet of Eq. (5) withsx = s, = 5.

Page 3: Optical wavelet matched filters for shift-invariant pattern recognition

February 15, 1993 / Vol. 18, No. 4 / OPTICS LETTERS

Fig. 4. Three-dimensional plots of the outputs of fourphase-only wavelet matched filters with the dyadicwavelets of Eq. (6) with s = 1/2 (left-hand corner), s = 1(top corner), s = 2 (bottom corner), and s = 4 (right-handcorner). The input was that of Fig. 3(a) degraded by anadditive random noise.

The wavelet matched filter is shift invariant. Whent =f, an autocorrelation peak of Wf(ssy, x,y) willappear at the location of the t(x,y).

In the experiment, the input scene contained threeletters E of different sizes and a letter P, as shown inFig. 3(a). The square moduli IH(sxu, syv)I2 of the WTfilters were made as the photographic transmittancemasks. The optical wavelet matched filter was asuperimposition of the mask IH(su, sv)J2 with a con-ventional computer-generated holographic matchedfilter F(u, v) for the letter E in the bottom right-hand corner of Fig. 3(a). The wavelets were a 1-DMexican-hat wavelet with sy = 5 that enhanced hor-izontal edges of the images and a separable productof two 1-D Mexican-hat wavelets described by Eq. (5)with s -sy = 5 that enhanced corners of the images.Figures 3(c) and 3(d) show the optical correlationoutputs of the two wavelet matched filters wherethe high cross-correlation peaks in the output of theconventional matched filter shown in Fig. 3(b) wereeliminated.

In comparison with high-pass filters, such as theGrad operator,9 '10 the wavelet matched filters arebandpass matched filters owing to localization of thewavelets in the frequency domain. The bandpass fil-ter has better output signal-to-noise ratio (SNR) thanthat of the high-pass filter. Moreover, the adaptivechoice for the wavelet function and its scale factorsallows enhancement of the selected features of theimage.

The phase-only filter (POF), described by(D(u,v) = F(u,v)/lF(u,uv), is also a high-pass fil-ter that has high light efficiency and good dis-crimination." The wavelet matched filter can alsobe generated with the POF as '(u,vu)IH(sxu,syv)I2,where the bandpass function IH(s.u, sV)I 2 may im-prove the output SNR. Mallat and Hwang6 recentlyproved that if the regularity of f(x,y) is Lipschitza at (x0,yo), then in a neighborhood of (xo,y0) themodulus of the WVT, IWf(s,x,y)I, will decay as O(sa)when the scale s tends to zero, on the condition

that the wavelets have n vanishing moments for0 < a <n. Thus,ifthef(x,y)isdiscontinuousandisLipschitz a - 0 at the edges, the IWf(s,x,y)A remainsapproximately constant over a large range of scales.White noise has negative Lipschitz exponent every-where, and its WT modulus IWn(s,x,y)j will increaseproportionally to s-' when the scale s tends to zero.Mallat eliminated the noise by an ad hoc algorithm,removing maxima of the WT modulus whose ampli-tudes increase when the scale decreases. For a noisyinput t + n, the wavelet matched filter will generatethe output (W, + Wj * Wf, where the output noiseWn is correlated and smoothed with Wf. The outputnoise Wn * Wf still increases when the scale s tendsto zero, as shown in the following experiment.

Figure 4 shows the correlation outputs obtained bycomputer simulation. The wavelet matched filterswere combinations of the POF, CD (u, v), of the letterE in the bottom right-hand corner in Fig. 3(a), withthe dyadic wavelets described by Eq. (6) with s =1/2, 1, 2, 4. The input was that shown in Fig. 3(a),degraded by an additive random noise with the SNRequal to 2. The output SNR was equal to 21 for theconventional POF and increased from 28 to 45 and63 for the wavelet matched filters with s = 1/2, 1, 2,respectively, but decreased to 26 with s = 4. Whenthe scale of the wavelet was too large, the waveletmatched filter lost its discrimination capability andhigh cross-correlation peaks appeared, as shown inFig. 4. These results suggest a possible optimizationof the wavelet scale factors.

In summary, we have introduced wavelet matchedfilters that perform in a single step both the VVTfor edge enhancement and correlation between theWT coefficients for optical shift-invariant patternrecognition. The bandpass wavelet matched filtershave shown improved discrimination capability com-pared with the conventional matched spatial filterand higher output SNR compared with the phase-only filters.

References

1. S. G. Mallat, IEEE Trans. Acoust. Speech SignalProcess. 37, 2091 (1989).

2. R. Kronland-Martinet, J. Morlet, and A. Grossmann,Int. J. Pattern Recogn. Artif. Intell. 1, 273 (1987).

3. H. Szu, Y. Sheng, and J. Chen, Appl. Opt. 31, 3267(1992).

4. E. Freysz, B. Pouligny, F. Argoul, and A. Arneodo,Phys. Rev. Lett. 64, 745 (1990).

5. Y. Zhang, Y. Li, E. G. Kanterakis, A. Katz, X. J. Lu, R.Tolimieri, and N. P. Caviris, Opt. Lett. 17, 210 (1992).

6. S. G. Mallat and W. L. Hwang, IEEE Trans. Inf.Theory 38, 617 (1992).

7. B. Telfer and H. Szu, Opt. Eng. 31, 1830 (1992).8. Y. Sheng, T. Lu, and D. Roberge, Opt. Eng. 31, 1859

(1992).9. S. Lowenthal and Y. Belvaux, C. R. Acad. Sci. Ser. B

262, 413 (1966).10. A. W. Lohmman and D. P. Paris, Appl. Opt. 7, 651

(1968).11. J. L. Horner and P. D. Gianino, Appl. Opt. 23, 812

(1984).

301