[IEEE 2013 20th IEEE International Conference on Image Processing (ICIP) - Melbourne, Australia...

4
Content-Based Image Retrieval with Ordered Dither Block Truncation Coding Features Jing-Ming Guo, Senior Member, IEEE, and Heri Prasetyo Department of Electrical Engineering National Taiwan University of Science and Technology Taipei, Taiwan E-mail: [email protected], [email protected] ABSTRACT This paper presents a technique for Content-Based Image Retrieval (CBIR) by exploiting the low complexity advantage of the Ordered-Dither Block Truncation Coding (ODBTC) for generating image content descriptors. The two image features, namely Color Co-occurrence Feature (CCF) and Bit Pattern Features (BPF), are generated from ODBTC encoded data streams (without really performing an image compression or decoding process) to measure the similarity between two images. Experimental results show that the proposed method is superior to the Block Truncation Coding (BTC) image retrieval system and other former methods, and prove that the ODBTC scheme is not only suited for image compression for its simplicity, but also offers a conveniently way for image indexing in the content-based image retrieval system. Index Terms — Ordered Dither Block Truncation Coding, Content-Based Image Retrieval, Color Co-occurrence Feature, Bit Pattern Feature 1. INTRODUCTION Block Truncation Coding (BTC) is a simple and fast lossy compression technique for images [1]. It has played an important role in the history of digital image coding in that many advanced coding technique have been developed based on BTC, or have been inspired by the success of BTC. In recent decades, several enhancements have been proposed in the literature [2-3] to improve the computational complexity, compression ratio, and reconstructed image quality of the BTC. The computational simplicity of the BTC and BTC-based image coding techniques have made it as an attractive tools in applications requiring fast real-time implementation. The dithering-based BTC, namely Ordered Dither Block Truncation Coding (ODBTC), was proposed by Guo-Wu [3] to improve the perceptual quality by involving the low-pass nature of the Human Visual System (HVS). It is based on the fact that continuous and halftone images are perceived similarly by human vision when viewed from a distance. The ODBTC scheme employs the dither array Look-Up-Table (LUT) to further increase the efficiency of encoding process. Except for the image compression, some interesting applications exploit the efficiency and low computational complexity of the ODBTC algorithm have been developed. Guo et.al. [4] proposed some image watermarking techniques by utilizing the simplicity of the ODBTC. Then, the ODBTC watermark offers a good tool as privacy and ownership protection as well. A simple application incorporating BTC in the real application is on the CBIR system as suggested in [5-8]. The BTC has already demonstrated its efficiency for image coding, it can also be conveniently and effectively used in image indexing and retrieval for image database applications. In this study, a new content-based image retrieval is proposed by employing the ODBTC encoding process. In the proposed scheme, the Color Co-occurrence Features (CCF) and Bit Pattern Features (BPF) are calculated from the ODBTC color quantizers and bitmap image, respectively. The rest of this paper is organized as follows: The ODBTC is briefly introduced and generalized for color images in Section II to facilitate the understanding of the prospective readers. Section III elaborates the proposed method for the color image CBIR. Extensive experimental results are reported in Section IV. Finally, the conclusion is drawn in Section V. 2. ORDERED-DITHER BLOCK TRUNCATION CODING AND ITS COLOR EXTENSION The algorithm of the Ordered Dithered Block Truncation Coding (ODBTC) [3] is generalized to color images in coping the object of the proposed CBIR. The main advantage of the ODBTC image compression is on its low complexity on the bitmap generating by incorporating the Look-Up Table (LUT) and free of mathematical multiplication and division operations on the two quantizers determination. In addition, the ODBTC can yield an image quality much better than that of the BTC as reported in [3]. Suppose an original image in RGB color space of size ܯis divided into several non-overlapping blocks of size and processed independently. Figure 1 shows the conceptual block diagram of the ODBTC encoding for a color image. Let ܤൌ ሼሺ, ሻ; ൌ 1,2, … , ; ൌ 1,2, … , /ሽ be a set of image blocks of size , containing the RGB color information. The original image block ሺ, ሻ is first converted into the inter-band average image as , ሺ, ሻ , by averaging the red, green, and blue of each pixel as , ሺ, ሻ ൌ , ሺ, ሻ , ሺ, ሻ , ሺ, ሻ൧ (1) where ൌ 1,2, … , ; ൌ 1,2, … , . 4006 978-1-4799-2341-0/13/$31.00 ©2013 IEEE ICIP 2013

Transcript of [IEEE 2013 20th IEEE International Conference on Image Processing (ICIP) - Melbourne, Australia...

Page 1: [IEEE 2013 20th IEEE International Conference on Image Processing (ICIP) - Melbourne, Australia (2013.09.15-2013.09.18)] 2013 IEEE International Conference on Image Processing - Content-based

Content-Based Image Retrieval with Ordered Dither Block Truncation Coding Features

Jing-Ming Guo, Senior Member, IEEE, and Heri Prasetyo

Department of Electrical Engineering

National Taiwan University of Science and Technology Taipei, Taiwan

E-mail: [email protected], [email protected]

ABSTRACT This paper presents a technique for Content-Based Image Retrieval (CBIR) by exploiting the low complexity advantage of the Ordered-Dither Block Truncation Coding (ODBTC) for generating image content descriptors. The two image features, namely Color Co-occurrence Feature (CCF) and Bit Pattern Features (BPF), are generated from ODBTC encoded data streams (without really performing an image compression or decoding process) to measure the similarity between two images. Experimental results show that the proposed method is superior to the Block Truncation Coding (BTC) image retrieval system and other former methods, and prove that the ODBTC scheme is not only suited for image compression for its simplicity, but also offers a conveniently way for image indexing in the content-based image retrieval system.

Index Terms — Ordered Dither Block Truncation Coding, Content-Based Image Retrieval, Color Co-occurrence Feature, Bit Pattern Feature

1. INTRODUCTION Block Truncation Coding (BTC) is a simple and fast lossy compression technique for images [1]. It has played an important role in the history of digital image coding in that many advanced coding technique have been developed based on BTC, or have been inspired by the success of BTC. In recent decades, several enhancements have been proposed in the literature [2-3] to improve the computational complexity, compression ratio, and reconstructed image quality of the BTC.

The computational simplicity of the BTC and BTC-based image coding techniques have made it as an attractive tools in applications requiring fast real-time implementation. The dithering-based BTC, namely Ordered Dither Block Truncation Coding (ODBTC), was proposed by Guo-Wu [3] to improve the perceptual quality by involving the low-pass nature of the Human Visual System (HVS). It is based on the fact that continuous and halftone images are perceived similarly by human vision when viewed from a distance. The ODBTC scheme employs the dither array Look-Up-Table (LUT) to further increase the efficiency of encoding process.

Except for the image compression, some interesting applications exploit the efficiency and low computational complexity of the ODBTC algorithm have been developed. Guo et.al. [4] proposed some image watermarking techniques by

utilizing the simplicity of the ODBTC. Then, the ODBTC watermark offers a good tool as privacy and ownership protection as well.

A simple application incorporating BTC in the real application is on the CBIR system as suggested in [5-8]. The BTC has already demonstrated its efficiency for image coding, it can also be conveniently and effectively used in image indexing and retrieval for image database applications.

In this study, a new content-based image retrieval is proposed by employing the ODBTC encoding process. In the proposed scheme, the Color Co-occurrence Features (CCF) and Bit Pattern Features (BPF) are calculated from the ODBTC color quantizers and bitmap image, respectively.

The rest of this paper is organized as follows: The ODBTC is briefly introduced and generalized for color images in Section II to facilitate the understanding of the prospective readers. Section III elaborates the proposed method for the color image CBIR. Extensive experimental results are reported in Section IV. Finally, the conclusion is drawn in Section V.

2. ORDERED-DITHER BLOCK TRUNCATION CODING AND ITS COLOR EXTENSION

The algorithm of the Ordered Dithered Block Truncation Coding (ODBTC) [3] is generalized to color images in coping the object of the proposed CBIR. The main advantage of the ODBTC image compression is on its low complexity on the bitmap generating by incorporating the Look-Up Table (LUT) and free of mathematical multiplication and division operations on the two quantizers determination. In addition, the ODBTC can yield an image quality much better than that of the BTC as reported in [3].

Suppose an original image in RGB color space of size is divided into several non-overlapping blocks of size

and processed independently. Figure 1 shows the conceptual block diagram of the ODBTC encoding for a color image. Let , ; 1,2, … , / ; 1,2, … , / be a set of image blocks of size , containing the RGB color information. The original image block ,  is first converted into the inter-band average image as , ,  , by averaging the red, green, and blue of each pixel as

, , , , , , , , (1)

where 1,2, … , ; 1,2, … , .

4006978-1-4799-2341-0/13/$31.00 ©2013 IEEE ICIP 2013

Page 2: [IEEE 2013 20th IEEE International Conference on Image Processing (ICIP) - Melbourne, Australia (2013.09.15-2013.09.18)] 2013 IEEE International Conference on Image Processing - Content-based

where the index , denotes the pixel coordinate the block , . Let , denotes the dither array index at the position

, with 1,2, … , and 1,2, … ,   . Let , , … , be a set of the scaled version of the dithered

array which can be computed as

,,

; 1,2,… , ; 1,2, … ,   (2)

where and denote the minimum and maximum value of the dither array, respectively. The set of , , … , can be off-line pre-calculated and stored as the Look-Up-Table (LUT) for later usage, and the complexity can be significantly reduced in the practical applications. The value of denotes the dither array index in LUT, and defined as , ,  . Since 0 255 , it implies that all of the values of dither array

, lie in the range 0,255 . The variables , and , represent the minimum and maximum values of the

inter-band average image on block , , which can be computed as follow:

, , , , (3) , , , ,   (4) Subsequently, an image bitmap for each block from ODBTC

process ,  can be easily determined as thresholding between the inter-band average pixel with the scaled version of dither array. For the pixel , ; 1,2, … , ; 1,2, … , on each block, the thresholding process is given by:

, ,1;   , , , ,

0;   , , , , (5)

Except for generating and sending the image bitmap to the decoder, the minimum and maximum quantizers of each block are also stored and transmitted to the ODBTC decoder. The sets of minimum and maximum quantizers for all blocks are given as

, ; 1,2,… , ; 1,2, … , (6)

, ; 1,2, … , ; 1,2, … , (7)

where , and , in each block are determined as the minimum and maximum values over red, green, and blue channels on the corresponding the image block , . These two values can be formulated as follow:

, , , , , , , , , , , , (8)

, , , , , , , , , , , , (9)

where 1,2,… , ; 1,2, … ,

At the end of the ODBTC encoding, the bitmap image, , the minimum, , and maximum quantizer, , are transmitted over transmission channel to form the encoded data stream.

3. ODBTC INDEXING Two features are introduced in the proposed method to characterize the contents of an image contents, i.e. Color Co-occurrence features (CCF) and Bit Pattern Features (BPF). Figure 3 shows the block diagram of the proposed method in CBIR. A. Color Co-occurrence Feature (CCF)

The attribute of an image also can be obtained from the image color distribution by employing the color co-occurrence which calculates the probability of the occurrence of a pixel and its adjacent to have the specific color and represents the spatial information of an image.

Let , , … , and , , … , be the codebook generated from the minimum quantizer,   , and maximum quantizer,   , respectively. Where

 ,  , and  are a codeword from minimum and maximum quantizer, and codebook size, respectively. The indexing process for minimum quantizer can be considered as closest matching between the minimum value of each block , with the codebook , , … , which meets the following condition:

, , ,…, , , (10)

The indexing for maximum quantizer ofn each block , with codebook , , … , is defined as:

, , ,…, , , (11)

The Color Co-occurrence Features (CCF) for a given image can be directly computed as:

, Pr , , , (12) where , 1,2, … , . B. Bit Pattern Feature (BPF) Another feature, namely Bit Pattern Feature (BPF), is used to characterize the edges and image contents. Let , , … , be the bit pattern codebook consists of binary codewords. The bitmap on block ,  is indexed based on the most similarity between this bitmap and the codeword which meets the following condition:

, , ,…, , , (13)

where ∙,∙   denotes the Hamming distance between the two binary patterns (vectors).

Then the BPF can be derived as the probability of bitmap image is mapped to codeword in the bitmap codebook. The BPF is then defined as:

, ; 1,2, … ,   (14) The bitmap codebook is generated using binary vector

quantization with soft centroids [17], involving several images on the training stage.

C. The similarity measure of the features The similarity between a query image and the set of images in the database can be calculated using the relative distance measure [5] as follow:

,| , , |

, ,

                                     ∑| |

(15)

where and are similarity weighting constants which represent the percentage contribution of the CCF and BPF in the proposed image retrieval. A small number is placed at the denominator to avoid the mathematic division error.

4007

Page 3: [IEEE 2013 20th IEEE International Conference on Image Processing (ICIP) - Melbourne, Australia (2013.09.15-2013.09.18)] 2013 IEEE International Conference on Image Processing - Content-based

The precision and recall measurements used to describe the performance of the retrieval system are defined as

, (16)

, (17)

where denotes the number of the retrieved images; denotes the number of relevant (similar) images against query image on retrieved images; and denotes the number of all relevant images in the database.

4. EXPERIMENT RESULTS The extensive experiments were conducted to examine the performance of the proposed method. The database consists of 1000 color images from the commercially available Corel Photo Collections [18]. These images are grouped into 10 classes with each class containing 100 images of size 384 256. A. Comparison with BTC indexing scheme All images in the database are first encoded using the ODBTC and BTC with block of sizes 4 4, 8 8, and 16 16, and then the image feature descriptors are obtained by both schemes. The color and bit pattern codebook sizes are set at 64 and 256, respectively, for the BTC indexing [5] and the proposed ODBTC indexing. The color codebook size for BTC-YCbCr indexing [7] is set at 64. The color histogram for BTC indexing [6] is set at

256 . The similarity weighting constants are set at 1, 1 .

Five images are randomly selected from the database to generate the codebook with the proposed scheme and the method in [5, 7]. All of the images in database are turned as queries, then a set of retrieved images is returned and ordered in ascending order. Figure 3 shows the results of the average precision and recall rate of each method over various block sizes. As it can be seen, the proposed method outperforms the BTC indexing method [5, 6, 7] under all block sizes and all different number of retrieved images,

10, 20, … , 100 . Notably, the proposed method yields a better result compared

to [5] and [7] under the same or lower feature dimensionality. B. Comparison with former schemes In this subsection, numerous former schemes are involved for performance comparisons. In this experiment, each image in the database is turned as the query and then the average precision rate is computed among all the query images under the number of retrieved images 20. Table I reports the comparisons among the proposed scheme and the former schemes. It is clear that the proposed method can achieve the best result even though the image feature descriptors are derived and extracted from low level visual features, i.e. color distribution and ordered dither bit pattern obtained from the block-based ODBTC encoding process. Notably, in [14] simply ten images are turned as query images. In this case, the average precision rate provided by [14] is 0.806, while the proposed scheme can provide 0.9187 in this scenario. Thus, a better result still can be obtained by the proposed scheme than that

of the method proposed in [14].

5. CONCLUSIONS In this study, an image retrieval system is presented by exploiting the ODBTC encoded data stream to construct the image features, namely Color Co-occurrence Features and Bit Pattern Features. As documented in the experimental results, the proposed scheme can provide the best average precision rate compared to various former schemes in the literature, and thus it can be a very competitive candidates in color image retrieval application.

6. REFERENCES [1] E. J. Delp and O. R. Mitchell, “Image coding using block truncation coding,” IEEE Trans. Commun., vol. 27, pp. 1335–1342, Sept. 1979. [2] M. D. Lema, and O. R. Mitchell, “Absolute moment block truncation coding and its application to color images,” IEEE Trans. Commun., vol. COM-32, pp. 1148-1157, Oct. 1984. [3] J. M. Guo, and M. F. Wu, “Improved Block Truncation Coding Based on the Void-and-Cluster Dithering Approach,” IEEE Trans. Image Processing, vol. 18, no. 1, pp. 211-213, Jan. 2009. [4] J. M. Guo, M. F. Wu, and Y. C. Kang, “Watermarking in conjugate ordered dither block truncation coding images,” Signal Processing, vol. 89, no. 10, pp. 1864-1882, Oct. 2009. [5] G. Qiu, “Color Image Indexing Using BTC,” IEEE Trans. Image Processing, Vol. 12, No. 1, Jan. 2003. [6] M. R. Gahroudi, and M. R. Sarshar, “Image retrieval based on texture and color method in BTC-VQ compressed domain,” Int. Symp. on Signal Processing and Its Application, 20th, Feb. 2007. [7] F.X. Yu, H. Luo, and Z.M. Lu, “Colour image retrieval using pattern co-occurrence matrices based on BTC and VQ,” Electronics Letters, 20th, vol. 47, no. 2, Jan. 2011. [8] S. Silakari, M. Motwani, and M. Maheshwari, “Color image clustering using block truncation algorithm,” Int. Jour. of Comp. Science Issues, vol. 4, no. 2, 2009. [9] C. H. Lin, R. T. Chen, and Y. K. Chan, “A smart content-based image retrieval system based on color and texture feature,” Image and Vision Computing, vol. 27, no. 6, pp. 658–665, May 2009. [10] N. Jhanwar, S. Chaudhurib, G. Seetharamanc, and B. Zavidovique, “Content based image retrieval using motif co-occurrence matrix,” Image and Vision Computing, vol. 22, pp. 1211–1220, Dec. 2004. [11] P. W. Huang, and S. K. Dai, “Image retrieval by texture similarity,” Pattern Recognition, vol. 36, no. 3, pp. 665–679, Mar. 2003. [12] T. C. Lu, and C. C. Chang, “Color image retrieval technique based on color features and image bitmap,” Inf. Process. Manage., vol. 43, no. 2, pp. 461-472, Mar. 2007. [13] T. W. Chiang and T. W. Tsai, “Content-based image retrieval via the multiresolution wavelet features of interest,” J. Inf. Technol. Appl., Vol. 1, no. 3, pp. 205-214, Dec. 2006. [14] C. C. Lai, and Y. C. Chen, “A user-Oriented Image Retrieval System Based on Interactive Genetic Algorithm,” IEEE Trans. Instrumentation and Measurement, vol. 60, no. 10, October 2011.

4008

Page 4: [IEEE 2013 20th IEEE International Conference on Image Processing (ICIP) - Melbourne, Australia (2013.09.15-2013.09.18)] 2013 IEEE International Conference on Image Processing - Content-based

[15] Z. M. Lu, and H. Burkhardt, “Colour image retrieval based on DCT-domain vector quantization index histograms,” Electronics Letters, vol. 41, no. 17, 2005. [16] P. Poursistani, H. Nezamabadi-pour, R. A. Moghadam, and M. Saeed, “Image indexing and retrieval in JPEG compressed domain based on vector quantization,” Mathematical and Computer Modeling, 2011, http://dx.doi.org/10.1016/j.mcm.2011.11.064. [17] P. Franti, and T. Kaukoranta, “Binary vector quantizer design using soft centroids,” Sign. Proc.: Image Comm., Vol. 14, page 677-681, 1999. [18] Corel Photo Collection Color Image Database, online available on http://wang.ist.psu.edu/docs/realted/.

Fig. 1. Block diagram of ODBTC encoding.

Fig. 2. Block diagram of ODBTC indexing.

Fig 3. Precision and Recall for BTC and ODBTC indexing.

TABLE I. PRECISION COMPARISON.

Methods People Beach Building Bus Dinosaur Elephant

Gahroudi [6] 0.411 0.344 0.335 0.249 0.889 0.297 Jhanwar [10] 0.453 0.398 0.374 0.741 0.915 0.304 Huang [11] 0.424 0.446 0.411 0.852 0.587 0.426 Chiang [13] 0.060 0.930 0.260 0.070 1.000 0.680 Silakari [8] 0.336 0.424 0.792 0.448 0.971 0.448

Qiu [5] 0.495 0.423 0.405 0.909 0.955 0.498 Z.M. Lu [15] 0.630 0.318 0.480 0.674 0.988 0.494

Lu [12] 0.500 0.900 0.590 0.520 1.000 0.800 Yu [7] 0.849 0.356 0.616 0.818 1.000 0.591 Lin [9] 0.683 0.540 0.562 0.888 0.993 0.658

Poursistani [16] 0.702 0.444 0.708 0.763 1.000 0.638 ODBTC indexing 0.8465 0.466 0.6815 0.885 0.992 0.733

Methods Flower Horse Mountaint Food Average

Gahroudi [6] 0.789 0.204 0.209 0.235 0.396

Jhanwar [10] 0.852 0.568 0.293 0.370 0.526

Huang [11] 0.898 0.589 0.268 0.427 0.532

Chiang [13] 0.880 0.260 0.260 0.930 0.533

Silakari [8] 0.941 0.588 0.432 0.223 0.560

Qiu [5] 0.821 0.689 0.313 0.443 0.595

Z.M. Lu [15] 0.884 0.588 0.338 0.612 0.600

Lu [12] 0.800 0.680 0.410 0.450 0.665

Yu [7] 0.931 0.928 0.404 0.682 0.717

Lin [9] 0.891 0.803 0.522 0.733 0.727

Poursistani [16] 0.924 0.947 0.562 0.745 0.743

ODBTC indexing 0.964 0.939 0.4735 0.8055 0.7786

0 0.2 0.4 0.6 0.80.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1Comparison on block size : 4x4

Recall

Pre

cisi

on

ODBTC Indexing, Nc=64,N

b=256

ODBTC Indexing, Nc=64,N

b=64

BTC - YCbCr[7]BTC Indexing [5]BTC Color Histogram [6]

0 0.1 0.2 0.3 0.4 0.5 0.6 0.70.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1Comparison on block size : 8x8

Recall

Pre

cisi

on

ODBTC Indexing, N

c=64,N

b=256

ODBTC Indexing, Nc=64,N

b=64

BTC - YCbCr[7]BTC Indexing [5]BTC Color Histogram [6]

0 0.2 0.4 0.6 0.80.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1Comparison on block size : 16x16

Recall

Pre

cisi

on

ODBTC Indexing, Nc=64,N

b=256

ODBTC Indexing, Nc=64,N

b=64

BTC - YCbCr[7]BTC Indexing [5]BTC Color Histogram [6]

4009