A Sparse Texture Representation Using Local Affine Regions
|
This paper introduces a texture representation suitable for recognizing images of textured surfaces under a wide range of transformations, including viewpoint changes and nonrigid deformations. At the feature extraction stage, a sparse set of affine Harris and Laplacian regions is found in the image. Each of these regions can be thought of as a texture element having a characteristic elliptic shape and a distinctive appearance pattern. This pattern is captured in an affine-invariant fashion via a process of shape normalization followed by the computation of two novel descriptors, the spin image and the RIFT descriptor. When affine invariance is not required, the original elliptical shape serves as an additional discriminative feature for texture recognition. The proposed approach is evaluated in retrieval and classification tasks using the entire Brodatz database and a publicly available collection of 1,000 photographs of textured surfaces taken from different viewpoints.
[1] 1265 A. Baumberg, “Reliable Feature Matching across Widely Separated Views,” Proc. Computer Vision and Pattern Recognition, vol. 1, pp. 774-781, 2000.
[2] S. Belongie, J. Malik, and J. Puzicha, “Shape Matching and Object Recognition Using Shape Contexts,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 24, no. 4, pp. 509-522, Apr. 2002.
[3] D. Blostein and N. Ahuja, “A Multiscale Region Detector,” Computer Vision, Graphics, and Image Processing, vol. 45, pp. 22-41, 1989.
[4] P. Brodatz, Textures: A Photographic Album for Artists and Designers. New York: Dover, 1966.
[5] F.S. Cohen, Z. Fan, and M.A.S. Patel, “Classification of Rotated and Scaled Textured Images Using Gaussian Markov Field Models,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 13, no. 2, pp. 192-202, Feb. 1991.
[6] J.L Crowley and A.C. Parker, “A Representation of Shape Based on Peaks and Ridges in the Difference of Low-Pass Transform,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 6, pp. 156-170, 1984.
[7] O.G. Cula and K.J. Dana, “Compact Representation of Bidirectional Texture Functions,” Proc. Computer Vision and Pattern Recognition, vol. 1, pp. 1041-1047. 2001.
[8] K.J. Dana, B. van Ginneken, S.K. Nayar, and J.J. Koenderink, “Reflectance and Texture of Real World Surfaces,” ACM Trans. Graphics, vol. 18, no. 1, pp. 1-34, 1999.
[9] J. Gårding and T. Lindeberg, “Direct Computation of Shape Cues Using Scale-Adapted Spatial Derivative Operators,” Int'l J. Computer Vision, vol. 17, no. 2, pp. 163-191, 1996.
[10] B. Georgescu, I. Shimshoni, and P. Meer, “Mean Shift Based Clustering in High Dimensions: A Texture Classification Example,” Proc. Int'l Conf. Computer Vision, pp. 456-463, 2003.
[11] R. Haralick, “Statistical and Structural Approaches to Texture,” Proc. IEEE, vol. 67, pp. 786-804, 1979.
[12] C. Harris and M. Stephens, “A Combined Corner and Edge Detector,” Proc. Fourth Alvey Vision Conf., M.M. Matthews, ed., pp. 147-151, 1988.
[13] A. Johnson and M. Hebert, “Using Spin Images for Efficient Object Recognition in Cluttered 3D Scenes,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 21, no. 5, pp. 433-449, May 1999.
[14] B. Julesz, “Visual Pattern Discrimination,” IRE Trans. Information Theory, vol. 8, pp. 84-92, 1962.
[15] B. Julesz, “Textons, the Elements of Texture Perception and Their Interactions,” Nature, vol. 290, pp. 91-97, 1981.
[16] T. Kadir and M. Brady, “Scale, Saliency and Image Description,” Int'l J. Computer Vision, vol. 45, no. 2, pp. 83-105, 2001.
[17] L. Kaufman and P. Rousseeuw, Finding Groups in Data: An Introduction to Cluster Analysis. New York: John Wiley & Sons, 1990.
[18] J. Koenderink and A. Van Doorn, “Representation of Local Geometry in the Visual System,” Biological Cybernetics, vol. 55, pp. 367-375, 1987.
[19] J. Koenderink and A. Van Doorn, “The Structure of Locally Orderless Images,” Int'l J. Computer Vision, vol. 31, nos. 2/3, pp. 159-168, 1999.
[20] S. Lazebnik, C. Schmid, and J. Ponce, “Affine-Invariant Local Descriptors and Neighborhood Statistics for Texture Recognition,” Proc. Int'l Conf. Computer Vision, pp. 649-655, 2003.
[21] S. Lazebnik, C. Schmid, and J. Ponce, “A Sparse Texture Representation Using Affine-Invariant Regions,” Proc. Computer Vision and Pattern Recognition, vol. 2, pp. 319-324, 2003.
[22] T. Leung and J. Malik, “Recognizing Surfaces Using Three-Dimensional Textons,” Int'l J. Computer Vision, vol. 43, no. 1, pp. 29-44, 2001.
[23] E. Levina and P. Bickel, “The Earth Mover's Distance is the Mallows Distance: Some Insights from Statistics,” Proc. Int'l Conf. Computer Vision, vol. 2, pp. 251-256, 2001.
[24] T. Lindeberg, “Feature Detection with Automatic Scale Selection,” Int'l J. Computer Vision, vol. 30, no. 2, pp. 77-116, 1998.
[25] F. Liu and R.W. Picard, “Periodicity, Directionality, and Randomness: Wold Features for Image Modeling and Retrieval,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 18, no. 7, pp. 722-733, July 1996.
[26] X. Llado, J. Marti, and M. Petrou, “Classification of Textures Seen from Different Distances and under Varying Illumination Direction,” Proc. IEEE Int'l Conf. Image Processing, vol. 1, pp. 833-836, 2003.
[27] D. Lowe, “Distinctive Image Features from Scale-Invariant Keypoints,” Int'l J. Computer Vision, vol. 60, no. 2, pp. 91-110, 2004.
[28] J. Malik, S. Belongie, T. Leung, and J. Shi, “Contour and Texture Analysis for Image Segmentation,” Int'l J. Computer Vision, vol. 43, no. 1, pp. 7-27, 2001.
[29] J. Malik and P. Perona, “Preattentive Texture Discrimination with Early Vision Mechanisms,” J. Optical Soc. Am. A, vol. 7, no. 5, pp. 923-932, 1990.
[30] J. Mao and A. Jain, “Texture Classification and Segmentation Using Multiresolution Simultaneous Autoregressive Models,” Pattern Recognition, vol. 25, pp. 173-188, 1992.
[31] J. Matas, O. Chum, U. Martin, and T. Pajdla, “Robust Wide Baseline Stereo from Maximally Stable Extremal Regions,” Proc. British Machine Vision Conf., vol. 1, pp. 384-393, 2002.
[32] K. Mikolajczyk and C. Schmid, “Indexing Based on Scale Invariant Interest Points,” Proc. Int'l Conf. Computer Vision, pp. 525-531, 2001.
[33] K. Mikolajczyk and C. Schmid, “An Affine Invariant Interest Point Detector,” Proc. European Conf. Computer Vision, vol. 1, pp. 128-142, 2002.
[34] K. Mikolajczyk and C. Schmid, “A Performance Evaluation of Local Descriptors,” Proc. Computer Vision and Pattern Recognition, vol. 2, pp. 257-263, 2003.
[35] R. Picard, T. Kabir, and F. Liu, “Real-Time Recognition with the Entire Brodatz Texture Database,” Proc. Computer Vision and Pattern Recognition, pp. 638-639, 1993.
[36] J. Puzicha, Y. Rubner, C. Tomasi, and J. Buhmann, “Empirical Evaluation of Dissimilarity Measures for Color and Texture,” Proc. Int'l Conf. Computer Vision, vol. 2, pp. 1165-1172, 1999.
[37] T. Randen and J. Husøy, “Filtering for Texture Classification: A Comparative Study,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 21, no. 4, pp. 291-310, Apr. 1999.
[38] Y. Rubner and C. Tomasi, “Texture-Based Image Retrieval without Segmentation,” Proc. Int'l Conf. Computer Vision, pp. 1018-1024, 1999.
[39] Y. Rubner, C. Tomasi, and L. Guibas, “The Earth Mover's Distance as a Metric for Image Retrieval,” Int'l J. Computer Vision, vol. 40, no. 2, pp. 99-121, 2000.
[40] F. Schaffalitzky and A. Zisserman, “Viewpoint Invariant Texture Matching and Wide Baseline Stereo,” Proc. Int'l Conf. Computer Vision, vol 2, pp. 636-643, 2001.
[41] F. Schaffalitzky and A. Zisserman, “Multi-View Matching for Unordered Image Sets, or ‘How Do I Organize my Holiday Snaps?’” Proc. European Conf. Computer Vision, vol. 1, pp. 414-431, 2002.
[42] C. Schmid, “Constructing Models for Content-Based Image Retrieval,” Proc. Computer Vision and Pattern Recognition, vol. 2, pp. 39-45, 2001.
[43] C. Schmid and R. Mohr, “Local Gray-Value Invariants for Image Retrieval,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 19, no. 5, pp. 530-535, May 1997.
[44] J. Sivic and A. Zisserman, “Video Google: A Text Retrieval Approach to Object Matching in Videos,” Proc. Int'l Conf. Computer Vision, pp. 1470-1477, 2003.
[45] T. Tuytelaars and L. Van Gool, “Matching Widely Separated Views Based on Affinely Invariant Neighbourhoods,” Int'l J. Computer Vision, vol. 59, no. 1, pp. 61-85, 2004.
[46] M. Varma and A. Zisserman, “Classifying Images of Materials: Achieving Viewpoint and Illumination Independence,” Proc. European Conf. Computer Vision, vol. 3, pp. 255-271, 2002.
[47] M. Varma and A. Zisserman, “Texture Classification: Are Filter Banks Necessary?” Proc. Computer Vision and Pattern Recognition, vol. 2, pp. 691-698, 2003.
[48] H. Voorhees and T. Poggio, “Detecting Textons and Texture Boundaries in Natural Images,” Proc. Int'l Conf. Computer Vision, pp. 250-258, 1987.
[49] J. Wu and M.J. Chantler, “Combining Gradient and Albedo Data for Rotation Invariant Classification of 3D Surface Texture,” Proc. Int'l Conf. Computer Vision, vol. 2, pp. 848-855, 2003.
[50] K. Xu, B. Georgescu, D. Comaniciu, and P. Meer, “Performance Analysis in Content-Based Retrieval with Textures,” Proc. Int'l Conf. Pattern Recognition, vol. 4, pp. 275-278, 2000.
[51] J. Zhang, P. Fieguth, and D. Wang, “Random Field Models,” Handbook of Image and Video Processing, A. Bovik, ed., pp. 301-312, San Diego, Calif.: Academic Press, 2000.
Index Terms:
Index Terms- Image processing and computer vision, feature measurement, texture, pattern recognition.
Citation:
Svetlana Lazebnik, Cordelia Schmid, Jean Ponce, "A Sparse Texture Representation Using Local Affine Regions," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 27, no. 8, pp. 1265-1278, Aug. 2005, doi:10.1109/TPAMI.2005.151