loading...
Algorithms for Defining Visual Regions-of-Interest: Comparison with Eye Fixations
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/34.877520September 2000 (vol. 22 no. 9) pp. 970-982
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   

Abstract—Many machine vision applications, such as compression, pictorial database querying, and image understanding, often need to analyze in detail only a representative subset of the image, which may be arranged into sequences of loci called regions-of-interest, ROIs. We have investigated and developed a methodology that serves to automatically identify such a subset of aROIs (algorithmically detected ROIs) using different Image Processing Algorithms, IPAs, and appropriate clustering procedures. In human perception, an internal representation directs top-down, context-dependent sequences of eye movements to fixate on similar sequences of hROIs (human identified ROIs). In this paper, we introduce our methodology and we compare aROIs with hROIs as a criterion for evaluating and selecting bottom-up, context-free algorithms. An application is finally discussed.

[1] 970 T.A. Bahill and L.W. Stark, “Trajectories of Saccadic Eye Movements,” Scientific Am., vol. 240, pp. 84-93, 1979.
[2] S. Brandt and L.W. Stark, “Spontaneous Eye Movements during Visual Imagery Reflect the Content of the Visual Scene,” J. Cognitive Neuroscience, vol. 9, no. 1, pp. 27-38, 1997.
[3] J. Canny, “A Computational Approach to Edge Detection,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 8, no. 6, pp. 679-698, June 1986.
[4] I. Daubechies,“Ten lectures on wavelets,” SIAM CBMS-61, 1992.
[5] C. De Vleeschouwer, X. Marichal, T. Delmot, and B. Macq, “A Fuzzy Logic System Able to Detect Interesting Areas of a Video Sequence,” Proc. SPIE, vol. 3,016, pp. 234-245, 1997.
[6] V. Di Gesú and C. Valenti, “The Discrete Symmetry Transform in Computer Vision,” Technical Report 011-95, Laboratory for Computer Science (DMA), Univ. of Palermo, 1995.
[7] L. Fisher, Fixed Effects Analysis of Variance. New York: Academic Press, 1978.
[8] L. Itti, C. Koch, and E. Niebur, “A Model for Saliency-Based Visual Attention for Rapid Scene Analysis,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 20, no. 11, pp. 1,254-1,259, Nov. 1998.
[9] P.J. Locher and C.F. Nodine, “Symmetry Catches the Eye,” Eye Movements: From Physiology to Cognition, pp. 353-361, 1987.
[10] S.K. Mannan, K.H. Ruddock, and D.S. Wooding, “The Relationship between the Locations of Spatial Features and Those of Fixations Made during Visual Examination of Briefly Presented Images,” Spatial Vision, vol. 10, no. 3, pp. 165-188, 1996.
[11] E. Niebur and C. Koch, “Control of Selective Visual Attention: Modeling the 'Where' Pathway,” Advances in Neural Information Processing Systems, D.S. Touretzky, M.C. Mozer, and M.E. Hasselmo, eds., vol. 8, pp. 802-808, MIT Press, 1996.
[12] D. Noton and L.W. Stark, “Scanpaths in Eye Movements During Pattern Perception,” Science, vol. 17, no. 1, pp. 308-311, 1971.
[13] W. Osberger and A.J. Maeder, “Automatic Identification of Perceptually Important Regions in an Image,” Proc. Int'l Conf. Pattern Recognition, pp. 17-20, 1998.
[14] R. Plamondon and C.M. Privitera, “The Segmentation of Cursive Handwriting: An Approach Based on Off-Line Recovery of the Motor-Temporal Information,” IEEE Trans. Image Processing, vol. 8, no. 1, pp. 80-91, Jan. 1999.
[15] C.M. Privitera, M. Azzariti, and L.W Stark, “Locating Regions-of-Interest for the Mars Rover,” Int'l J. Remote Sensing, 2000. to appear.
[16] C.M. Privitera and L.W. Stark, “Evaluating Image Processing Algorithms That Predict Regions of Interest,” Pattern Recognition Letters, vol. 19, no. 1, pp. 1,037-1,043, 1998.
[17] C.M. Privitera and L.W. Stark, “Focused JPEG Encoding Based upon Automatic Preidentified Regions of Interest,” Proc. SPIE, vol. 3,644, pp. 552-558, 1999.
[18] D. Reisfeld, H. Wolfsow, and Y. Yeshurun, “Context-Free Attentional Operators: the Generalized Symmetry Transform,” Int'l J. Computer Vision, vol. 14, pp. 119-130, 1995.
[19] W. Richards and L. Kaufman, “Centre-of-Gravity Tendencies for Fixations and Flow Patterns,” Perception and Psychology, vol. 5, pp. 81-84, 1969.
[20] O. Rioul and P. Duhamel, Fast Algorithms for Discrete and Continuous Wavelet Transforms IEEE Trans. Information Theory, vol. 38, no. 2, pp. 569-586, Mar. 1992.
[21] L.W. Stark and Y. Choi, “Experimental Metaphysics: The Scanpath as an Epistemological Mechanism,” Visual Attention and Cognition, W.H. Zangemeister, H.S. Stiehl, and C. Freksa, eds., pp. 3-69, 1996.
[22] L.W. Stark and C.M. Privitera, “Top-Down and Bottom-Up Image Processing,” Proc. of IEEE Int'l Conf. Neural Networks, vol. 4, pp. 2,294-2,299, June 1997.
[23] L.W. Stark, C.M. Privitera, H. Yang, M. Azzariti, Y.F. Ho, A. Chan, C. Krischer, and A. Weinberger, “Scanpath Memory Binding: Multiple Read-Out Experiments,” Proc. SPIE, vol. 3,644, pp. 495-510, 1999.
[24] P.P. Vaidyanathan, Multirate Systems and Filter Banks. Englewood Cliffs, N.J.: Prentice-Hall, 1993.
[25] A.L. Yarbus, Eye Movements and Vision. New York: Plenum Press, 1967.
[26] W.H. Zangemeister, K. Sherman, and L.W. Stark, “Evidence for Global Scanpath Strategy in Viewing Abstract Compared with Realistic Images,” Neuropsychologia, vol. 33, no. 8, pp. 1,009-1,025, 1995.
[27] J. Zhao, Y. Shimazu, K. Ohta, R. Hayasaka, and Y. Matsushita, “A JPEG Codec Adaptive to the Relative Importance of Regions in an Image,” IEEE Trans. Information Processing Soc. Japan, vol. 38, no. 8, pp. 1,531-1,542, 1997.

Index Terms:
Eye movements, scanpath theory, regions of interest identification and comparison.
Citation:
Claudio M. Privitera, Lawrence W. Stark, "Algorithms for Defining Visual Regions-of-Interest: Comparison with Eye Fixations," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 22, no. 9, pp. 970-982, Sept. 2000, doi:10.1109/34.877520
Usage of this product signifies your acceptance of the Terms of Use.