Bin Wang, University of Science and Technology of China, Hefei 230026, China
Zhiwei Li, Microsoft Research Asia, 49 Zhichun Road, Beijing 100080, China
Mingjing Li, Microsoft Research Asia, 49 Zhichun Road, Beijing 100080, China
Wei-ying Ma, Microsoft Research Asia, 49 Zhichun Road, Beijing 100080, China
Finding visually identical images in large image collections is important for many applications such as intelligence propriety protection and search result presentation. Several algorithms have been reported in the literature, but they are not suitable for large image collections. In this paper, a novel algorithm is proposed to handle the situation, in which each image is compactly represented by a hash code. To detect duplicate images, only the hash codes are required. In addition, a very efficient search method is implemented to quickly group images with similar hash codes for fast detection. The experiments show that our algorithm can be both efficient and effective for duplicate detection in web image search.
Citation:
Bin Wang, Zhiwei Li, Mingjing Li, Wei-ying Ma, "Large-Scale Duplicate Detection for Web Image Search," icme, pp.353-356, 2006 IEEE International Conference on Multimedia and Expo, 2006