A new approach to tracking weakly modeled objects in a semantically rich domain is presented. We define a closed world as a space-time region of an image sequence in which the complete taxonomy of objects is known and in which each pixel should be explained as belonging to one of those objects. Given contextual object information, context-specific features can be dynamically selected as the basis for tracking. A context-specific feature is one that has been chosen, based on the context, to maximize the chance of successful tracking between frames. Our work is motivated by the goal of video annotation: the semi-automatic generation of symbolic descriptions of action taking place in a contextually rich dynamic scene. We describe how contextual knowledge in the "football domain" can be applied to closed-world football player tracking and present the details of our implementation. We include tracking results based on hundreds of images that demonstrate the wide range of tracking situations the algorithm successfully handles, as well as a few examples of where the algorithm fails.
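The idea of choosing a feature from context can be illustrated with a minimal sketch. It is not the authors' implementation: the function `select_feature`, the feature names `brightness` and `redness`, the mean-difference score, and all pixel values are hypothetical and chosen only for illustration. The sketch assumes that, inside a closed world, every pixel near the tracked player is known to belong to some other object (field or yard line), so the tracker can pick whichever feature best separates the player from those known neighbors.

```python
from statistics import mean


def select_feature(target_pixels, neighbor_pixels, feature_names):
    """Return the feature whose mean differs most between target and neighbors.

    Each pixel is a dict mapping a feature name to a numeric value.
    The absolute difference of means is an assumed, deliberately simple score.
    """
    def separation(name):
        target_mean = mean(p[name] for p in target_pixels)
        neighbor_mean = mean(p[name] for p in neighbor_pixels)
        return abs(target_mean - neighbor_mean)

    return max(feature_names, key=separation)


features = ["brightness", "redness"]

# Hypothetical pixel samples for one player and two possible surroundings.
player = [{"brightness": 0.9, "redness": 0.5}, {"brightness": 0.8, "redness": 0.6}]
field = [{"brightness": 0.4, "redness": 0.3}, {"brightness": 0.5, "redness": 0.4}]
yard_line = [{"brightness": 0.9, "redness": 0.1}, {"brightness": 0.9, "redness": 0.2}]

# Context 1: the player is surrounded by field pixels.
on_field = select_feature(player, field, features)

# Context 2: the player overlaps a bright yard line.
on_line = select_feature(player, yard_line, features)
```

With these made-up values, brightness separates the player from the field, while redness is the better choice when the player crosses a bright yard line, so the selected feature changes with the context.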
Index Terms:
tracking; image sequences; object detection; sport; computer vision; closed-world tracking; weakly modeled object tracking; semantically rich domain; space-time region; image sequence; complete object taxonomy; pixel; contextual object information; context-specific features; video annotation; semi-automatic symbolic description generation; action; contextually-rich dynamic scene; football domain; closed-world football player tracking
Citation:
S.S. Intille and A.F. Bobick, "Closed-world tracking," Fifth International Conference on Computer Vision (ICCV'95), p. 672, 1995.