loading...
Generating Natural Language Description of Human Behavior from Video Images
Barcelona, Spain September 03-September 08
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICPR.2000.90302015th International Conference on Patt ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Atsuhiro Kojima, Osaka Prefecture University
Masao Izumi, Osaka Prefecture University
Takeshi Tamura, Osaka Prefecture University
Kunio Fukunaga, Osaka Prefecture University
In visual surveillance applications, it is becoming popular to perceive video images and to interpret them using natural language concepts. In this paper, we propose a new approach to generate natural language description of human behavior appeared in real video images. First, a head region of a human, on behalf of the whole body, is extracted from each frame. Using a model-based method, three dimensional pose and position of the head are estimated. Next, the trajectory of these parameters is divided into segments of monotonous motions. For each segment, we evaluate conceptual features such as degree of change of pose and position and that of relative distance to some objects in the surroundings, and so on. By calculating product of these feature values, a most suitable verb is selected and other syntactic elements are supplied. Finally, natural language text is generated using technique of machine translation.
Citation:
Atsuhiro Kojima, Masao Izumi, Takeshi Tamura, Kunio Fukunaga, "Generating Natural Language Description of Human Behavior from Video Images," icpr, vol. 4, pp.4728, 15th International Conference on Pattern Recognition (ICPR'00) - Volume 4, 2000
Usage of this product signifies your acceptance of the Terms of Use.