loading...
Integrated Analysis of Speech and Images as a Probabilistic Decoding Process
Quebec City, QC, Canada August 11-August 15
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICPR.2002.104837116th International Conference on Patt ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Sven Wachsmuth, Bielefeld University
Gerhard Sagerer, Bielefeld University
Speech understanding and vision are the two most important modalities in human-human communication. However, the emulation of these by a computer faces fundamental difficulties due to noisy data, vague meanings, previously unseen objects or unheard words, occlusions, spontaneous speech effects, and context dependence. Thus, the interpretation processes on both channels are highly error-prone. This paper presents a new perspective on the problem of relating speech and image interpretations as a probabilistic decoding process. It is shown that such an integration scheme is robust regarding partial or erroneous interpretations. Furthermore, it is shown that implicit error correction strategies can be formulated in this probabilistic framework that lead to improved scene interpretation.
Citation:
Sven Wachsmuth, Gerhard Sagerer, "Integrated Analysis of Speech and Images as a Probabilistic Decoding Process," icpr, vol. 2, pp.20588, 16th International Conference on Pattern Recognition (ICPR'02) - Volume 2, 2002
Usage of this product signifies your acceptance of the Terms of Use.