Audio Segmentation and Speaker Localization in Meeting Videos
Hong Kong August 20-August 24
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICPR.2006.283
18th International Conference on Patt ...
Himanshu Vajaria, University of South Florida, Tampa, FL 33620, USA
Tanmoy Islam, University of South Florida, Tampa, FL 33620, USA
Sudeep Sarkar, University of South Florida, Tampa, FL 33620, USA
Ravi Sankar, University of South Florida, Tampa, FL 33620, USA
Ranga Kasturi, University of South Florida, Tampa, FL 33620, USA
Segmenting the individuals in a group meeting, along with their speech, is an important first step for tasks such as meeting transcription, automatic camera panning, multimedia retrieval, and monologue detection. Given a meeting room video, we attempt to segment each person's speech and localize that person in the video, using data from a single audio source and a single video source. The segmentation method is driven by audio and enhanced by video cues. We use the Bayesian Information Criterion (BIC) to segment the feature vector streams and graph spectral partitioning to cluster them. We compare our segmentation results with those of an audio-based segmentation method, and our localization technique with the commonly used mutual information approach.
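The BIC segmentation step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the authors' features, models, or thresholds, so everything here is an assumption: the code uses the common delta-BIC test with one full-covariance Gaussian per segment, and the names `delta_bic`, `best_change_point`, `margin`, and `penalty_weight` are hypothetical.

```python
import numpy as np


def _n_logdet(x):
    """Return n * log|Sigma| for the ML covariance of the rows of x."""
    n, d = x.shape
    cov = np.atleast_2d(np.cov(x, rowvar=False, bias=True))
    cov = cov + 1e-6 * np.eye(d)  # small ridge keeps the determinant finite
    _, logdet = np.linalg.slogdet(cov)
    return n * logdet


def delta_bic(x, t, penalty_weight=1.0):
    """Delta-BIC for splitting x (frames x features) at frame t.

    Positive values favour two Gaussians (a change point at t)
    over a single Gaussian for the whole window.
    """
    n, d = x.shape
    n_params = d + d * (d + 1) / 2  # mean plus symmetric covariance
    penalty = 0.5 * penalty_weight * n_params * np.log(n)
    gain = 0.5 * (_n_logdet(x) - _n_logdet(x[:t]) - _n_logdet(x[t:]))
    return gain - penalty


def best_change_point(x, margin=20, penalty_weight=1.0):
    """Scan candidate frames and return (frame, score).

    The frame is None when no candidate has a positive delta-BIC.
    `margin` (an assumed value) keeps both segments large enough
    to estimate a covariance matrix.
    """
    candidates = list(range(margin, x.shape[0] - margin))
    scores = [delta_bic(x, t, penalty_weight) for t in candidates]
    i = int(np.argmax(scores))
    return (candidates[i], scores[i]) if scores[i] > 0 else (None, scores[i])


if __name__ == "__main__":
    # Synthetic stand-in for a feature stream with one speaker change.
    rng = np.random.default_rng(0)
    stream = np.vstack([rng.normal(0.0, 1.0, (200, 4)),
                        rng.normal(5.0, 1.0, (200, 4))])
    print(best_change_point(stream))
```

In practice the window would slide over the feature stream and the detected segments would then be passed to a clustering step; the synthetic data above only stands in for real audio features.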
Citation:
Himanshu Vajaria, Tanmoy Islam, Sudeep Sarkar, Ravi Sankar, Ranga Kasturi, "Audio Segmentation and Speaker Localization in Meeting Videos," icpr, vol. 2, pp.1150-1153, 18th International Conference on Pattern Recognition (ICPR'06) Volume 2, 2006