loading...
Visual Speech Recognition: A Solution from Feature Extraction to Words Classification
S?o Carlos, Brazil October 12-October 15
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/SIBGRA.2003.1241036XVI Brazilian Symposium on Computer G ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Jacques Facon, Pontifical Catholic University of Parana
Díbio Leandro Borges, Pontifical Catholic University of Parana
Audio-visual Speech Recognition has been an active area of research lately. A bit, and yet unsolved, part of this problem is the visual only recognition, or lip reading. Considering an image sequence of a person pronouncing a word, a full image analysis solution would have to segment the mouth area, extract relevant features, and use them to be able to classify the word from those visual features. In this paper we approach this problem by proposing a segmentation technique for the lips contours together with a set of features based on the extracted contours which is able to perform lip reading with promising results. We have collected visual speech sequences in our lab and show the results here for a set of ten words in Brazilian Portuguese, spoken by different speakers in more than 150 samples. The approach can be extended and applied to other spoken languages as well.
Citation:
Luciana Gonçalves da Silveira, Jacques Facon, Díbio Leandro Borges, "Visual Speech Recognition: A Solution from Feature Extraction to Words Classification," sibgrapi, pp.399, XVI Brazilian Symposium on Computer Graphics and Image Processing (SIBGRAPI'03), 2003
Usage of this product signifies your acceptance of the Terms of Use.