loading...
Visual Speech Recognition Using Image Moments and Multiresolution Wavelet Images
Sydney, Australia July 26-July 28
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/CGIV.2006.92International Conference on Computer ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Wai C. Yau, RMIT University, Australia
Dinesh K. Kumar, RMIT University, Australia
Sridhar P. Arjunan, RMIT University, Australia
Sanjay Kumar, RMIT University, Australia
This paper describes a new technique for recognizing speech using visual speech information. The video data of the speaker?s mouth is represented using grayscale images named as motion history image (MHI). MHI is generated by applying accumulative image differencing on the frames of the video to implicitly represent the temporal information of the mouth movement. The MHIs are decomposed into wavelet sub images using Discrete Stationary Wavelet Transform (SWT). Three moment-based features (geometric moments, Zernike moments and Hu moments) are extracted from the SWT approximate sub images. Multilayer perceptron (MLP) type artificial neural network (ANN) with back propagation learning algorithm is used to classify the moments features. This paper evaluates and compares the image representation ability of the different moments. The initial experiments show that this method can classify English consonants with an error rate less than 5%.
Index Terms:
visual speech recognition, motion history image, image moments, discrete stationary wavelet transform
Citation:
Wai C. Yau, Dinesh K. Kumar, Sridhar P. Arjunan, Sanjay Kumar, "Visual Speech Recognition Using Image Moments and Multiresolution Wavelet Images," cgiv, pp.194-199, International Conference on Computer Graphics, Imaging and Visualisation (CGIV'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.