In this work a sub-symbolic technique for automatic, data driven language models construction is presented. Such a technique can be used to arrange a language-modelling module, which can be easily integrated in existing speech recognition architectures, such as the well-found HTK architecture. The proposed technique takes advantages from both the traditional LSA approach and from a novel application of a probability space metric known as "Hellinger?s distance." Experimental trials are also presented, in order to validate the proposed approach.
Citation:
Francesco Agostaro, Giovanni Pilato, Giorgio Vassallo, Salvatore Gaglio, "A Sub-Symbolic Approach to Word Modelling for Domain Specific Speech Recognition," camp, pp.321-326, Seventh International Workshop on Computer Architecture for Machine Perception (CAMP'05), 2005