loading...
Cross-Genre Feature Comparisons for Spoken Sentence Segmentation
Irvine, California September 17-September 19
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICSC.2007.89International Conference on Semantic ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Sebastien Cuendet, International Computer Science Institute, USA
Dilek Hakkani-Tur, International Computer Science Institute, USA
Elizabeth Shriberg, International Computer Science Institute, USA; SRI International, USA
James Fung, International Computer Science Institute, USA
Benoit Favre, International Computer Science Institute, USA
Automatic sentence segmentation of spoken language is an important precursor to downstream natural language processing. Previous studies combine lexical and prosodic features, but can impose significant computational challenges because of the large size of feature sets. Little is understood about which features most benefit performance, particularly for speech data from different speaking styles. We compare sentence segmentation for speech from broadcast news versus natural multi-party meetings, using identical lexical and prosodic feature sets across genres. Results based on boosting and forward selection for this task show that (1) features sets can be reduced with little or no loss in performance, and (2) the contribution of different feature types differs significantly by genre. We conclude that more efficient approaches to sentence segmentation and similar tasks can be achieved, especially if genre differences are taken into account.
Citation:
Sebastien Cuendet, Dilek Hakkani-Tur, Elizabeth Shriberg, James Fung, Benoit Favre, "Cross-Genre Feature Comparisons for Spoken Sentence Segmentation," icsc, pp.265-274, International Conference on Semantic Computing (ICSC 2007), 2007
Usage of this product signifies your acceptance of the Terms of Use.


Suggestions