loading...
Detecting Buzz from Time-Sequenced Document Streams
Hong Kong, China March 29-April 01
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/EEE.2005.572005 IEEE International Conference on ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Jeonghee Yi, IBM Almaden Research Center, San Jose, CA
This paper presents a formal method of detecting emerging and changing interests that appear in document streams arriving continuously over time. Examples of such document streams include email, news articles, and weblogs (or blogs). We utilize the temporal information associated with documents in the streams and discover emerging issues and topics of interest and their change by detecting buzzwords in the documents. Buzzwords are terms that occur with strong momentum for a relatively short period of time.
Our approach for buzz detection is based on the notion of "burst of activities" proposed by Kleinberg. The burst of activities is modeled using a weighted automaton. We propose an algorithm to discover buzzwords of high intensity measured by their momentum and relative duration of the bursts. The method is applied and validated on a stream of blog postings and we report the experiment results.
Citation:
Jeonghee Yi, "Detecting Buzz from Time-Sequenced Document Streams," eee, pp.347-352, 2005 IEEE International Conference on e-Technology, e-Commerce and e-Service (EEE'05), 2005
Usage of this product signifies your acceptance of the Terms of Use.