loading...
Streaming Random Forests
Banff, Alberta, Canada September 06-September 08
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/IDEAS.2007.4211th International Database Engineeri ...
 This Article 
 
PDF
HTML
IEEE Xplore Subscribers
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Hanady Abdulsalam, Queen's University, Canada
David B. Skillicorn, Queen's University, Canada
Patrick Martin, Queen's University, Canada
Many recent applications deal with data streams, conceptually endless sequences of data records, often arriving at high flow rates. Standard data-mining techniques typically assume that records can be accessed multiple times and so do not naturally extend to streaming data. Algorithms for mining streams must be able to extract all necessary information from records with only one, or perhaps a few, passes over the data. We present the Streaming Random Forests algorithm, an online and incremental stream classification algorithm that extends Breiman?s Random Forests algorithm. The Streaming Random Forests algorithm grows multiple decision trees, and classifies unlabelled records based on the plurality of tree votes. We evaluate the classification accuracy of the Streaming Random Forests algorithm on several datasets, and show that its accuracy is comparable to the standard Random Forest algorithm.
Index Terms:
Data mining, Classification, Decision trees, Data-stream classification, Random Forests.
Citation:
Hanady Abdulsalam, David B. Skillicorn, Patrick Martin, "Streaming Random Forests," ideas, pp.225-232, 11th International Database Engineering and Applications Symposium (IDEAS 2007), 2007
Usage of this product signifies your acceptance of the Terms of Use.