loading...
ParRescue: Scalable Parallel Algorithm and Implementation for Biclustering over Large Distributed Datasets
Lisboa, Portugal July 04-July 07
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDCS.2006.6226th IEEE International Conference on ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Jianhong Zhou, University of Illinois at Chicago
Ashfaq Khokhar, University of Illinois at Chicago
Biclustering refers to simultaneously capturing correlations present among subsets of attributes (columns) and records (rows). It is widely used in data mining applications including biological data analysis, financial forecasting, and text mining. Biclustering algorithms are significantly more complex compared to the classical one dimensional clustering techniques, particularly those requiring multiple computing platforms for large and distributed data sets. In this paper, we develop an efficient scalable algorithm, referred to as ParRescue(Parallel Residue Co-clustering), that is capable of performing biclustering on extremely large or geographically distributed data sets. ParRescue divides the cluster tasks among processors with minimal communication costs thus making it scalable over large number of computing nodes. The proposed implementation is based on an existing sequential approach that has been modified for amenable parallel implementation. The proposed Par- Rescue algorithm has been implemented using MPI and the performance results are reported based on executions on a 64 node Linux PC cluster connected over 100 Mbits links. The experimental results show scalable performance with near linear speedups across different data and machine sizes compared to the modified sequential algorithm.
Citation:
Jianhong Zhou, Ashfaq Khokhar, "ParRescue: Scalable Parallel Algorithm and Implementation for Biclustering over Large Distributed Datasets," icdcs, pp.21, 26th IEEE International Conference on Distributed Computing Systems (ICDCS'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.


Suggestions