loading...
A Parallel Algorithm for Closed Cube Computation
May 14-May 16
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICIS.2008.63Seventh IEEE/ACIS International Confe ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Closed cubing is a very efficient algorithm for data cube compression proposed recently in the literature. It losslessly condenses a group of cells into one cell if these cells have the same aggregate value and preserve roll-up/drill-down semantics. Despite its importance, parallel closed cubing solutions for huge data sets are not well studied so far to the best of the authors’ knowledge. This paper presents a parallel closed cube construction and query algorithm over low cost PC clusters using the MapReduce framework. In addition, we proved that with the number of data blocks increases, the closed cubes’ storage size decreases gradually. Thus users can specify the number of data blocks to balance the performance between cubes storage and query time. Experimental study demonstrates that our algorithm is efficient and scalable.
Index Terms:
OLAP, parallel computation, closed cube, MapReduce, Hadoop
Citation:
Jinguo You, Jianqing Xi, Pingjian Zhang, Hu Chen, "A Parallel Algorithm for Closed Cube Computation," icis, pp.95-99, Seventh IEEE/ACIS International Conference on Computer and Information Science (icis 2008), 2008
Usage of this product signifies your acceptance of the Terms of Use.