loading...
On Creating Efficient Object-relational Views of Scientific Datasets
Columbus, Ohio August 14-August 18
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICPP.2006.562006 International Conference on Para ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Sivaramakrishnan Narayanan, The Ohio State University, USA
Tahsin Kurc, The Ohio State University, USA
Umit Catalyurek, The Ohio State University, USA
Joel Saltz, The Ohio State University, USA
Scientific datasets are often large and distributed in flat files across several storage nodes. Scientists frequently want to analyze subsets of these datasets. A data source abstraction that provides an object-relational view of data while hiding the details of storage and transport mechanisms and dataset layouts is useful in this regard. In this abstraction, Basic Data Sources (BDS) interpret flat files as a set of records and are the building blocks of the view mechanism. Derived Data Sources (DDS) may be built on top of BDSs and provide more complex objects that serve the scientists? needs. The simplest DDS is one that supports a join based view over BDSs. We investigate issues involving building such DDSs for scientific applications and consider distributed versions of the indexed join and the Grace Hash join algorithms. We construct cost models that capture their performance in a restricted space of dataset and system parameters and compare them analytically and experimentally.
Citation:
Sivaramakrishnan Narayanan, Tahsin Kurc, Umit Catalyurek, Joel Saltz, "On Creating Efficient Object-relational Views of Scientific Datasets," icpp, pp.551-558, 2006 International Conference on Parallel Processing (ICPP'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.