Supporting computational data model representation with high-performance I/O in parallel netCDF

Kui Gao*, Chen Jin, Alok Choudhary, Wei Keng Liao

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Scopus citations

Abstract

Parallel computational scientific applications have been described by their computation and communication patterns. From a storage and I/O perspective, these applications can also be grouped into separate data models based on the way data is organized and accessed during simulation, analysis, and visualization. Parallel netCDF is a popular library used in many scientific applications to store scientific datasets and provides high-performance parallel I/O. Although the metadata-rich netCDF file format can effectively store and describe regular multi-dimensional array datasets, it does not address the full range of current and future computational science data models. In this paper, we present a new storage scheme in Parallel netCDF to represent a broad variety of data models used in modern computational scientific applications. This scheme also allows concurrent metadata construction for different data objects from multiple groups of application processes, an important feature in obtaining a high degree of I/O parallelism for data models exhibiting irregular data distribution. Furthermore, we employ non-blocking I/O functions to aggregate irregularly distributed data requests into large, contiguous data requests, to achieve high-performance I/O. Using an example of adaptive mesh refinement data model, we demonstrate the proposed scheme can produce scalable performance results for both data and metadata creation and access.

Original languageEnglish (US)
Title of host publication18th International Conference on High Performance Computing, HiPC 2011
PublisherIEEE Computer Society
ISBN (Print)9781457719516
DOIs
StatePublished - 2011
Event18th International Conference on High Performance Computing, HiPC 2011 - Bangalore, India
Duration: Dec 18 2011Dec 21 2011

Publication series

Name18th International Conference on High Performance Computing, HiPC 2011

Other

Other18th International Conference on High Performance Computing, HiPC 2011
Country/TerritoryIndia
CityBangalore
Period12/18/1112/21/11

Keywords

  • Data Model
  • Parallel I/O
  • Parallel netCDF

ASJC Scopus subject areas

  • Software

Fingerprint

Dive into the research topics of 'Supporting computational data model representation with high-performance I/O in parallel netCDF'. Together they form a unique fingerprint.

Cite this