TY - GEN
T1 - Design and implementation of a parallel I/O runtime system for irregular applications
AU - No, Jaechun
AU - Park, Sung Soon
AU - Carretero, Jesús
AU - Choudhary, Alok Nidhi
AU - Chen, Pang
PY - 1998/1/1
Y1 - 1998/1/1
N2 - In this paper we present the design, implementation and evaluation of a runtime system based on collective I/O techniques for irregular applications. We present two models, namely, «Collective I/O» and «Pipelined Collective I/O». In the first scheme, all processors participate in the I/O simultaneously, which makes scheduling of I/O requests simpler but creates a possibility of contention at the I/O nodes. In the second approach, processors are divided into several groups, so that only one group performs I/O at a time while the next group performs communication to rearrange data; this entire process is pipelined to reduce I/O node contention dynamically. Both models have been optimized by using software caching, chunking and on-line compression mechanisms. We demonstrate that we can obtain significantly higher I/O performance than has been possible so far. The performance results are presented on an Intel Paragon and on the ASCI/Red teraflops machine at Sandia National Labs.
AB - In this paper we present the design, implementation and evaluation of a runtime system based on collective I/O techniques for irregular applications. We present two models, namely, «Collective I/O» and «Pipelined Collective I/O». In the first scheme, all processors participate in the I/O simultaneously, which makes scheduling of I/O requests simpler but creates a possibility of contention at the I/O nodes. In the second approach, processors are divided into several groups, so that only one group performs I/O at a time while the next group performs communication to rearrange data; this entire process is pipelined to reduce I/O node contention dynamically. Both models have been optimized by using software caching, chunking and on-line compression mechanisms. We demonstrate that we can obtain significantly higher I/O performance than has been possible so far. The performance results are presented on an Intel Paragon and on the ASCI/Red teraflops machine at Sandia National Labs.
UR - http://www.scopus.com/inward/record.url?scp=85044993084&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85044993084&partnerID=8YFLogxK
U2 - 10.1109/IPPS.1998.669926
DO - 10.1109/IPPS.1998.669926
M3 - Conference contribution
AN - SCOPUS:85044993084
T3 - Proceedings of the 1st Merged International Parallel Processing Symposium and Symposium on Parallel and Distributed Processing, IPPS/SPDP 1998
SP - 280
EP - 284
BT - Proceedings of the 1st Merged International Parallel Processing Symposium and Symposium on Parallel and Distributed Processing, IPPS/SPDP 1998
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 1st Merged International Parallel Processing Symposium and Symposium on Parallel and Distributed Processing, IPPS/SPDP 1998
Y2 - 30 March 1998 through 3 April 1998
ER -