ZOID: I/O-forwarding infrastructure for petascale architectures

Kamil Iskra*, John W. Romein, Kazutomo Yoshii, Pete Beckman

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

85 Scopus citations

Abstract

The ZeptoOS project is developing an open-source alternative to the proprietary software stacks available on contemporary massively parallel architectures. The aim is to enable computer science research on these architectures, enhance community collaboration, and foster innovation. In this paper, we introduce a component of ZeptoOS called ZOID-an I/O-forwarding infrastructure for architectures such as IBM Blue Gene that decouple file and socket I/O from the compute nodes, shipping those functions to dedicated I/O nodes. Through the use of optimized network protocols and data paths, as well as a multithreaded daemon running on I/O nodes, ZOID provides greater performance than does the stock infrastructure. We present a set of benchmark results that highlight the improvements. Crucially, the flexibility of our infrastructure is a vast improvement over the stock infrastructure, allowing users to forward data using custom-designed application interfaces, through an easy-to-use plug-in mechanism. This capability is used for realtime telescope data transfers, extensively discussed in the paper. Plug-in-speciflc threads implement prefetching of data obtained over sockets from an input cluster and merge results from individual compute nodes before sending them out, significantly reducing required network bandwidth. This approach allows a ZOID version of the application to handle a larger number of subbands per I/O node, or even to bypass the input cluster altogether, plugging the input from remote receiver stations directly into the I/O nodes. Using the resources more efficiently can result in considerable savings.

Original languageEnglish (US)
Title of host publicationPPoPP'08 - Proceedings of the 2008 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
Pages153-161
Number of pages9
StatePublished - 2008
Event13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP'08 - Salt Lake City, UT, United States
Duration: Feb 20 2008Feb 23 2008

Publication series

NameProceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP

Conference

Conference13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP'08
Country/TerritoryUnited States
CitySalt Lake City, UT
Period2/20/082/23/08

Keywords

  • I/O forwarding
  • Petascale

ASJC Scopus subject areas

  • Software

Fingerprint

Dive into the research topics of 'ZOID: I/O-forwarding infrastructure for petascale architectures'. Together they form a unique fingerprint.

Cite this