TY - JOUR
T1 - Communication and memory requirements as the basis for mapping task and data parallel programs
AU - Subhlok, Jaspal
AU - O'Hallaron, David R.
AU - Gross, Thomas
AU - Dinda, Peter A.
AU - Webb, Jon
PY - 1994
Y1 - 1994
N2 - For a wide variety of applications, both task and data parallelism must be exploited to achieve the best possible performance on a multicomputer. Recent research has underlined the importance of exploiting task and data parallelism in a single compiler framework, and such a compiler can map a single source program in many different ways onto a parallel machine. The tradeoffs between task and data parallelism are complex and depend on the characteristics of the program to be executed, most significantly the memory and communication requirements, and the performance parameters of the target parallel machine. In this paper, we present a framework to isolate and examine the specific characteristics of programs that determine the performance for different mappings. Our focus is on applications that process a stream of input, and whose computation structure is fairly static and predictable. We describe three such applications that were developed with our compiler: fast Fourier transforms, narrowband tracking radar, and multibaseline stereo. We examine the tradeoffs between various mappings for them and show how the framework is used to obtain efficient mappings.
AB - For a wide variety of applications, both task and data parallelism must be exploited to achieve the best possible performance on a multicomputer. Recent research has underlined the importance of exploiting task and data parallelism in a single compiler framework, and such a compiler can map a single source program in many different ways onto a parallel machine. The tradeoffs between task and data parallelism are complex and depend on the characteristics of the program to be executed, most significantly the memory and communication requirements, and the performance parameters of the target parallel machine. In this paper, we present a framework to isolate and examine the specific characteristics of programs that determine the performance for different mappings. Our focus is on applications that process a stream of input, and whose computation structure is fairly static and predictable. We describe three such applications that were developed with our compiler: fast Fourier transforms, narrowband tracking radar, and multibaseline stereo. We examine the tradeoffs between various mappings for them and show how the framework is used to obtain efficient mappings.
UR - http://www.scopus.com/inward/record.url?scp=0028732783&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0028732783&partnerID=8YFLogxK
U2 - 10.1145/602770.602832
DO - 10.1145/602770.602832
M3 - Conference article
AN - SCOPUS:0028732783
SN - 1063-9535
SP - 330
EP - 339
JO - Proceedings of the ACM/IEEE Supercomputing Conference
JF - Proceedings of the ACM/IEEE Supercomputing Conference
T2 - Proceedings of the 1994 Supercomputing Conference
Y2 - 14 November 1994 through 18 November 1994
ER -