TY - GEN
T1 - Scalable libraries for Fortran 90D/High Performance Fortran
AU - Bozkus, Z.
AU - Choudhary, Alok Nidhi
AU - Fox, G.
AU - Haupt, T.
AU - Ranka, S.
AU - Thakur, R.
AU - Wang, Jhy Chun
N1 - Funding Information:
'This work was supportedin part by DARPA under contract DABT63-91-C-0028. The content of the information does not necessarily reflect the position or the policy of the Government and no official endorsement should be inferred. t Also supported in part by NSF Young Investigator Award CCR-9357804. $To whom all correspondenceshould be sent. Also supported in part by NSF under CCR-9110812
Publisher Copyright:
© 1994 IEEE.
PY - 1993
Y1 - 1993
N2 - High Performance Fortran (HPF) is a new language, based on Fortran 90, developed by HPF Forum. The language was designed to support data parallel programming with top performance on MIMD and SIMD computers with non-uniform memory access costs. The main features of the language include the FORALL construct, new intrinsic functions and data distribution directives. A perusal of HPF shows that most of the parallelism is hidden in the runtime library. Further, efficient parallelization of FORALL construct and array assignment functions on distributed memory machines requires the use of collective communication to access non-local data. This communication could be structured (like shift, broadcast, all-to-all communication) or unstructured. Thus, the scalability of the code generated by the compiler depend on the scalability of these libraries. In this paper, we present the design and performance of an scalable library for the intrinsic functions and the collective communication library.
AB - High Performance Fortran (HPF) is a new language, based on Fortran 90, developed by HPF Forum. The language was designed to support data parallel programming with top performance on MIMD and SIMD computers with non-uniform memory access costs. The main features of the language include the FORALL construct, new intrinsic functions and data distribution directives. A perusal of HPF shows that most of the parallelism is hidden in the runtime library. Further, efficient parallelization of FORALL construct and array assignment functions on distributed memory machines requires the use of collective communication to access non-local data. This communication could be structured (like shift, broadcast, all-to-all communication) or unstructured. Thus, the scalability of the code generated by the compiler depend on the scalability of these libraries. In this paper, we present the design and performance of an scalable library for the intrinsic functions and the collective communication library.
UR - http://www.scopus.com/inward/record.url?scp=85031121680&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85031121680&partnerID=8YFLogxK
U2 - 10.1109/SPLC.1993.365581
DO - 10.1109/SPLC.1993.365581
M3 - Conference contribution
AN - SCOPUS:85031121680
T3 - Proceedings of Scalable Parallel Libraries Conference, SPLC 1993
SP - 67
EP - 76
BT - Proceedings of Scalable Parallel Libraries Conference, SPLC 1993
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 1993 Scalable Parallel Libraries Conference, SPLC 1993
Y2 - 6 October 1993 through 8 October 1993
ER -