TY - GEN
T1 - Scientific application acceleration with reconfigurable functional units
AU - Rupnow, Kyle
AU - Underwood, Keith
AU - Compton, Katherine
N1 - Copyright:
Copyright 2019 Elsevier B.V., All rights reserved.
PY - 2007
Y1 - 2007
N2 - While scientific applications in the past were limited by floating point computations, modern scientific applications use more unstructured formulations. These applications have a significant percentage of integer computation - increasingly a limiting factor in scientific application performance. real scientific applications employed at Sandia National Labs, integer computations constitute on average 37% of the application operations, forming large and complex dataflow graphs. Reconfigurable Functional Units (RFUs) are a particularly attractive accelerator for these graphs because they can potentially accelerate many unique graphs with a small amount of additional hardware. In this study, we analyze application traces of Sandia's scientific applications and the SPEC-FP benchmark suite. First we select a set of dataflow graphs to accelerate using the RFU, then we use execution-based simulation to determine the acceleration potential of the applications when using an RFU. On average, a set of 32 or fewer graphs is sufficient to capture the dataflow behavior of 30% of the integer computation, and more than half of Sandia applications show an improvement of 5% or more.
AB - While scientific applications in the past were limited by floating point computations, modern scientific applications use more unstructured formulations. These applications have a significant percentage of integer computation - increasingly a limiting factor in scientific application performance. real scientific applications employed at Sandia National Labs, integer computations constitute on average 37% of the application operations, forming large and complex dataflow graphs. Reconfigurable Functional Units (RFUs) are a particularly attractive accelerator for these graphs because they can potentially accelerate many unique graphs with a small amount of additional hardware. In this study, we analyze application traces of Sandia's scientific applications and the SPEC-FP benchmark suite. First we select a set of dataflow graphs to accelerate using the RFU, then we use execution-based simulation to determine the acceleration potential of the applications when using an RFU. On average, a set of 32 or fewer graphs is sufficient to capture the dataflow behavior of 30% of the integer computation, and more than half of Sandia applications show an improvement of 5% or more.
UR - http://www.scopus.com/inward/record.url?scp=47349086116&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=47349086116&partnerID=8YFLogxK
U2 - 10.1109/FCCM.2007.14
DO - 10.1109/FCCM.2007.14
M3 - Conference contribution
AN - SCOPUS:47349086116
SN - 0769529402
SN - 9780769529400
T3 - Proceedings 2007 IEEE Symposium on Field-Programme Custom Computing Machines, FCCM 2007
SP - 261
EP - 271
BT - Proceedings 2007 IEEE Symposium on Field-Programmable Custom Computing Machines, FCCM 2032
PB - IEEE Computer Society
T2 - 15th Annual IEEE Symposium on Field-Programmable Custom Computing Machines, FCCM 2007
Y2 - 23 April 2007 through 25 April 2007
ER -