TY - JOUR
T1 - Integrating scheduling and physical design into a coherent compilation cycle for reconfigurable computing architectures
AU - Bazargan, Kia
AU - Ogrenci, Seda
AU - Sarrafzadeh, Majid
PY - 2001
Y1 - 2001
N2 - Advances in the FPGA technology, both in terms of device capacity and architecture, have resulted in introduction of reconfigurable computing machines, where the hardware adapts itself to the running application to gain speedup. To keep up with the ever-growing performance expectations of such systems, designers need new methodologies and tools for developing reconfigurable computing systems (RCS). This paper addresses the need for fast compilation and physical design phase to be used in application development/debugging/testing cycle for RCS. We present a high-level synthesis approach that is integrated with placement, making the compilation cycle much faster. On the average, our tool generates the VHDL code (and the corresponding placement information) from the data flow graph of a program in less than a minute. By compromising 30% in the clock frequency of the circuit, we can achieve about 10 times speedup in the Xilinx placement phase, and 2.5 times overall speedup in the Xilinx place-and-route p hase, a reasonable trade-off when developing RCS applications.
AB - Advances in the FPGA technology, both in terms of device capacity and architecture, have resulted in introduction of reconfigurable computing machines, where the hardware adapts itself to the running application to gain speedup. To keep up with the ever-growing performance expectations of such systems, designers need new methodologies and tools for developing reconfigurable computing systems (RCS). This paper addresses the need for fast compilation and physical design phase to be used in application development/debugging/testing cycle for RCS. We present a high-level synthesis approach that is integrated with placement, making the compilation cycle much faster. On the average, our tool generates the VHDL code (and the corresponding placement information) from the data flow graph of a program in less than a minute. By compromising 30% in the clock frequency of the circuit, we can achieve about 10 times speedup in the Xilinx placement phase, and 2.5 times overall speedup in the Xilinx place-and-route p hase, a reasonable trade-off when developing RCS applications.
UR - http://www.scopus.com/inward/record.url?scp=0034848108&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0034848108&partnerID=8YFLogxK
U2 - 10.1109/DAC.2001.156216
DO - 10.1109/DAC.2001.156216
M3 - Article
AN - SCOPUS:0034848108
SN - 0738-100X
SP - 635
EP - 640
JO - Proceedings - Design Automation Conference
JF - Proceedings - Design Automation Conference
ER -