TY - GEN
T1 - Analyzing the impact of on-chip network traffic on program phases for CMPs
AU - Zhang, Yu
AU - Ozisikyilmaz, Berkin
AU - Memik, Gokhan
AU - Kim, John
AU - Choudhary, Alok Nidhi
PY - 2009
Y1 - 2009
N2 - It is known that the execution of programs exhibits repetitive phases; in other words, the execution of programs can be partitioned into segments of execution, during which the application exhibits unique architectural properties. This property has been used for various optimization goals. In addition, phase information is utilized to reduce the run time of the architectural simulation. Conventionally, an application is examined in an architecture-independent manner (such as the number of times a basic block is executed) to extract information about the phases and then only the representative execution intervals are executed to analyze architectural choices. We claim that such approaches are becoming inadequate in the many-core era as application execution is not dominated by the instructions only, but instead the communication structure of the application is becoming as important as the instruction behavior. Hence, we propose to utilize communication behavior to determine the phases of an application. Our results reveal that the inclusion of the communication information can increase the accuracy of the phase detection significantly. Specifically, for SPLASH2 and MineBench applications, the average (geometric mean) CPI error rate with the instruction-based phase detection is 11.01%, while our phase detection scheme has an average error rate of 3.41% when compared to the simulations that run the applications to completion.
AB - It is known that the execution of programs exhibits repetitive phases; in other words, the execution of programs can be partitioned into segments of execution, during which the application exhibits unique architectural properties. This property has been used for various optimization goals. In addition, phase information is utilized to reduce the run time of the architectural simulation. Conventionally, an application is examined in an architecture-independent manner (such as the number of times a basic block is executed) to extract information about the phases and then only the representative execution intervals are executed to analyze architectural choices. We claim that such approaches are becoming inadequate in the many-core era as application execution is not dominated by the instructions only, but instead the communication structure of the application is becoming as important as the instruction behavior. Hence, we propose to utilize communication behavior to determine the phases of an application. Our results reveal that the inclusion of the communication information can increase the accuracy of the phase detection significantly. Specifically, for SPLASH2 and MineBench applications, the average (geometric mean) CPI error rate with the instruction-based phase detection is 11.01%, while our phase detection scheme has an average error rate of 3.41% when compared to the simulations that run the applications to completion.
UR - http://www.scopus.com/inward/record.url?scp=70349170773&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=70349170773&partnerID=8YFLogxK
U2 - 10.1109/ISPASS.2009.4919653
DO - 10.1109/ISPASS.2009.4919653
M3 - Conference contribution
AN - SCOPUS:70349170773
SN - 9781424441846
T3 - ISPASS 2009 - International Symposium on Performance Analysis of Systems and Software
SP - 218
EP - 226
BT - ISPASS 2009 - International Symposium on Performance Analysis of Systems and Software
T2 - International Symposium on Performance Analysis of Systems and Software, ISPASS 2009
Y2 - 26 April 2009 through 28 April 2009
ER -