TY - GEN
T1 - An architectural characterization study of data mining and bioinformatics workloads
AU - Özisikyilmaz, Berkin
AU - Narayanan, Ramanathan
AU - Zambreno, Joseph
AU - Memik, Gokhan
AU - Choudhary, Alok Nidhi
PY - 2006
Y1 - 2006
N2 - Data mining is the process of automatically finding implicit, previously unknown, and potentially useful information from large volumes of data. Recent advances in data extraction techniques have resulted in tremendous increase in the input data size of data mining applications. Data mining systems, on the other hand, have been unable to maintain the same rate of growth. Therefore, there is an increasing need to understand the bottlenecks associated with the execution of these applications in modern architectures. In this paper, we present MineBench, a publicly available benchmark suite containing fifteen representative data mining applications belonging to various categories: classification, clustering, association rule mining and optimization. First, we highlight the uniqueness of data mining applications. Subsequently, we evaluate the MineBench applications on an 8-way shared memory (SMP) machine and analyze important performance characteristics such as L1 and L2 cache miss rates, branch misprediction rates.
AB - Data mining is the process of automatically finding implicit, previously unknown, and potentially useful information from large volumes of data. Recent advances in data extraction techniques have resulted in tremendous increase in the input data size of data mining applications. Data mining systems, on the other hand, have been unable to maintain the same rate of growth. Therefore, there is an increasing need to understand the bottlenecks associated with the execution of these applications in modern architectures. In this paper, we present MineBench, a publicly available benchmark suite containing fifteen representative data mining applications belonging to various categories: classification, clustering, association rule mining and optimization. First, we highlight the uniqueness of data mining applications. Subsequently, we evaluate the MineBench applications on an 8-way shared memory (SMP) machine and analyze important performance characteristics such as L1 and L2 cache miss rates, branch misprediction rates.
UR - http://www.scopus.com/inward/record.url?scp=48449087685&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=48449087685&partnerID=8YFLogxK
U2 - 10.1109/IISWC.2006.302730
DO - 10.1109/IISWC.2006.302730
M3 - Conference contribution
AN - SCOPUS:48449087685
SN - 1424405084
SN - 9781424405084
T3 - Proceedings of the 2006 IEEE International Symposium on Workload Characterization, IISWC - 2006
SP - 61
EP - 70
BT - Proceedings of the 2006 IEEE International Symposium on Workload Characterization, IISWC - 2006
T2 - IEEE International Symposium on Workload Characterization, IISWC-2006
Y2 - 25 October 2006 through 27 October 2006
ER -