An architectural characterization study of data mining and bioinformatics workloads

Berkin Özisikyilmaz*, Ramanathan Narayanan, Joseph Zambreno, Gokhan Memik, Alok Nidhi Choudhary

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

17 Scopus citations

Abstract

Data mining is the process of automatically finding implicit, previously unknown, and potentially useful information from large volumes of data. Recent advances in data extraction techniques have resulted in tremendous increase in the input data size of data mining applications. Data mining systems, on the other hand, have been unable to maintain the same rate of growth. Therefore, there is an increasing need to understand the bottlenecks associated with the execution of these applications in modern architectures. In this paper, we present MineBench, a publicly available benchmark suite containing fifteen representative data mining applications belonging to various categories: classification, clustering, association rule mining and optimization. First, we highlight the uniqueness of data mining applications. Subsequently, we evaluate the MineBench applications on an 8-way shared memory (SMP) machine and analyze important performance characteristics such as L1 and L2 cache miss rates, branch misprediction rates.

Original languageEnglish (US)
Title of host publicationProceedings of the 2006 IEEE International Symposium on Workload Characterization, IISWC - 2006
Pages61-70
Number of pages10
DOIs
StatePublished - Dec 1 2006
EventIEEE International Symposium on Workload Characterization, IISWC-2006 - San Jose, CA, United States
Duration: Oct 25 2006Oct 27 2006

Publication series

NameProceedings of the 2006 IEEE International Symposium on Workload Characterization, IISWC - 2006

Other

OtherIEEE International Symposium on Workload Characterization, IISWC-2006
CountryUnited States
CitySan Jose, CA
Period10/25/0610/27/06

ASJC Scopus subject areas

  • Hardware and Architecture
  • Electrical and Electronic Engineering

Fingerprint Dive into the research topics of 'An architectural characterization study of data mining and bioinformatics workloads'. Together they form a unique fingerprint.

Cite this