Application-level error measurements for network processors

Arindam Mallik, Matthew C. Wildrick, Gokhan Memik*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Faults in computer systems can occur due to a variety of reasons. These include internal effects such as coupling and external effects such as alpha particles. As we move towards smaller manufacturing technologies, the probability of errors for a single transistor is likely to increase. Even if this probability remains the same, the probability of a fault in a processor will increase linearly with the boost in the number of transistors per chip. In many systems, an error has a binary effect, i.e., the output is either correct or erroneous. However, networking systems exhibit different properties. For example, although a portion of the code behaves incorrectly due to a fault, the application can still work correctly. Therefore, measuring the effects of faults on the network processor applications require new measurement metrics to be developed. Particularly, hardware faults need to be measured in the context of their effect on the application behavior. In this paper, we highlight essential application properties and data structures that can be used to measure the error behavior of network processors. Using these metrics, we study the error behavior of seven representative networking applications under different cache access fault probabilities. With this study, we hope to bridge the gap between the circuit-level phenomena and their impact on the application behavior.

Original languageEnglish (US)
Pages (from-to)1870-1877
Number of pages8
JournalIEICE Transactions on Information and Systems
VolumeE88-D
Issue number8
DOIs
StatePublished - Aug 2005

Keywords

  • Fault tolerance
  • Network processors

ASJC Scopus subject areas

  • Software
  • Hardware and Architecture
  • Computer Vision and Pattern Recognition
  • Electrical and Electronic Engineering
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Application-level error measurements for network processors'. Together they form a unique fingerprint.

Cite this