TY - JOUR
T1 - Hypsarrhythmia assessment exhibits poor interrater reliability
T2 - A threat to clinical trial validity
AU - Hussain, Shaun A.
AU - Kwong, Grace
AU - Millichap, John J.
AU - Mytinger, John R.
AU - Ryan, Nicole
AU - Matsumoto, Joyce H.
AU - Wu, Joyce Y.
AU - Lerner, Jason T.
AU - Sankar, Raman
N1 - Publisher Copyright: © Wiley Periodicals, Inc. © 2014 International League Against Epilepsy.
PY - 2015/1/1
Y1 - 2015/1/1
AB - Objective: Hypsarrhythmia is the classic interictal electroencephalographic pattern associated with infantile spasms, and characterized by high voltage, disorganization, and multifocal independent epileptiform discharges. Given this seemingly simple definition, one might expect excellent interrater reliability (IRR) in the identification of this pattern. Alternatively, it may be argued that assessments of voltage and disorganization are fairly subjective, and thus quite challenging in borderline cases. We sought to test the IRR of hypsarrhythmia assessment in a systematic fashion. Methods: Six blinded pediatric electroencephalographers from four centers reviewed 22 electroencephalography (EEG) samples from patients with infantile spasms. Each sample was 5 min in duration and included only wakefulness. Raters determined if each EEG was abnormal and if hypsarrhythmia was present/absent, and characterized relevant features: voltage, organization, epileptiform discharges, slowing, interictal attenuations, symmetry, and synchrony. In addition, raters indicated their level of confidence for each assessment. Multirater kappa statistics (κ) were calculated for the assessment of hypsarrhythmia and each feature. Results: Although IRR was favorable in determining whether a study was normal or abnormal (κ = 0.89), reliability was unfavorable for assessment of hypsarrhythmia (κ = 0.40), modified hypsarrhythmia (κ = 0.47), high voltage (κ = 0.37), disorganization (κ = 0.22), multifocal epileptiform discharges (κ = 0.68), interictal voltage attenuations (κ = 0.21), slowing (κ = 0.20), asymmetry (κ = 0.26), and asynchrony (κ = 0.08). Despite generally unsatisfactory interrater agreement, raters consistently reported high confidence in assessments. Significance: This study contradicts the view that hypsarrhythmia assessment is straightforward. Even small variability in the identification of hypsarrhythmia has potentially deleterious consequences for clinical care, as its presence or absence impacts decisions to pursue high-risk and high-cost therapies. These inconsistencies may similarly confound studies in which abolition of hypsarrhythmia is an outcome measure. There is a great need for practical, reliable, and unbiased measures of hypsarrhythmia.
KW - Electroencephalography
KW - Hypsarrhythmia
KW - Infantile spasms
KW - Interrater reliability
KW - West syndrome
UR - http://www.scopus.com/inward/record.url?scp=84921653858&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84921653858&partnerID=8YFLogxK
U2 - 10.1111/epi.12861
DO - 10.1111/epi.12861
M3 - Article
C2 - 25385396
AN - SCOPUS:84921653858
SN - 0013-9580
VL - 56
SP - 77
EP - 81
JO - Epilepsia
JF - Epilepsia
IS - 1
ER -