TY - JOUR
T1 - Infant polysomnography
T2 - Reliability and validity of infant arousal assessment
AU - Crowell, David H.
AU - Kulp, Thomas D.
AU - Kapuniai, Linda E.
AU - Hunt, Carl E.
AU - Brooks, Lee J.
AU - Weese-Mayer, Debra E.
AU - Silvestri, Jean
AU - Ward, Sally Davidson
AU - Corwin, Michael
AU - Tinsley, Larry
AU - Peucker, Mark
N1 - Copyright:
Copyright 2017 Elsevier B.V., All rights reserved.
PY - 2002
Y1 - 2002
N2 - Infant arousal scoring based on the Atlas Task Force definition of transient EEG arousal was evaluated to determine (1) whether transient arousals can be identified and assessed reliably in infants and (2) whether arousal and no-arousal epochs scored previously by trained raters can be validated reliably by independent sleep experts. Phase I for inter- and intrarater reliability scoring was based on two datasets of sleep epochs selected randomly from noctumal polysomnograms of healthy full-term, preterm, idiopathic apparent life-threatening event cases, and siblings of Sudden Infant Death Syndrome infants of 35 to 64 weeks postconceptional age. After training, test set 1 reliability was assessed and discrepancies identified. After retraining, test set 2 was scored by the same raters to determine interrater reliability. Later, three raters from the trained group rescored test set 2 to assess inter- and intrarater reliabilities. Interrater and intrarater reliability κ'S, with 95% confidence intervals, ranged from substantial to almost perfect levels of agreement. Interrater reliabilities for spontaneous arousals were initially moderate and then substantial. During the validation phase, 315 previously scored epochs were presented to four sleep experts to rate as containing arousal or no-arousal events. Interrater expert agreements were diverse and considered as noninterpretable. Concordance in sleep experts' agreements, based on identification of the previously sampled arousal and no-arousal epochs, was used as a secondary evaluative technique. Results showed agreement by two or more experts on 86% of the Collaborative Home Infant Monitoring Evaluation Study arousal scored events. Conversely, only 1% of the Collaborative Home Infant Monitoring Evaluation Study-scored no-arousal epochs were rated as an arousal. In summary, this study presents an empirically tested model with procedures and criteria for attaining improved reliability in transient EEG arousal assessments in infants using the modified Atlas Task Force standards. With training based on specific criteria, substantial inter- and intrarater agreement in identifying infant arousals was demonstrated. Corroborative validation results were too disparate for meaningful interpretation. Alternate evaluation based on concordance agreements supports reliance on infant EEG criteria for assessment. Results mandate additional confirmatory validation studies with specific training on infant EEG arousal assessment criteria.
AB - Infant arousal scoring based on the Atlas Task Force definition of transient EEG arousal was evaluated to determine (1) whether transient arousals can be identified and assessed reliably in infants and (2) whether arousal and no-arousal epochs scored previously by trained raters can be validated reliably by independent sleep experts. Phase I for inter- and intrarater reliability scoring was based on two datasets of sleep epochs selected randomly from noctumal polysomnograms of healthy full-term, preterm, idiopathic apparent life-threatening event cases, and siblings of Sudden Infant Death Syndrome infants of 35 to 64 weeks postconceptional age. After training, test set 1 reliability was assessed and discrepancies identified. After retraining, test set 2 was scored by the same raters to determine interrater reliability. Later, three raters from the trained group rescored test set 2 to assess inter- and intrarater reliabilities. Interrater and intrarater reliability κ'S, with 95% confidence intervals, ranged from substantial to almost perfect levels of agreement. Interrater reliabilities for spontaneous arousals were initially moderate and then substantial. During the validation phase, 315 previously scored epochs were presented to four sleep experts to rate as containing arousal or no-arousal events. Interrater expert agreements were diverse and considered as noninterpretable. Concordance in sleep experts' agreements, based on identification of the previously sampled arousal and no-arousal epochs, was used as a secondary evaluative technique. Results showed agreement by two or more experts on 86% of the Collaborative Home Infant Monitoring Evaluation Study arousal scored events. Conversely, only 1% of the Collaborative Home Infant Monitoring Evaluation Study-scored no-arousal epochs were rated as an arousal. In summary, this study presents an empirically tested model with procedures and criteria for attaining improved reliability in transient EEG arousal assessments in infants using the modified Atlas Task Force standards. With training based on specific criteria, substantial inter- and intrarater agreement in identifying infant arousals was demonstrated. Corroborative validation results were too disparate for meaningful interpretation. Alternate evaluation based on concordance agreements supports reliance on infant EEG criteria for assessment. Results mandate additional confirmatory validation studies with specific training on infant EEG arousal assessment criteria.
KW - Infant
KW - Polysomnography
KW - Reliability
KW - Transient EEG arousal
KW - Validity
UR - http://www.scopus.com/inward/record.url?scp=0036450970&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0036450970&partnerID=8YFLogxK
U2 - 10.1097/00004691-200210000-00010
DO - 10.1097/00004691-200210000-00010
M3 - Article
C2 - 12477992
AN - SCOPUS:0036450970
SN - 0736-0258
VL - 19
SP - 469
EP - 483
JO - Journal of Clinical Neurophysiology
JF - Journal of Clinical Neurophysiology
IS - 5
ER -