TY - JOUR
T1 - Evidence-based practice for equating health status items
T2 - Sample size and IRT model
AU - Cook, Karon F.
AU - Taylor, Patrick W.
AU - Dodd, Barbara G.
AU - Teal, Cayla R.
AU - McHorney, Colleen A.
PY - 2007
Y1 - 2007
N2 - Background: In the development of health outcome measures, the pool of candidate items may be divided into multiple forms, thus "spreading" response burden over two or more study samples. Item responses collected using this approach result in two or more forms whose scores are not equivalent. Therefore, the item responses must be equated (adjusted) to a common mathematical metric. Objectives: The purpose of this study was to examine the effect of sample size, test size, and selection of item response theory model in equating three forms of a health status measure. Each of the forms was comprised of a set of items unique to it and a set of anchor items common across forms. Research Design: The study was a secondary data analysis of patients' responses to the developmental item pool for the Health of Seniors Survey. A completely crossed design was used with 25 replications per study cell. Results: We found that the quality of equatings was affected greatly by sample size. Its effect was far more substantial than choice of IRT model. Little or no advantage was observed for equatings based on 60 or 72 items versus those based on 48 items. Conclusions: We concluded that samples of less than 300 are clearly unacceptable for equating multiple forms. Additional sample size guidelines are offered based on our results.
AB - Background: In the development of health outcome measures, the pool of candidate items may be divided into multiple forms, thus "spreading" response burden over two or more study samples. Item responses collected using this approach result in two or more forms whose scores are not equivalent. Therefore, the item responses must be equated (adjusted) to a common mathematical metric. Objectives: The purpose of this study was to examine the effect of sample size, test size, and selection of item response theory model in equating three forms of a health status measure. Each of the forms was comprised of a set of items unique to it and a set of anchor items common across forms. Research Design: The study was a secondary data analysis of patients' responses to the developmental item pool for the Health of Seniors Survey. A completely crossed design was used with 25 replications per study cell. Results: We found that the quality of equatings was affected greatly by sample size. Its effect was far more substantial than choice of IRT model. Little or no advantage was observed for equatings based on 60 or 72 items versus those based on 48 items. Conclusions: We concluded that samples of less than 300 are clearly unacceptable for equating multiple forms. Additional sample size guidelines are offered based on our results.
UR - http://www.scopus.com/inward/record.url?scp=34547426858&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34547426858&partnerID=8YFLogxK
M3 - Article
C2 - 17440260
AN - SCOPUS:34547426858
SN - 1529-7713
VL - 8
SP - 175
EP - 189
JO - Journal of applied measurement
JF - Journal of applied measurement
IS - 2
ER -