TY - JOUR
T1 - Correcting an analysis of variance for clustering
AU - Hedges, Larry V.
AU - Rhoads, Christopher H.
N1 - Copyright 2012 Elsevier B.V., All rights reserved.
PY - 2011/2
Y1 - 2011/2
N2 - A great deal of educational and social data arises from cluster sampling designs where clusters involve schools, classrooms, or communities. A mistake that is sometimes encountered in the analysis of such data is to ignore the effect of clustering and analyse the data as if it were based on a simple random sample. This typically leads to an overstatement of the precision of results and too liberal conclusions about precision and statistical significance of mean differences. This paper gives simple corrections to the test statistics that would be computed in an analysis of variance if clustering were (incorrectly) ignored. The corrections are multiplicative factors depending on the total sample size, the cluster size, and the intraclass correlation structure. For example, the corrected F statistic has Fisher's F distribution with reduced degrees of freedom. The corrected statistic reduces to the F statistic computed by ignoring clustering when the intraclass correlations are zero. It reduces to the F statistic computed using cluster means when the intraclass correlations are unity, and it is in between otherwise. A similar adjustment to the usual statistic for testing a linear contrast among group means is described.
UR - http://www.scopus.com/inward/record.url?scp=79751494477&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79751494477&partnerID=8YFLogxK
DO - 10.1111/j.2044-8317.2010.02005.x
M3 - Article
C2 - 21506943
AN - SCOPUS:79751494477
VL - 64
SP - 20
EP - 37
JO - British Journal of Statistical Psychology
JF - British Journal of Statistical Psychology
SN - 0007-1102
IS - 1
ER -