Abstract
This study uses within-study comparisons to assess the relative importance of covariate choice, unreliability in the measurement of these covariates, and whether regression or various forms of propensity score analysis are used to analyze the outcome data. Two of the within-study comparisons are of the four-arm type, and many more are of the three-arm type. To examine unreliability, simulations of differences in reliability are deliberately introduced into the 2 four-arm studies. Results are similar across the samples of studies reviewed with their wide range of non-experimental designs and topic areas. Covariate choice counts most, unreliability next most, and the mode of data analysis hardly matters at all. Unreliability has larger effects the more important a covariate is for bias reduction, but even so the very best covariates measured with a reliability of only. 60 still do better than substantively poor covariates that are measured perfectly. Why regression methods do as well as propensity score methods used in several different ways is a mystery still because, in theory, propensity scores would seem to have a distinct advantage in many practical applications, especially those where functional forms are in doubt.
Original language | English (US) |
---|---|
Pages (from-to) | 828-847 |
Number of pages | 20 |
Journal | Multivariate Behavioral Research |
Volume | 44 |
Issue number | 6 |
DOIs | |
State | Published - 2009 |
Funding
Thomas D. Cook and Peter M. Steiner were supported in part by Grant R305U070003 from the Institute for Educational Sciences, U.S. Department of Education. Peter M. Steiner was also supported by grants from the W. T. Grant Foundation and Spencer Foundation. Thanks to David Kenny for a critical and helpful reading.
ASJC Scopus subject areas
- Experimental and Cognitive Psychology
- Arts and Humanities (miscellaneous)
- Statistics and Probability