TY - JOUR
T1 - A Computational Study of Commonsense Science
T2 - An Exploration in the Automated Analysis of Clinical Interview Data
AU - Sherin, Bruce
PY - 2013/10
Y1 - 2013/10
N2 - A large body of research in the learning sciences has focused on students' commonsense science knowledge-the everyday knowledge of the natural world that is gained outside of formal instruction. Although researchers studying commonsense science have employed a variety of methods, 1-on-1 clinical interviews have played a unique role. The data that result from these interviews take the form of video recordings, which in turn are often compiled into written transcripts and coded by human analysts. In this article, I explore the application of computational techniques to the analysis of this familiar type of data. I describe the success I have had using extremely simple methods from computational linguistics-methods that are based on rudimentary vector space models and simple clustering algorithms. These automated analyses are employed in an exploratory mode as a way to discover student conceptions in the data. The aims of this article are primarily methodological in nature: I attempt to show that it is possible to use techniques from computational linguistics to analyze data from commonsense science interviews in a manner that may provide convergent support for the work of human analysts. As a test bed, I draw on transcripts of a corpus of interviews in which 54 middle school students were asked to explain the seasons.
AB - A large body of research in the learning sciences has focused on students' commonsense science knowledge-the everyday knowledge of the natural world that is gained outside of formal instruction. Although researchers studying commonsense science have employed a variety of methods, 1-on-1 clinical interviews have played a unique role. The data that result from these interviews take the form of video recordings, which in turn are often compiled into written transcripts and coded by human analysts. In this article, I explore the application of computational techniques to the analysis of this familiar type of data. I describe the success I have had using extremely simple methods from computational linguistics-methods that are based on rudimentary vector space models and simple clustering algorithms. These automated analyses are employed in an exploratory mode as a way to discover student conceptions in the data. The aims of this article are primarily methodological in nature: I attempt to show that it is possible to use techniques from computational linguistics to analyze data from commonsense science interviews in a manner that may provide convergent support for the work of human analysts. As a test bed, I draw on transcripts of a corpus of interviews in which 54 middle school students were asked to explain the seasons.
UR - http://www.scopus.com/inward/record.url?scp=84887871909&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84887871909&partnerID=8YFLogxK
U2 - 10.1080/10508406.2013.836654
DO - 10.1080/10508406.2013.836654
M3 - Article
AN - SCOPUS:84887871909
SN - 1050-8406
VL - 22
SP - 600
EP - 638
JO - Journal of the Learning Sciences
JF - Journal of the Learning Sciences
IS - 4
ER -