TY - GEN
T1 - Automatic,Weight Learning for Multiple Data Sources when Learning from Demonstration
AU - Argall, Brenna D.
AU - Browning, Brett
AU - Veloso, Manuela
PY - 2009
Y1 - 2009
N2 - Traditional approaches to programming robots are generally inaccessible to non-robotics-experts. A promising exception is the Learning from Demonstration paradigm. Here a policy mapping world observations to action selection is learned, by generalizing from task demonstrations by a teacher. Most Learning from Demonstration work to date considers data from a single teacher. In this paper, we consider the incorporation of demonstrations from multiple teachers. In particular, we contribute an algorithm that handles multiple data sources, and additionally reasons about reliability differences between them. For example, multiple teachers could be inequally proficient at performing the demonstrated task. We introduce Demonstration Weight Learning (DWL) as a Learning from Demonstration algorithm that explicitly represents multiple data sources and learns to select between them, based on their observed reliability and according to an adaptive expert learning inspired approach. We present a first implementation of DWL within a simulated robot domain. Data sources are shown to differ in reliability, and weighting is found impact task execution success. Furthermore, DWL is shown to produce appropriate data source weights that improve policy performance.
AB - Traditional approaches to programming robots are generally inaccessible to non-robotics-experts. A promising exception is the Learning from Demonstration paradigm. Here a policy mapping world observations to action selection is learned, by generalizing from task demonstrations by a teacher. Most Learning from Demonstration work to date considers data from a single teacher. In this paper, we consider the incorporation of demonstrations from multiple teachers. In particular, we contribute an algorithm that handles multiple data sources, and additionally reasons about reliability differences between them. For example, multiple teachers could be inequally proficient at performing the demonstrated task. We introduce Demonstration Weight Learning (DWL) as a Learning from Demonstration algorithm that explicitly represents multiple data sources and learns to select between them, based on their observed reliability and according to an adaptive expert learning inspired approach. We present a first implementation of DWL within a simulated robot domain. Data sources are shown to differ in reliability, and weighting is found impact task execution success. Furthermore, DWL is shown to produce appropriate data source weights that improve policy performance.
UR - http://www.scopus.com/inward/record.url?scp=70350418434&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=70350418434&partnerID=8YFLogxK
U2 - 10.1109/ROBOT.2009.5152668
DO - 10.1109/ROBOT.2009.5152668
M3 - Conference contribution
AN - SCOPUS:70350418434
SN - 9781424427895
T3 - Proceedings - IEEE International Conference on Robotics and Automation
SP - 226
EP - 231
BT - 2009 IEEE International Conference on Robotics and Automation, ICRA '09
T2 - 2009 IEEE International Conference on Robotics and Automation, ICRA '09
Y2 - 12 May 2009 through 17 May 2009
ER -