Research Statement

Guy Hoffman

 

Physical practice and motor learning are crucial capacities for many activities, yet only partially understood in humans, and mostly ignored in Artificial Intelligence and robotics research. We use practice to achieve our most important and impressive behaviors, from walking and handling objects; through collaboration and communication; to artistic and athletic performance. Decidedly disparate from traditional notions of learning, which are mostly abstract and information-driven, practiced skills are unique insofar as they cannot be achieved by observation alone, and are inherently first-person and embodied. One cannot learn to ski in the classroom, and dance ensembles cannot perfect their fluency through written correspondence.

My research goal is to develop a computational cognitive model of practice and implement it on robotic and other artificial agents. This could serve to inform psychological accounts of practice, explaining behavioral and neurological findings. It will also result in the design of agents that (i) demonstrate a higher level of efficiency and quality when repeatedly performing a task, and (ii) perform at a higher level of adaptation and coordination when performing a task with humans.

Achieving this goal could have a significant impact on the fields of psychology, robotics, human-machine interfaces, education, medicine, entertainment, athletics, and beyond. By modeling the way motor skills are acquired, we uncover a key mechanism of human development and adult behavior. We may improve the methods for learning new skills; speed up recovery from injuries; find new ways to embellish athletic and artistic performance; and reach a deeper understanding of the brain-body interface. By building agents that not only learn, but practice their skills, we will be able to design systems that tailor their actions to the task at hand, and fit the particular behavior and rhythm of their human collaborators. This will allow us to achieve a more natural human-machine interaction, one that displays the kind of efficient and satisfying performance that humans are accustomed to from each other. It will also serve to promote the yet unachieved goal of long-term human-robot collaboration in our daily environments—be it homes, offices, schools, hospitals, or workshops.

Methodology

My research program rests on four principles (see Figure): (a) the study of human practice; (b) the computational modeling of physical practice; (c) the design of robotic and virtual agents which implement these computational models, and (d) the evaluation of these agents through benchmarks and human-subject studies.

As I have done in the past, this methodology relies on and encourages an interdisciplinary collaboration with researchers in related fields, such as cognitive and behavioral psychology.

Past Research

My past research lays the foundations for the proposed endeavor. During my early doctoral work at the MIT Media Lab, I have explored the mechanisms underlying human-human and human-robot collaboration, including non-verbal behavior, turn-taking, gaze, and the grounding of shared attention.[1] I have challenged existing notions of machine learning through human-subject studies in a collaborative setting,[2] implemented a working cognitive architecture on a humanoid robot displaying socially appropriate behavior,[3] and with my colleagues have conducted human subject experiments measuring the effect of robotic nonverbal behavior on untrained users.[4]

My focus then shifted to investigation of fluency and the role of anticipation in human-robot teamwork. I have designed anticipatory action paradigms and tested them in human-subject studies within a simulated Markov Decision Process.[5] Later, I formulated a perceptual-symbol-based cognitive architecture for fluent human-robot interaction. This system includes onsets of many of my larger research goals: it enables a robot to learn from repetition, to anticipate human activities, and thus to increasingly coordinate its actions with those of a human partner.[6] I have implemented and tested this paradigm on two robots, including a robotic desk lamp, which I designed and constructed for these experiments. My current research at the Georgia Institute of Technology further explores these notions in the context of a human-robot musical performance, designing a robot that practices with a human musician to achieve rehearsed behavior.

Embodied Cognition and Anticipation

Two principles emerge from my investigation of joint practice: one is that modeling the attainment of a physical skill relies on an embodied view of cognition. Most current attempts to model intelligence, and in particular artificial intelligence, are constrained to notions of abstract symbolic processing, in which data flows bottom-up from the sensory to the decision-making and motor systems. Instead, I have been exploring an embodied approach, viewing concept recall and decision-making as tightly integrated with perception and exploring the bidirectional (top-down and bottom-up) relationship between sensing, learning, and acting.

Second, physical rehearsal is closely tied to the notion of anticipation. Practice results in perceptual priming to anticipated sensory input, leading to faster and more fluent action and interaction. I have shown initial results demonstrating how anticipation can be used in a robot engaged in team practice to enhance both the efficiency and the fluency of the interaction.6

Research Program

Cognitive Modeling    Looking ahead, I wish to continue exploring computational models of motor learning. I am specifically interested in models which relate high-level cognition to perceptual and motor experiences, and the transfer of capabilities from "conscious" to "automatic" processing. Moreover, I will investigate anticipation, its role in perceptual processing, and the relationship between physical first-person practice and the construction of anticipatory cues. Within this framework, the learned connectivity between perception and action is of major interest. In the context of joint practice, I will explore the relation between embodiment, anticipation, and nonverbal behavior.

Benchmarks    There is a need for benchmarks for practiced behaviors. In the past, I have proposed three benchmarks for fluent joint activity. I would like to contribute to the development of a battery of metrics in this nascent research area, hopefully enabling a comparative discussion of cognitive modeling approaches for practice and joint physical activities.

Human-robot Interaction    Models of physical practice are appropriate to be validated through research in robotics and human-robot interaction. I plan to demonstrate my models on physical robots. These robotic agents need not be complex humanoids. By focusing on behavior and motion, even simple non-anthropomorphic robots can be used to evaluate coordinated behavior. I hope to build agents that can practice on their own and jointly with humans. This could further our capacity to build robots that work efficiently and fluently, and could also serve as applied support for recently emerging psychological models of embodied cognition and anticipation.

Comparative Studies    It makes sense to compare the performance of these models to both traditional machine learning approaches, and to real human practice. This calls for comparative theoretical analysis, and comparative behavioral study of the above-mentioned implementations, using the newly construed benchmarks. Within this context, human-subject studies are invaluable, and I regard them as a central part of my proposed research agenda.

Long-term adaptation    Practiced skills also affect the capacity of robotic and virtual agents to adapt to humans.  In the future, we may be able to design systems that could—through practice—learn new tasks with little initial information, and fit the particular behavior and rhythm of their human counterparts. We want to design robots and virtual agents that can play a long-term role in our daily lives. Whether these are robotic nurses for elderly living in their homes, or navigation systems in our cars, adaptively practicing systems could alter the way we interact with personal technology.

Application to Human Practice     Finally, if we are able to successfully model physical practice and implement these models in artificial agents, we could apply our understandings to the embellishment of human practice. A long-term goal of this research is the reverse transfer of our findings in order to improve human physical performance.

 

Summary

We are at a highly opportune moment to explore the notion of physical practice through robotic agents: robot technologies have reached a maturity and commercial availability to enable rapid prototyping and extensive laboratory and field research; and a growing array of techniques and findings in cognitive psychology shed new light on the way humans perfect their performance. I hope to work at this crossroads to make a meaningful contribution to this endeavor.

 



[1] Breazeal, Hoffman, and Lockerd (2004), Teaching and Working with Robots as a Collaboration.

[2] Thomaz, Hoffman, and Breazeal (2006), Reinforcement Learning with Human Teachers.

[3] Hoffman and Breazeal (2004), Collaboration in Human-Robot Teams.

[4] Breazeal et al. (2005), Effects of Nonverbal Communication on Efficiency and Robustness in Human-robot Teamwork.

[5] Hoffman and Breazeal (2007), Effects of Anticipatory Action on Human-robot Teamwork.

[6] Hoffman and Breazeal (2008), Anticipatory Perceptual Simulation for Human-Robot Joint Practice: Theory and Application Study.