Watch and learn: How the brain gains knowledge through observation

Summary: The brain chooses between two neural systems responsible for emulation learning and imitation learning. Study reveals how the brain chooses which strategy to employ when faced with an observational learning task.

Source: CalTech

Humans have a number of ways to learn how to do new things. One of those ways is through observation: watching another person perform a task, and then doing what they did. Think of a child that learns how to “adult” by observing their parents as they pay for groceries or make a phone call.

It has long been theorized that there are two types of observational learning: imitation and emulation. Imitation is when one person copies another person’s behaviors to achieve the same goal. For example, if you watch the numbers a person dials to open a safe so that you, too, can open it. Emulation, on the other hand, occurs when someone watches another person achieve a goal, infers their goals, and then works to achieve those same goals without copying the other person’s actions. In this case, you might watch the person open the safe, see there are valuables inside, and then later cut it open with a saw.

A new study conducted at Caltech has showed how the brain chooses between the two neural systems responsible for each of these kinds of learning. The study, which appears in the journal Neuron, reveals for the first time how the brain chooses which strategy to employ when faced with an observational learning task.

The research was led by Caroline Charpentier, a postdoctoral scholar in neuroscience who works in the lab of John O’Doherty, professor of psychology in the Division of the Humanities and Social Sciences.

“Depending on the context, sometimes imitation works best, and sometimes emulation is more reliable. Here we wanted to show whether and how the brain can keep track of both strategies in parallel and adaptively pick the best strategy in a given context,” Charpentier says.

In the study, participants were placed in a functional magnetic resonance imaging (fMRI) machine so their brain activity could be monitored while they performed an observational learning task. Once in the machine, they were presented with virtual slot machines that would dispense three colors of tokens: red, green, or blue. Only one of those token colors had monetary value at a given time, but the participants were not told which color that was. The only information they were provided directly was the probability that a particular slot machine would dispense a token of a given color.

In most of the trials, the participants were asked to observe another person play the slot machines, and were told that this person had full knowledge of which color was valuable. By watching the other person pick which slot machine to play, they could gain information that would help improve their chances of receiving a valuable token when it was their turn to play.

However, because it was important for the researchers to discern which observational learning strategy the participants were using when they played the slot machine after watching the other people take a turn, the researchers created two different trial scenarios. In one scenario, the participant was permitted to play the same slot machine as the person they had been observing. Since they played the same machine, the participant could mimic the behavior of the other person and thus engaged in imitative observational learning. In the other scenario, the participant could not play the same machine, which prevented them from simply imitating the other player’s actions and forced them to use an emulation learning strategy.

Charpentier says the fMRI data showed that each of these strategies correlated with activity in specific parts of the brain.

This is a diagram from the study
When emulation of another person’s goals becomes too difficult, the brain resorts to imitative learning. The image is credited to CalTech.

“Imitation tends to rely on regions that we refer to as the mirror system of the brain, which is active both when someone performs an action, such as grabbing an object off the table, or when they watch someone else perform that same action,” she says. “The emulation strategy mapped more to the mentalizing network, which is used for inferring another person’s thoughts and goals, or basically putting yourself in someone else’s shoes and trying to think what they would think.”

Charpentier adds that once the research team had completed the participants’ brain scans, they were able to build a mathematical model of how participants learn from the observed player and chooses between the slot machines. The model suggests that the decision of which strategy to employ is determined by how reliable the emulation strategy is, and results show evidence for this “reliability of emulation” signal in several brain areas.

“If emulation is reliable, these regions are more active and emulation is more likely to be used, while if emulation is not reliable or too uncertain these regions are less active and we prefer imitation” she says. “In other words, our behavior is a mix of both strategies and the brain can weigh in on which one is best at any point in time.”

Charpentier says that, going forward, she would like to investigate the interactions of the regions of the brain involved in observational learning, or their so-called functional connectivity. In addition, she would like to see if the brain follows a similar model for choosing between other types of learning: if, for example, a person has to choose between learning from experience or learning from observation.

Charpentier’s co-authors include Kiyohito Iiyaga and John O’Doherty.

Funding: Funding for the research was provided by the National Institute of Mental Health (Caltech Conte Center for the Neurobiology of Social Decision-Making).

About this neuroscience research article

Source:
CalTech
Media Contacts:
Press Office – CalTech
Image Source:
The image is credited to CalTech.

Original Research: Closed access
“”Neuro-computational account of arbitration between choice imitation and goal emulation during human observational learning”. Caroline J. Charpentier, Kiyohito Iigaya, John P. O’Doherty.
Science doi:10.1016/j.neuron.2020.02.028.

Abstract

“Neuro-computational account of arbitration between choice imitation and goal emulation during human observational learning

Highlights
• Replicable evidence of arbitration between choice imitation and goal emulation
• Control over behavior is adaptively weighted toward the most reliable strategy
• Distinct brain networks implement each strategy’s learning signals in parallel
• Arbitration is driven by variations in emulation reliability in rvlPFC, ACC, and TPJ

Summary
When individuals learn from observing the behavior of others, they deploy at least two distinct strategies. Choice imitation involves repeating other agents’ previous actions, whereas emulation proceeds from inferring their goals and intentions. Despite the prevalence of observational learning in humans and other social animals, a fundamental question remains unaddressed: how does the brain decide which strategy to use in a given situation? In two fMRI studies (the second a pre-registered replication of the first), we identify a neuro-computational mechanism underlying arbitration between choice imitation and goal emulation. Computational modeling, combined with a behavioral task that dissociated the two strategies, revealed that control over behavior was adaptively and dynamically weighted toward the most reliable strategy. Emulation reliability, the model’s arbitration signal, was represented in the ventrolateral prefrontal cortex, temporoparietal junction, and rostral cingulate cortex. Our replicated findings illuminate the computations by which the brain decides to imitate or emulate others.

Feel Free To Share This Learning and Neuroscience News.
Join our Newsletter
I agree to have my personal information transferred to AWeber for Neuroscience Newsletter ( more information )
Sign up to receive our recent neuroscience headlines and summaries sent to your email once a day, totally free.
We hate spam and only use your email to contact you about newsletters. You can cancel your subscription any time.