A new study by Brown University researchers shows that two different brain systems work cooperatively as people learn.
The study, published in Proceedings of the National Academy of Sciences, focused on the interplay of two very different modes of learning a new task: reinforcement learning and working memory. Reinforcement learning is an "under-the-hood" process in which people gradually learn which actions to take by processing rewards and punishments at the neural level, and then choosing the one that works best on average -- even if the person is not aware of it. In contrast, working memory involves keeping previous actions and their outcomes in mind to more rapidly and flexibly improve performance.
"People have largely interpreted these systems as working independently or as competing with each other in the learning process," said Michael Frank, a professor in Brown's Department of Cognitive, Linguistic and Psychological Sciences and co-author of the paper. "But we show that the two work together, with neural signals underlying working memory helping to guide those that support reinforcement learning."
Anne Collins, an assistant professor at the University of California, Berkeley, led the work when she was a postdoctoral researcher working with Frank, who directs the Initiative for Computation in Brain and Mind in the Brown Institute for Brain Science. Collins and Frank developed an experimental method designed to isolate the brain signals associated with each of the two systems.
For the study, 40 participants were shown a series of symbols on a screen and asked, for each symbol, to press a particular button on a keyboard. They weren't told which key was the right one for each symbol; they had to learn it. When they got it right, they were rewarded with points. Over repeated trials, the participants came to learn which keys corresponded with which symbols.
To distinguish the contributions of reinforcement learning and working memory, the researchers set up problems with different numbers of symbols, ranging from two to six, and participants had to learn which button to press for each of them. Generally, people can hold only three or four items in working memory at a time, and only for short periods. So when the number of symbols increases, or the delay between encounters with the same symbol grows, the contribution of working memory to the learning process should diminish.
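The capacity argument can be made concrete with a toy calculation. The sketch below is illustrative only and is not taken from the study: the function name `wm_share`, the capacity of three items, and the simple ratio rule are all assumptions chosen to show how working memory's possible contribution shrinks as the number of symbols grows.

```python
def wm_share(set_size, capacity=3):
    """Hypothetical fraction of symbols that working memory can cover.

    Assumes a fixed capacity (3 items here, an illustrative value) and
    that coverage is simply capacity divided by set size, capped at 1.
    """
    return min(1.0, capacity / set_size)


# Set sizes used in the experiment ranged from two to six symbols.
shares = {n: round(wm_share(n), 2) for n in range(2, 7)}
print(shares)  # {2: 1.0, 3: 1.0, 4: 0.75, 5: 0.6, 6: 0.5}
```

Under these assumptions, working memory could cover every symbol at set sizes of two or three but only half of them at six, leaving more of the learning to the slower system.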
As the participants performed the tasks, an EEG cap recorded signals from the brain, and the authors applied statistical methods to extract those signals related to one learning system or the other.
The study showed that when memory demands were high, the brain signals correlated with reinforcement learning actually got stronger. In other words, when the working memory system was overtaxed, the reinforcement learning system became more important in the learning process. In contrast, when participants could hold information in mind, signals associated with reinforcement learning were weaker, suggesting an increased role for working memory.
The researchers also found that they could decode from the brain signals in a particular trial whether information was likely to be in memory or not. That too traded off with the neural marker of reinforcement learning.
Those findings, the researchers say, suggest that the two systems aren't working independently.
"If they were completely independent of each other, we'd expect the signals associated with reinforcement learning to stay the same regardless of memory demands," Frank said. "But that's not what we see, and that's a sign that the two systems are interacting."
But on its own, that finding didn't reveal the nature of that interaction -- whether it's cooperative or competitive. Was working memory shoving reinforcement learning into the background in trials when the information was readily accessible in mind? Or could it be that working memory helps to augment reinforcement learning? To figure that out, the researchers looked at how the brain signals associated with reinforcement learning changed as the learning process unfolded from trial to trial.
The reinforcement learning system is driven by what's known as "reward prediction error" or RPE, and it's the signal the researchers used to track the reinforcement learning process. RPE represents the extent to which the reward that results from an action exceeds one's expectations. Take for example a study participant trying to figure out which button to press when they see a given symbol. If they happen to guess right and get rewarded with points, that outcome is surprisingly good and produces a high RPE.
In the brain, the reinforcement learning system uses the neurotransmitter dopamine to encode RPE. A high RPE -- meaning a surprisingly good outcome -- is associated with a large release of dopamine. The reinforcement learning system uses that dopamine flood as a signal to update our understanding of what actions we should take to get a given reward. When we repeat that action subsequently, we're less surprised by the reward and so the RPE is lower. As RPE continues to diminish, the system eventually stops updating, and in so doing, settles upon an appropriate action.
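The shrinking of RPE over repeated rewards can be illustrated with a textbook delta-rule update. This is a minimal sketch, not the model used in the study: the function name `run_delta_rule`, the learning rate of 0.5, and the reward value of 1.0 are assumptions chosen for illustration.

```python
def run_delta_rule(rewards, learning_rate=0.5, initial_expectation=0.0):
    """Return the reward prediction error (RPE) on each trial.

    The learning rate and starting expectation are illustrative values.
    """
    expectation = initial_expectation
    rpes = []
    for reward in rewards:
        rpe = reward - expectation          # how much the outcome beat expectations
        expectation += learning_rate * rpe  # nudge the expectation toward the outcome
        rpes.append(rpe)
    return rpes


# Five correct, rewarded responses to the same symbol.
rpes = run_delta_rule([1.0] * 5)
print(rpes)  # [1.0, 0.5, 0.25, 0.125, 0.0625]
```

The first reward is fully surprising, and each repetition is half as surprising as the last, so the updates shrink toward zero as the expectation settles.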
One scenario for how working memory could be interacting with reinforcement learning is by informing reward expectations, making them more quickly come into line with actual rewards. In that way, working memory could be working cooperatively to speed the reinforcement learning process.
The study found strong evidence for just that scenario. During repeated trials at small set sizes where working memory is active, brain signals associated with RPE started out high in the first few trials, and then quickly dropped off -- a sign that cognitive processes are informing the neural signaling associated with reinforcement learning. In contrast, if working memory were merely suppressing reinforcement learning, one wouldn't expect to see the quick drop in RPE.
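The contrast between the two scenarios can be sketched in a toy simulation. This is an illustration under stated assumptions, not the authors' computational model: the function `rpe_trajectory`, the `wm_weight` parameter, the learning rate of 0.1, and the idea that working memory stores the last outcome perfectly after one exposure are all hypothetical simplifications.

```python
def rpe_trajectory(n_trials, wm_weight, learning_rate=0.1):
    """Simulate RPE over repeated rewarded trials for one symbol.

    wm_weight is a hypothetical knob: 0.0 means working memory does not
    feed into the reward expectation; higher values mean it does.
    """
    q = 0.0   # slowly learned reinforcement learning expectation
    wm = 0.0  # working-memory trace of the most recent outcome
    rpes = []
    for _ in range(n_trials):
        # The expectation blends what working memory holds with the RL value.
        expectation = wm_weight * wm + (1.0 - wm_weight) * q
        reward = 1.0                  # the participant responds correctly
        rpe = reward - expectation
        q += learning_rate * rpe      # incremental RL update
        wm = reward                   # assumed: outcome stored after one exposure
        rpes.append(rpe)
    return rpes


without_wm_input = rpe_trajectory(6, wm_weight=0.0)
with_wm_input = rpe_trajectory(6, wm_weight=0.8)

for trial, (a, b) in enumerate(zip(without_wm_input, with_wm_input), start=1):
    print(f"trial {trial}: RPE without WM input = {a:.3f}, with WM input = {b:.3f}")
```

In this sketch both trajectories start with the same large RPE, but when working memory contributes to the expectation the RPE falls sharply on the second trial, while without that contribution it declines only gradually. That qualitative pattern mirrors the quick drop-off described above.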
The results, Frank said, provide some of the first concrete evidence for cooperation between these two systems.
"Thinking of these not as separate systems but as one big integrated system changes our understanding of the basic science of how people and animals learn," Frank said. "It might help us make better predictions about how the overall learning process is affected in people who have deficits in either of these systems."
And that, Frank said, could one day lead to better treatments for learning impairments.
The research was funded by the National Science Foundation (1460604).
Kevin Stacey | EurekAlert!