A new study by Brown University researchers shows that two different brain systems work cooperatively as people learn.
The study, published in Proceedings of the National Academy of Sciences, focused on the interplay of two very different modes of learning a new task: reinforcement learning and working memory. Reinforcement learning is an "under-the-hood" process in which people gradually learn which actions to take by processing rewards and punishments at the neural level, and then choosing the one that works best on average -- even if the person is not aware of it. In contrast, working memory involves keeping previous actions and their outcomes in mind to more rapidly and flexibly improve performance.
"People have largely interpreted these systems as working independently or as competing with each other in the learning process," said Michael Frank, a professor in Brown's Department of Cognitive, Linguistic and Psychological Sciences and co-author of the paper. "But we show that the two work together, with neural signals underlying working memory helping to guide those that support reinforcement learning."
Anne Collins, an assistant professor at the University of California, Berkeley, led the work when she was a postdoctoral researcher working with Frank, who directs the Initiative for Computation in Brain and Mind in the Brown Institute for Brain Science. Collins and Frank developed an experimental method designed to isolate the brain signals associated with each of the two systems.
For the study, 40 participants were shown a series of symbols on a screen and asked, for each symbol, to press a particular button on a keyboard. They weren't told which key was the right one for each symbol. They had to learn it. When they got it right, they were rewarded with points. Over repeated trials, the participants came to learn which keys corresponded with which symbols.
In order to distinguish the contributions from reinforcement learning and working memory, the researchers set up problems with different numbers of symbols, ranging from two to six, and participants had to learn which button to press for each of them. Generally, people can only hold three or four items in working memory at a time, and only for short periods of time. So when the number of symbols or the delay increases, the contribution of working memory to the learning process should diminish.
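The capacity argument above can be made concrete with a rough sketch. The proportional rule and the capacity of three items used here are illustrative assumptions, not the model the researchers fit; the point is only that the share of symbols working memory can cover shrinks as the set grows from two to six.

```python
def wm_coverage(set_size, capacity=3):
    """Fraction of the symbols that fit in working memory at once.

    `capacity=3` is an assumed value taken from the "three or four items"
    rule of thumb; the proportional rule is a simplification.
    """
    return min(1.0, capacity / set_size)


# Set sizes used in the study ranged from two to six symbols.
coverage_by_set_size = {n: round(wm_coverage(n), 2) for n in range(2, 7)}
print(coverage_by_set_size)
```

Under these assumptions, working memory covers every symbol at set sizes two and three and only half of them at set size six.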
As the participants performed the tasks, an EEG cap recorded signals from the brain, and the authors applied statistical methods to extract those signals related to one learning system or the other.
The study showed that when memory demands were high, the brain signals correlated with reinforcement learning actually got stronger. In other words, when the working memory system was overtaxed, the reinforcement learning system became more important in the learning process. In contrast, when participants could hold information in mind, signals associated with reinforcement learning were weaker, suggesting an increased role for working memory.
The researchers also found that they could decode from the brain signals in a particular trial whether information was likely to be in memory or not. That too traded off with the neural marker of reinforcement learning.
Those findings, the researchers say, suggest that the two systems aren't working independently.
"If they were completely independent of each other, we'd expect the signals associated with reinforcement learning to stay the same regardless of memory demands," Frank said. "But that's not what we see, and that's a sign that the two systems are interacting."
But on its own, that finding didn't reveal the nature of that interaction -- whether it's cooperative or competitive. Was working memory shoving reinforcement learning into the background in trials when the information was readily accessible in mind? Or could it be that working memory helps to augment reinforcement learning? To figure that out, the researchers looked at how the brain signals associated with reinforcement learning changed as the learning process unfolded from trial to trial.
The reinforcement learning system is driven by what's known as "reward prediction error" or RPE, and it's the signal the researchers used to track the reinforcement learning process. RPE represents the extent to which the reward that results from an action exceeds one's expectations. Take for example a study participant trying to figure out which button to press when they see a given symbol. If they happen to guess right and get rewarded with points, that outcome is surprisingly good and produces a high RPE.
In the brain, the reinforcement learning system uses the neurotransmitter dopamine to encode RPE. A high RPE -- meaning a surprisingly good outcome -- is associated with a large release of dopamine. The reinforcement learning system uses that dopamine flood as a signal to update our understanding of what actions we should take to get a given reward. When we repeat that action subsequently, we're less surprised by the reward and so the RPE is lower. As RPE continues to diminish, the system eventually stops updating, and in so doing, settles upon an appropriate action.
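The updating process described above is often written as a simple "delta rule." The sketch below is a textbook illustration, not the researchers' analysis code; the learning rate, reward size and trial count are assumed values chosen for readability. It shows RPE starting high and shrinking as the same rewarded action is repeated.

```python
def simulate_rpe(n_trials=10, learning_rate=0.3, reward=1.0):
    """Return the reward prediction error on each repeat of a rewarded action.

    All parameter values are illustrative assumptions.
    """
    expectation = 0.0  # no reward expected before the first lucky guess
    rpes = []
    for _ in range(n_trials):
        rpe = reward - expectation          # how much better than expected
        rpes.append(rpe)
        expectation += learning_rate * rpe  # nudge the expectation toward the reward
    return rpes


rpes = simulate_rpe()
print([round(x, 3) for x in rpes])
```

Each repeat shrinks the error by the same fraction, so the updates get smaller and smaller until the expectation effectively matches the reward.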
One scenario for how working memory could be interacting with reinforcement learning is by attenuating reward expectations, making them more quickly come into line with actual rewards. In that way, working memory could be working cooperatively to speed the reinforcement learning process.
The study found strong evidence for just that scenario. During repeated trials at small set sizes where working memory is active, brain signals associated with RPE started out high in the first few trials, and then quickly dropped off -- a sign that cognitive processes are informing the neural signaling associated with reinforcement learning. In contrast, if working memory were merely suppressing reinforcement learning, one wouldn't expect to see the quick drop in RPE.
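The contrast between the two scenarios can be sketched in a few lines. This is a toy illustration under stated assumptions, not the model reported in the paper: `wm_weight`, the learning rate and the one-shot memory store are all hypothetical. When working memory feeds into the reward expectation, RPE collapses after the first rewarded trial; when the expectation comes from the slow reinforcement learning value alone, RPE declines gradually.

```python
def rpe_trajectory(n_trials=6, learning_rate=0.1, wm_weight=0.0, reward=1.0):
    """RPE per trial when working memory contributes `wm_weight` of the expectation.

    wm_weight=0.0 means expectations come from reinforcement learning alone.
    All parameter values are illustrative assumptions.
    """
    q_value = 0.0   # slowly learned reinforcement learning expectation
    wm_value = 0.0  # outcome held in working memory (empty before trial one)
    rpes = []
    for _ in range(n_trials):
        expectation = wm_weight * wm_value + (1.0 - wm_weight) * q_value
        rpe = reward - expectation
        rpes.append(rpe)
        q_value += learning_rate * rpe  # gradual update driven by RPE
        wm_value = reward               # working memory stores the outcome in one shot
    return rpes


rl_only = rpe_trajectory(wm_weight=0.0)
cooperative = rpe_trajectory(wm_weight=0.8)
print("RL alone:   ", [round(x, 2) for x in rl_only])
print("Cooperative:", [round(x, 2) for x in cooperative])
```

Both trajectories start at the same RPE on the first trial, since nothing has been learned or remembered yet. They diverge from the second trial onward, which is the quick drop the article describes at small set sizes.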
The results, Frank said, provide some of the first concrete evidence for cooperation between these two systems.
"Thinking of these not as separate systems but as one big integrated system changes our understanding of the basic science of how people and animals learn," Frank said. "It might help us make better predictions about how the overall learning process is affected in people who have deficits in either of these systems."
And that, Frank said, could one day lead to better treatments for learning impairments.
The research was funded by the National Science Foundation (1460604).
Kevin Stacey | EurekAlert!