TY - JOUR
T1 - Nonlinear age-related differences in probabilistic learning in mice
T2 - A 5-armed bandit task study
AU - Ohta, Hiroyuki
AU - Nozawa, Takashi
AU - Nakano, Takashi
AU - Morimoto, Yuji
AU - Ishizuka, Toshiaki
N1 - Publisher Copyright:
© 2024 Elsevier Inc.
PY - 2024/10
Y1 - 2024/10
N2 - This study explores the impact of aging on reinforcement learning in mice, focusing on changes in learning rates and behavioral strategies. A 5-armed bandit task (5-ABT) and a computational Q-learning model were used to evaluate the positive and negative learning rates and the inverse temperature across three age groups (3, 12, and 18 months). Results showed a significant decline in the negative learning rate of 18-month-old mice, which was not observed for the positive learning rate. This suggests that older mice maintain the ability to learn from successful experiences while decreasing the ability to learn from negative outcomes. We also observed a significant age-dependent variation in inverse temperature, reflecting a shift in action selection policy. Middle-aged mice (12 months) exhibited higher inverse temperature, indicating a higher reliance on previous rewarding experiences and reduced exploratory behaviors, when compared to both younger and older mice. This study provides new insights into aging research by demonstrating that there are age-related differences in specific components of reinforcement learning, which exhibit a non-linear pattern.
AB - This study explores the impact of aging on reinforcement learning in mice, focusing on changes in learning rates and behavioral strategies. A 5-armed bandit task (5-ABT) and a computational Q-learning model were used to evaluate the positive and negative learning rates and the inverse temperature across three age groups (3, 12, and 18 months). Results showed a significant decline in the negative learning rate of 18-month-old mice, which was not observed for the positive learning rate. This suggests that older mice maintain the ability to learn from successful experiences while decreasing the ability to learn from negative outcomes. We also observed a significant age-dependent variation in inverse temperature, reflecting a shift in action selection policy. Middle-aged mice (12 months) exhibited higher inverse temperature, indicating a higher reliance on previous rewarding experiences and reduced exploratory behaviors, when compared to both younger and older mice. This study provides new insights into aging research by demonstrating that there are age-related differences in specific components of reinforcement learning, which exhibit a non-linear pattern.
KW - 5-armed bandit task
KW - Aging
KW - Learning rate
KW - Q-learning model
KW - Reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85198732592&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85198732592&partnerID=8YFLogxK
U2 - 10.1016/j.neurobiolaging.2024.06.004
DO - 10.1016/j.neurobiolaging.2024.06.004
M3 - Article
C2 - 39029360
AN - SCOPUS:85198732592
SN - 0197-4580
VL - 142
SP - 8
EP - 16
JO - Neurobiology of Aging
JF - Neurobiology of Aging
ER -