TY - GEN
T1 - Free-energy-based reinforcement learning in a partially observable environment
AU - Otsuka, Makoto
AU - Yoshimoto, Junichiro
AU - Doya, Kenji
PY - 2010
Y1 - 2010
AB - Free-energy-based reinforcement learning (FERL) can handle Markov decision processes (MDPs) with high-dimensional state spaces by approximating the state-action value function with the negative equilibrium free energy of a restricted Boltzmann machine (RBM). In this study, we extend the FERL framework to handle partially observable MDPs (POMDPs) by incorporating a recurrent neural network that learns a memory representation sufficient for predicting future observations and rewards. We demonstrate that the proposed method successfully solves POMDPs with high-dimensional observations without any prior knowledge of the environmental hidden states and dynamics. After learning, task structures are implicitly represented in the distributed activation patterns of hidden nodes of the RBM.
UR - http://www.scopus.com/inward/record.url?scp=84887013392&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84887013392&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84887013392
SN - 2930307102
SN - 9782930307107
T3 - Proceedings of the 18th European Symposium on Artificial Neural Networks - Computational Intelligence and Machine Learning, ESANN 2010
SP - 541
EP - 546
BT - Proceedings of the 18th European Symposium on Artificial Neural Networks - Computational Intelligence and Machine Learning, ESANN 2010
T2 - 18th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, ESANN 2010
Y2 - 28 April 2010 through 30 April 2010
ER -