A solving method for MDPs by minimizing variational free energy

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this article, we propose a novel approach to acquiring the optimal policy for a continuous Markov decision process. Based on an analogy from statistical mechanics, we introduce a variational free energy over a policy. A good policy can be obtained by minimizing the variational free energy. According to our approach, the optimal policy in linear quadratic regulator problems can be obtained by using Kaiman filtering and smoothing techniques. Even in non-linear problems, a semioptimal policy can be obtained by Monte Carlo technique with a Gaussian process method.

Original languageEnglish
Title of host publication2004 IEEE International Joint Conference on Neural Networks - Proceedings
Pages1817-1822
Number of pages6
DOIs
Publication statusPublished - 2004
Externally publishedYes
Event2004 IEEE International Joint Conference on Neural Networks - Proceedings - Budapest, Hungary
Duration: 25-07-200429-07-2004

Publication series

NameIEEE International Conference on Neural Networks - Conference Proceedings
Volume3
ISSN (Print)1098-7576

Conference

Conference2004 IEEE International Joint Conference on Neural Networks - Proceedings
Country/TerritoryHungary
CityBudapest
Period25-07-0429-07-04

All Science Journal Classification (ASJC) codes

  • Software

Fingerprint

Dive into the research topics of 'A solving method for MDPs by minimizing variational free energy'. Together they form a unique fingerprint.

Cite this