TY - JOUR
T1 - Acrobot control by learning the switching of multiple controllers
AU - Yoshimoto, Junichiro
AU - Nishimura, Masaya
AU - Tokita, Yoichi
AU - Ishii, Shin
PY - 2005/5
Y1 - 2005/5
N2 - Reinforcement learning (RL) has been applied to constructing controllers for nonlinear systems in recent years. Since RL methods do not require an exact dynamics model of the controlled object, they have a higher flexibility and potential for adaptation to uncertain or nonstationary environments than methods based on traditional control theory. If the target system has a continuous state space whose dynamic characteristics are nonlinear, however, RL methods often suffer from unstable learning processes. For this reason, it is difficult to apply RL methods to control tasks in the real world. In order to overcome the disadvantage of RL methods, we propose an RL scheme combining multiple controllers, each of which is constructed based on traditional control theory. We then apply it to a swinging-up and stabilizing task of an acrobot with a limited torque, which is a typical but difficult task in the field of nonlinear control theory. Our simulation result showed that our method was able to realize stable learning and to achieve fairly good control.
AB - Reinforcement learning (RL) has been applied to constructing controllers for nonlinear systems in recent years. Since RL methods do not require an exact dynamics model of the controlled object, they have a higher flexibility and potential for adaptation to uncertain or nonstationary environments than methods based on traditional control theory. If the target system has a continuous state space whose dynamic characteristics are nonlinear, however, RL methods often suffer from unstable learning processes. For this reason, it is difficult to apply RL methods to control tasks in the real world. In order to overcome the disadvantage of RL methods, we propose an RL scheme combining multiple controllers, each of which is constructed based on traditional control theory. We then apply it to a swinging-up and stabilizing task of an acrobot with a limited torque, which is a typical but difficult task in the field of nonlinear control theory. Our simulation result showed that our method was able to realize stable learning and to achieve fairly good control.
KW - Acrobot
KW - Nonlinear control
KW - Reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=19744373386&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=19744373386&partnerID=8YFLogxK
U2 - 10.1007/s10015-004-0340-6
DO - 10.1007/s10015-004-0340-6
M3 - Article
AN - SCOPUS:19744373386
SN - 1433-5298
VL - 9
SP - 67
EP - 71
JO - Artificial Life and Robotics
JF - Artificial Life and Robotics
IS - 2
ER -