TY - GEN
T1 - An integrated design for intensified direct heuristic dynamic programming
AU - Luo, Xiong
AU - Si, Jennie
AU - Zhou, Yuchao
PY - 2013
Y1 - 2013
N2 - There has been a growing interest in the study of adaptive/approximate dynamic programming (ADP) in recent years. The ADP technique provides a powerful tool to understand and improve the principled technologies of machine intelligence system. As one of the ADP algorithms based on adaptive critic neural networks (NNs), the direct heuristic dynamic programming (direct HDP) has demonstrated some successful applications in solving realistic engineering control problems. In this study, based on a three-network architecture in which the reinforcement signal is approximated by an additional NN, a novel integrated design method for intensified direct HDP is developed. The new design approach is implemented by using multiple PID neural networks (PIDNNs), which effectively takes into account structural knowledge of system states and control that are usually present in a physical system. By using a Lyapunov stability approach, a uniformly ultimately boundedness (UUB) result is proved for our PIDNNs-based intensified direct HDP learning controller. Furthermore, the learning and control performances of the proposed design is tested using the popular cart-pole example to illustrate the key ideas of this paper.
AB - There has been a growing interest in the study of adaptive/approximate dynamic programming (ADP) in recent years. The ADP technique provides a powerful tool to understand and improve the principled technologies of machine intelligence system. As one of the ADP algorithms based on adaptive critic neural networks (NNs), the direct heuristic dynamic programming (direct HDP) has demonstrated some successful applications in solving realistic engineering control problems. In this study, based on a three-network architecture in which the reinforcement signal is approximated by an additional NN, a novel integrated design method for intensified direct HDP is developed. The new design approach is implemented by using multiple PID neural networks (PIDNNs), which effectively takes into account structural knowledge of system states and control that are usually present in a physical system. By using a Lyapunov stability approach, a uniformly ultimately boundedness (UUB) result is proved for our PIDNNs-based intensified direct HDP learning controller. Furthermore, the learning and control performances of the proposed design is tested using the popular cart-pole example to illustrate the key ideas of this paper.
KW - Direct heuristic dynamic programming
KW - PID neural network
KW - neural network
KW - stability
UR - http://www.scopus.com/inward/record.url?scp=84891522328&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84891522328&partnerID=8YFLogxK
U2 - 10.1109/ADPRL.2013.6615006
DO - 10.1109/ADPRL.2013.6615006
M3 - Conference contribution
SN - 9781467359252
T3 - IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL
SP - 183
EP - 190
BT - Proceedings of the 2013 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2013 - 2013 IEEE Symposium Series on Computational Intelligence, SSCI 2013
T2 - 2013 4th IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2013
Y2 - 16 April 2013 through 19 April 2013
ER -