Wavelet Neural Network Observer Based Adaptive Tracking Control for a Class of Uncertain Nonlinear Delayed Systems Using Reinforcement Learning

Full Text (PDF, 559KB), PP.28-34

Views: 0 Downloads: 0

Author(s)

Manish Sharma 1,* Ajay Verma 2

1. Dept. of Electronics and Instrumentation Engineering, Medicaps Institute of Technology and Management, Indore, India

2. Dept. of Electronics and Instrumentation Engineering, Institute of Engineering and Technology, Devi Ahilya University, Indore, India

* Corresponding author.

DOI: https://doi.org/10.5815/ijisa.2012.02.03

Received: 23 Mar. 2011 / Revised: 20 Jul. 2011 / Accepted: 9 Oct. 2011 / Published: 8 Mar. 2012

Index Terms

Wavelet neural networks, adaptive control, optimal control, reinforcement learning, Lyapunov- Krasovskii functional

Abstract

This paper is concerned with the observer designing problem for a class of uncertain delayed nonlinear systems using reinforcement learning. Reinforcement learning is used via two Wavelet Neural networks (WNN), critic WNN and action WNN, which are combined to form an adaptive WNN controller. The “strategic” utility function is approximated by the critic WNN and is minimized by the action WNN. Adaptation laws are developed for the online tuning of wavelets parameters. By Lyapunov approach, the uniformly ultimate boundedness of the closed-loop tracking error is verified. Finally, a simulation example is shown to verify the effectiveness and performance of the proposed method.

Cite This Paper

Manish Sharma, Ajay Verma, "Wavelet Neural Network Observer Based Adaptive Tracking Control for a Class of Uncertain Nonlinear Delayed Systems Using Reinforcement Learning", International Journal of Intelligent Systems and Applications(IJISA), vol.4, no.2, pp.28-34, 2012. DOI:10.5815/ijisa.2012.02.03

Reference

[1]H. Lens and J. Adamy, “Observer based controller design for linear systems with input constraints”, Proceedings of 17th World Congress, The international federation of automatic control, Seoul Korea, pp.9916-9921, July 2008.

[2]F. Abdollahi, H.A Talebi and R.V. Patel, “A stable neural network-based observer with application to flexible-joint manipulators”, IEEE Transactions on Neural Networks, Vol. 17, Issue 1, , pp.118-129 Jan. 2006.

[3]F. Sun; Z. Sun; P. Y. Woo, “Neural network-based adaptive controller design of robotic manipulators with an observer”, IEEE Transactions on Neural Networks, Vol. 12, Issue 1, pp.54-67 Jan. 2001.

[4]M. S. Mahmoud, Y. Shi and H. Nounou, “Resilient observer based control of uncertain time delay systems”, International journal of Innovative Computing, Information and Control, Vol. 3 no.2, pp.407-418.Apr.2007.

[5]P. Cui and C. Zhang, “Observer design and output feedback stabilization of linear singular time delay systems with unknown inputs”. Journal of control theory application, Vol. 6, no.2, pp-177-183.Apr.2008.

[6]J.P. Richard, “Time delay systems: an overview of some recent advances and open problems”, Automatica, Vol. 39, pp. 1667-1694, 2003.

[7]P. He and S. Jagannathan, “Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input constraints,” IEEE Transactions on Systems, Man, and Cybernetics—Part B: Cybernetics, Vol. 37, no. 2, pp.425-436, April 2007.

[8]W.S. Lin, L.H. Chang and P.C. Yang, “Adaptive critic anti-slip control of wheeled autonomous robot”, IET Control Theory Applications, Vol. 1, No. 1, January 2007. 

[9]X. Lin and S. N. Balakrishnan, “Convergence analysis of adaptive critic based optimal control,” Proceedings of the American Control Conferenc, pp. 1929–1933, 2000.

[10]D. V. Prokhorov and D. C. Wunsch, “Adaptive critic designs”, IEEE Transactions on Neural Networks,, vol. 8, no. 5, pp. 997–1007, Sep. 1997.

[11]H. V. Hasselt and M. Wiering, “Reinforcement learning in continuous action spaces”, IEEE Symposium on Approximate Dynamic Programming and Reinforcement Learning, pages 272–279, 2007.

[12]Hussein Jaddu and Kunihiko Hiraishi, “A Wavelet Approach for Solving linear quadratic optimal control problems”, SICE-ICASE International Joint Conference ,Oct. 18-21, Korea, 2006.

[13]Q. Zhang and A. Benveniste, “Wavelet networks,” IEEE Transactions on Neural Networks, Vol. 3, no. 6, pp.889-898, November 1992.

[14]J. Zhang, G. G. Walter, Y. Miao, and. W. Lee, “Wavelet neural networks for function learning,” IEEE Transactions on Signal Processing, Vol. 43, no. 6, pp.1485-1497, June 1995.

[15]B. Delyon, A. Juditsky, and A. Benveniste, “Accuracy analysis for wavelet approximations,” IEEE Transactions on Neural Networks, Vol. 6, no. 2, pp.332-348 March 1995.