An RBF based Neuro-dynamic Approach for the Control of Stochastic Dynamic Systems

Authors:	Sarimveis Haralambos, National Technical University of Athens, Greece Patrinos Panagiotis K., National Technical University of Athens, Greece
Topic:	3.2 Cognition and Control ( AI, Fuzzy, Neuro, Evolut.Comp.)
Session:	Neural Control
Keywords:	Markov decision processes, Radial base function networks, uncertain dynamic systems, optimal control

Abstract

This paper presents a neuro-dynamic programming methodology for the control of markov decision processes. The proposed method can be considered as a variant of the optimistic policy iteration, where radial basis function (RBF) networks are employed as a compact representation of the cost-to-go function and the λ-LSPE is used for policy evaluation. We also emphasize the reformulation of the Bellman equation around the post-decision state in order to circumvent the calculation of the expectation. The proposed algorithm is applied to a retailer-inventory management problem.