15th Triennial World Congress of the International Federation of Automatic Control
  Barcelona, 21–26 July 2002 
SEMI-MARKOV DECISION PROBLEMS AND PERFORMANCE SENSITIVITY ANALYSIS
Xi-Ren Cao*,1
* Department of Electrical and Electronic Engineering
The Hong Kong University of Science and Technology
Clear Water Bay, Kowloon, Hong Kong

We extend the results on performance potentials, perturbation realization matrices, policy iteration of Markov decision processes, etc., to semi-Markov processes (SMPs). Starting with the concept of perturbation realization, we define a realization matrix and prove that it satisfies a Lyapunov equation. From the realization matrix we define a performance potential and prove that it satisfies a Poisson equation. With these, sensitivity formulas and policy iteration algorithms for semi-Markov decision processes (SMDPs) can be derived. The performance sensitivities can be obtained, and policy iteration for SMDPs can be implemented, on a single sample path of the SMP.
Keywords: Potentials, Lyapunov equations, Poisson equations, perturbation analysis, policy iteration
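
For orientation, the two equations named in the abstract can be checked numerically in the discrete-time Markov chain setting that the paper extends. The sketch below is not the SMP construction itself. It assumes an ergodic chain with a made-up transition matrix P and reward vector f, the normalization pi g = eta for the potential g, and the sign convention D = e g^T - g e^T for the realization matrix. All names and values are illustrative.

```python
import numpy as np

# Hypothetical 3-state ergodic chain and reward vector (illustrative values only).
P = np.array([[0.5, 0.3, 0.2],
              [0.2, 0.6, 0.2],
              [0.3, 0.3, 0.4]])
f = np.array([1.0, 4.0, 2.0])

n = P.shape[0]
e = np.ones(n)
I = np.eye(n)

# Stationary distribution: solve pi (I - P) = 0 together with pi e = 1.
A = np.vstack([(I - P).T, e])
b = np.zeros(n + 1)
b[-1] = 1.0
pi, *_ = np.linalg.lstsq(A, b, rcond=None)

# Long-run average reward.
eta = pi @ f

# Performance potential under the assumed normalization pi g = eta:
# g = (I - P + e pi)^{-1} f.
g = np.linalg.solve(I - P + np.outer(e, pi), f)

# Poisson equation: (I - P) g + eta e = f.
poisson_residual = (I - P) @ g + eta * e - f

# Realization matrix (assumed convention): D[i, j] = g[j] - g[i].
D = np.outer(e, g) - np.outer(g, e)
F = np.outer(e, f) - np.outer(f, e)

# Lyapunov equation: D - P D P^T = F.
lyapunov_residual = D - P @ D @ P.T - F

print("eta =", eta)
print("max |Poisson residual|  =", np.abs(poisson_residual).max())
print("max |Lyapunov residual| =", np.abs(lyapunov_residual).max())
```

Both residuals should vanish up to floating-point error. The realization matrix is skew-symmetric by construction, so either object determines the other up to an additive constant in g.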

Corresponding Author: Tel: (852)2358-7048 Fax: (852)2358-1485

E-mail: eecao@ust.hk
Session slot T-We-M02: Performance Issues in Discrete Event Systems / Area code 3c: Discrete Event Dynamic Systems