Neuro-Dynamic Programming for the Exploration of Unknown Graphs

Authors:	Baglietto Marco, University of Genoa, Italy Battistelli Giorgio, University of Genoa, Italy Scardovi Luca, University of Genoa, Italy Zoppoli Riccardo, University of Genoa, Italy
Topic:	2.4 Optimal Control
Session:	Optimal Control Theory and Design Methods
Keywords:	Neural networks, Dynamic programming, Graph, Control, Entropy

Abstract

In this paper, the problem of exploring stochastic graphs is addressed. The definition of the entropy related to the a-priori unknown parameters (the lengths of the a-priori unknown links) leads to the formulation of the problem as a stochastic optimal control one. The application of exact Dynamic Programming suffers the so-called curse of dimensionality. To overcome this drawback, an approximate technique is proposed making use of Neuro-Dynamic Programming. Exploiting the concept of frontier, any approximate solution of the problem is shown to generate a "proper" policy.