Neuro-Dynamic Programming for the Exploration of Unknown Graphs
| Authors: | Baglietto Marco, University of Genoa, Italy Battistelli Giorgio, University of Genoa, Italy Scardovi Luca, University of Genoa, Italy Zoppoli Riccardo, University of Genoa, Italy |
|---|
| Topic: | 2.4 Optimal Control |
|---|
| Session: | Optimal Control Theory and Design Methods |
|---|
| Keywords: | Neural networks, Dynamic programming, Graph, Control, Entropy |
|---|
Abstract
In this paper, the problem of exploring stochastic graphs is addressed. The definition of the entropy related to the a-priori unknown parameters (the lengths of the a-priori unknown links) leads to the formulation of the problem as a stochastic optimal control one. The application of exact Dynamic Programming suffers the so-called curse of dimensionality. To overcome this drawback, an approximate technique is proposed making use of Neuro-Dynamic Programming. Exploiting the concept of frontier, any approximate solution of the problem is shown to generate a "proper" policy.