BACK to VOLUME 34 NO.2

Kybernetika 34(2):217-234, 1998.

Adaptive Control for Discrete-Time Markov Processes with Unbounded Costs: Discounted Criterion.

Evgueni Gordienko and J. Adolfo Minjárez-Sosa


Abstract:

We study the adaptive control problem for discrete-time Markov control processes with Borel state and action spaces and possibly unbounded one-stage costs. The processes are given by recurrent equations $x_{t+1}=F(x_t,a_t,\xi _t),$ $t=0,1,\ldots$ with i.i.d. $\Re ^k$-valued random vectors $\xi _t$ whose density $\rho $ is unknown. Assuming observability of $\xi _t$ we propose the procedure of statistical estimation of $\rho $ that allows us to prove discounted asymptotic optimality of two types of adaptive policies used early for the processes with bounded costs.


AMS: 90C;


download abstract.pdf


BIB TeX

@article{kyb:1998:2:217-234,

author = {Gordienko, Evgueni and Minj\'{a}rez-Sosa, J. Adolfo},

title = {Adaptive Control for Discrete-Time Markov Processes with Unbounded Costs: Discounted Criterion.},

journal = {Kybernetika},

volume = {34},

year = {1998},

number = {2},

pages = {217-234}

publisher = {{\'U}TIA, AV {\v C}R, Prague },

}


BACK to VOLUME 34 NO.2