BACK to VOLUME 36 NO.2

Kybernetika 36(2):195-210, 2000.

Estimates of Stability of Markov Control Processes with Unbounded Costs.

Evgueni Gordienko and Francisco Salem


Abstract:

For a discrete-time Markov control process with the transition probability $p$, we compare the total discounted costs $V_\beta$ $(\pi_\beta)$ and $V_\beta(\tilde{\pi}_\beta)$, when applying the optimal control policy $\pi_\beta$ and its approximation $\tilde{\pi}_\beta$. The policy $\tilde{\pi}_\beta$ is optimal for an approximating process with the transition probability $\tilde{p}$.


A cost per stage for considered processes can be unbounded. Under certain ergodicity assumptions we establish the upper bound for the relative stability index $[V_\beta(\tilde{\pi}_\beta)-V_\beta(\pi_\beta)]/V_\beta(\pi_\beta)$. This bound does not depend on a discount factor $\beta\in (0,1)$ and this is given in terms of the total variation distance between $p$ and $\tilde{p}$.


AMS: 93C;


download abstract.pdf


BIB TeX

@article{kyb:2000:2:195-210,

author = {Gordienko, Evgueni and Salem, Francisco},

title = {Estimates of Stability of Markov Control Processes with Unbounded Costs.},

journal = {Kybernetika},

volume = {36},

year = {2000},

number = {2},

pages = {195-210}

publisher = {{\'U}TIA, AV {\v C}R, Prague },

}


BACK to VOLUME 36 NO.2