Average optimality for Markov decision processes in Borel spaces: a new condition and approach

Media type: E-article

Title: Average optimality for Markov decision processes in Borel spaces: a new condition and approach

Contributors: Guo, Xianping; Zhu, Quanxin

Published: Cambridge University Press (CUP), 2006

Language: English

DOI: 10.1017/s0021900200001662

ISSN: 0021-9002; 1475-6072

Keywords: Statistics, Probability and Uncertainty; General Mathematics; Statistics and Probability

Description: In this paper we study discrete-time Markov decision processes with Borel state and action spaces. The criterion is to minimize average expected costs, and the costs may have neither upper nor lower bounds. We first provide two average optimality inequalities of opposing directions and give conditions for the existence of solutions to them. Then, using the two inequalities, we ensure the existence of an average optimal (deterministic) stationary policy under additional continuity-compactness assumptions. Our conditions are slightly weaker than those in the previous literature. Also, some new sufficient conditions for the existence of an average optimal stationary policy are imposed on the primitive data of the model. Moreover, our approach is slightly different from the well-known ‘optimality inequality approach’ widely used in Markov decision processes. Finally, we illustrate our results in two examples.
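
For orientation, the average-cost criterion and a pair of optimality inequalities of opposing directions can be written in generic notation as below. This is only a sketch in standard MDP notation, not the paper's own statement: the symbols $c$ (one-stage cost), $Q$ (transition kernel), $A(x)$ (admissible actions at state $x$), $g^*$ (a constant) and $h_1, h_2$ (measurable functions) are assumptions introduced here, and the paper's exact conditions and function classes may differ.

```latex
% Generic notation (assumed, not taken from the record):
%   c(x,a)      one-stage cost, possibly unbounded above and below
%   Q(dy|x,a)   transition kernel on the Borel state space
%   A(x)        set of admissible actions at state x
\begin{align*}
  % Long-run expected average cost of policy \pi from initial state x
  J(\pi, x) &= \limsup_{n \to \infty} \frac{1}{n}\,
      \mathbb{E}_x^{\pi}\!\left[ \sum_{t=0}^{n-1} c(x_t, a_t) \right], \\
  % Optimal average cost; \pi^* is average optimal if J(\pi^*, x) = J^*(x) for all x
  J^*(x) &= \inf_{\pi} J(\pi, x), \\
  % First inequality (one direction)
  g^* + h_1(x) &\ge \inf_{a \in A(x)}
      \left[ c(x,a) + \int h_1(y)\, Q(dy \mid x, a) \right], \\
  % Second inequality (opposing direction)
  g^* + h_2(x) &\le \inf_{a \in A(x)}
      \left[ c(x,a) + \int h_2(y)\, Q(dy \mid x, a) \right].
\end{align*}
```

Roughly, and under suitable growth conditions on $h_1$ and $h_2$, iterating the first inequality along a stationary policy that attains the infimum bounds that policy's average cost above by $g^*$, while iterating the second along an arbitrary policy bounds every policy's average cost below by $g^*$. Together they would identify $g^*$ as the optimal average cost and the minimizing stationary policy as average optimal.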
