• Media type: E-article
  • Title: Average optimality for Markov decision processes in Borel spaces: a new condition and approach
  • Contributors: Guo, Xianping; Zhu, Quanxin
  • Published: Cambridge University Press (CUP), 2006
  • Published in: Journal of Applied Probability, 43 (2006) 2, pages 318-334
  • Language: English
  • DOI: 10.1017/s0021900200001662
  • ISSN: 0021-9002; 1475-6072
  • Keywords: Statistics, Probability and Uncertainty ; General Mathematics ; Statistics and Probability
  • Description: In this paper we study discrete-time Markov decision processes with Borel state and action spaces. The criterion is to minimize average expected costs, and the costs may have neither upper nor lower bounds. We first provide two average optimality inequalities of opposing directions and give conditions for the existence of solutions to them. Then, using the two inequalities, we ensure the existence of an average optimal (deterministic) stationary policy under additional continuity-compactness assumptions. Our conditions are slightly weaker than those in the previous literature. Also, some new sufficient conditions for the existence of an average optimal stationary policy are imposed on the primitive data of the model. Moreover, our approach is slightly different from the well-known ‘optimality inequality approach’ widely used in Markov decision processes. Finally, we illustrate our results in two examples.