AVERAGE OPTIMALITY FOR MARKOV DECISION PROCESSES IN BOREL SPACES: A NEW CONDITION AND APPROACH，成果详细信息-中国科技论文在线

郭先平

64浏览
0点赞
0收藏
0分享
252下载
0评论
引用

期刊论文

AVERAGE OPTIMALITY FOR MARKOV DECISION PROCESSES IN BOREL SPACES: A NEW CONDITION AND APPROACH

郭先平， XIANPING GUO ， Zhongshan University QUANXIN ZHU ， South China Normal University

J. Appl. Prob. 43, 318-334(2006)，-0001，（）：

URL:

摘要/描述

In this paper we study discrete-time Markov decision processes with Borel state and action spaces. The criterion is to minimize average expected costs, and the costs may have neither upper nor lower bounds. We ﬁrst provide two average optimality inequalities of opposing directions and give conditions for the existence of solutions to them. Then, using the two inequalities, we ensure the existence of an average optimal (deterministic) stationary policy under additional continuity-compactness assumptions. Our conditions are slightly weaker than those in the previous literature. Also, some new sufﬁcient conditions for the existence of an average optimal stationary policy are imposed on the primitive data of the model. Moreover, our approach is slightly different from the well-known' optimality inequality approach' widely used in Markov decision processes. Finally, we illustrate our results in two examples.

关键词: Discrete-time Markov decision process ， average expected criterion ， average optimality inequality ， optimal stationary policy

问答

暂无问题，成为第一个提问者

我要提问全部问题

【免责声明】以下全部内容由[郭先平]上传于[2006年10月12日 02时17分08秒]，版权归原创者所有。本文仅代表作者本人观点，与本网站无关。本网站对文中陈述、观点判断保持中立，不对所包含内容的准确性、可靠性或完整性提供任何明示或暗示的保证。请读者仅作参考，并请自行承担全部责任。

我要评论

全部评论 共 0 条

本学者其他成果

同领域成果