A Potential-Based Method for Finite-Stage Markov Decision Processes ，成果详细信息-中国科技论文在线

贾庆山

39浏览
0点赞
0收藏
0分享
113下载
0评论
引用

期刊论文

A Potential-Based Method for Finite-Stage Markov Decision Processes

贾庆山， Qing-Shan Jia Member ， IEEE

，-0001，（）：

URL:

摘要/描述

Finite-Stage Markov Decision Process (MDP) supplies a general framework for many practical problems when only the performance in a finite duration is of interest. Dynamic programming (DP) supplies a general way to find the optimal policies but is usually practically infeasible, due to the exponentially increasing policy space. Approximating the finitestage MDP by an infinite-stage MDP reduces the search space but usually does not find the optimal stationary policy, due to the approximation error. We develop a method that finds the optimal stationary policies for the finite-stage MDP. The method is based on performance potentials, which can be estimated through sample paths and thus suits practical application.

关键词: Performance potentials ， policy iteration ， stationary policy ， finite-stage Markov Decision

问答

暂无问题，成为第一个提问者

我要提问全部问题

【免责声明】以下全部内容由[贾庆山]上传于[2008年03月19日 17时24分03秒]，版权归原创者所有。本文仅代表作者本人观点，与本网站无关。本网站对文中陈述、观点判断保持中立，不对所包含内容的准确性、可靠性或完整性提供任何明示或暗示的保证。请读者仅作参考，并请自行承担全部责任。

我要评论

全部评论 共 0 条

本学者其他成果

同领域成果