您当前所在位置: 首页 > 学者

贾庆山

  • 39浏览

  • 0点赞

  • 0收藏

  • 0分享

  • 113下载

  • 0评论

  • 引用

期刊论文

A Potential-Based Method for Finite-Stage Markov Decision Processes

贾庆山Qing-Shan Jia Member IEEE

,-0001,():

URL:

摘要/描述

Finite-Stage Markov Decision Process (MDP) supplies a general framework for many practical problems when only the performance in a finite duration is of interest. Dynamic programming (DP) supplies a general way to find the optimal policies but is usually practically infeasible, due to the exponentially increasing policy space. Approximating the finitestage MDP by an infinite-stage MDP reduces the search space but usually does not find the optimal stationary policy, due to the approximation error. We develop a method that finds the optimal stationary policies for the finite-stage MDP. The method is based on performance potentials, which can be estimated through sample paths and thus suits practical application.

【免责声明】以下全部内容由[贾庆山]上传于[2008年03月19日 17时24分03秒],版权归原创者所有。本文仅代表作者本人观点,与本网站无关。本网站对文中陈述、观点判断保持中立,不对所包含内容的准确性、可靠性或完整性提供任何明示或暗示的保证。请读者仅作参考,并请自行承担全部责任。

我要评论

全部评论 0

本学者其他成果

    同领域成果