Research on Deep Reinforcement Learning Algorithm Based on Curiosity Exploration

Liu Yiming; Hu Zheng


First published: 2020-05-12

Liu Yiming ¹
Liu Yiming (b. 1996), male, master's student; research interest: deep reinforcement learning
Hu Zheng ¹
Hu Zheng, male, associate professor and master's supervisor; research interest: multi-agent decision-making

1. School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, 100876

Abstract: Reinforcement learning is widely used in artificial intelligence fields such as system decision-making. Thanks to its strong performance, it can solve agent decision-making problems in many complex scenarios and is of considerable research value. However, sparse and delayed rewards hinder the agent's policy learning. Although recently proposed curiosity-driven exploration strengthens the agent's ability to learn, the way curiosity rewards are constructed still needs improvement. Building on curiosity-driven exploration, this paper proposes a reinforcement learning algorithm based on directional curiosity. It designs a direction detector that uses prior knowledge to guide the agent's exploration direction and attenuates the reward signal to steer the agent away from risky exploration. In experiments on Atari game environments, the algorithm achieved higher scores.

Keywords: agent decision-making; reinforcement learning; reward shaping; curiosity exploration
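The abstract does not give the form of the direction detector or of the attenuation, so the following is only a minimal sketch of how a direction-gated curiosity reward could be computed. It assumes that curiosity is measured as the prediction error of a forward model, that the prior knowledge is a set of action indices marked as risky, and that attenuation applies to the curiosity bonus. The names `curiosity_bonus`, `direction_factor`, `shaped_reward`, `risky_actions`, `beta`, and `decay` are hypothetical and do not come from the paper.

```python
from typing import Sequence, Set


def curiosity_bonus(predicted: Sequence[float], actual: Sequence[float]) -> float:
    """Intrinsic reward: mean squared error between predicted and observed next-state features."""
    if len(predicted) != len(actual) or len(actual) == 0:
        raise ValueError("feature vectors must be non-empty and of equal length")
    return sum((p - a) ** 2 for p, a in zip(predicted, actual)) / len(actual)


def direction_factor(action: int, risky_actions: Set[int], decay: float = 0.1) -> float:
    """Hypothetical direction detector: 1.0 for allowed directions, `decay` for risky ones."""
    return decay if action in risky_actions else 1.0


def shaped_reward(
    extrinsic: float,
    predicted: Sequence[float],
    actual: Sequence[float],
    action: int,
    risky_actions: Set[int],
    beta: float = 0.5,
    decay: float = 0.1,
) -> float:
    """Extrinsic reward plus a curiosity bonus that is attenuated for risky directions."""
    bonus = curiosity_bonus(predicted, actual)
    return extrinsic + beta * direction_factor(action, risky_actions, decay) * bonus
```

With this sketch, the same prediction error yields a smaller total reward when the chosen action lies in the assumed risky set, which is one way to read "attenuating the reward signal" to discourage risky exploration.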


Citation


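Liu Yiming, Hu Zheng. Research on Deep Reinforcement Learning Algorithm Based on Curiosity Exploration [EB/OL]. Beijing: Sciencepaper Online (中国科技论文在线) [2020-05-12]. https://www.paper.edu.cn/releasepaper/content/202005-63.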


Paper No.: 202005-63