基于强化学习的桥牌叫牌策略研究

陈驰; 杨放春

0
0
浏览
下载

摘要
关键词
基金信息
论文图表
动态公开评议
相关论文
评论

基于强化学习的桥牌叫牌策略研究

首发时间：2019-11-22

陈驰 ¹
陈驰(1994-)，男，硕士研究生，主要研究方向强化学习
杨放春 ¹
杨放春（1957-），男，教授、博导，主要研究方向：融合网络，车联网，服务计算与大数据，网络安全

1、北京邮电大学网络与交换国家重点实验室，北京 100876

摘要：随着人工智能理论的发展，人类在越来越多的游戏领域被人工智能打败。而定约桥牌作为棋牌类游戏中规则最为复杂的游戏，对于目前的人工智能来说仍然是难以攻克的课题，因此，研究人工智能与定约桥牌的结合是十分有意义的。本文针对定约桥牌的叫牌阶段，利用强化学习方法对叫牌过程进行研究。通过大量机器人对打的桥牌数据来训练得到的基本模型，利用PolicyGradient等相关强化学习方法对基本模型进行增强，同时引入一种叫牌过滤机制来加快强化学习模型的收敛，并对训练结果和最终的叫牌能力进行分析。

关键词：人工智能定约桥牌强化学习

For information in English, please click here

Bridge Bidbing based on Reinforcement Learning

CHEN Chi ¹
陈驰(1994-)，男，硕士研究生，主要研究方向强化学习
YANG Fangchun ¹
杨放春（1957-），男，教授、博导，主要研究方向：融合网络，车联网，服务计算与大数据，网络安全

1、Beijing University of Posts and Telecommunications,Beijing 100876

Abstract：With the development of artificial intelligence theory, human beings have been defeated by artificial intelligence in more and more game fields. As the most complicated game in the chess game, the contract bridge is still a difficult task for the current artificial intelligence. Therefore, it is very meaningful to study the combination of artificial intelligence and contract bridge. In this paper, the intensive learning method is used to study the bidding process for the bidding stage of the contract bridge. The basic model is trained by a large number of robots playing bridge data, and the basic model is enhanced by using the related reinforcement learning method such as Policy Gradient. At the same time, a bid filtering mechanism is introduced to accelerate the convergence of the reinforcement learning model, and the training results and Final bidding ability analysis

Keywords： artificial intelligence contract bridge reinforcement learning

基金：

论文图表：

引用

导出参考文献

.txt

.ris

.doc

陈驰，杨放春. 基于强化学习的桥牌叫牌策略研究[EB/OL]. 北京：中国科技论文在线 [2019-11-22]. https://www.paper.edu.cn/releasepaper/content/201911-69.

No.****

动态公开评议

共计0人参与

动态评论进行中

全部评论

0/1000

论文编号	201911-69
论文题目	基于强化学习的桥牌叫牌策略研究
文献类型
收录期刊	上传封面中文期刊英文期刊期刊名称（中文）期刊名称（英文）年，卷（）上传封面中文专著英文专著书名（中文）书名（英文）出版地出版社出版年上传封面中文译著英文译著书名（中文）书名（英文）出版地出版社出版年上传封面中文论文集英文论文集编者.论文集名称（中文） [c]. 出版地出版社出版年， - 编者.论文集名称（英文） [c]. 出版地出版社出版年，- 上传封面中文文献英文文献期刊名称（中文）期刊名称（英文）日期-- 在线地址http:// 上传封面中文文献英文文献文题（中文）文题（英文）出版地出版社,出版日期-- 上传封面中文文献英文文献文题（中文）文题（英文）出版地出版社,出版日期--
英文作者写法：中外文作者均姓前名后，姓大写，名的第一个字母大写，姓全称写出，名可只写第一个字母，其后不加实心圆点“.”, 作者之间用逗号“，”分隔，最后为实心圆点“.”, 示例1：原姓名写法：Albert Einstein,编入参考文献时写法：Einstein A. 示例2：原姓名写法：李时珍；编入参考文献时写法：LI S Z. 示例3：YELLAND R L,JONES S C,EASTON K S,et al.