[1]高强,徐心和,王昊,等.一种基于经验的德州扑克博弈系统架构[J].智能系统学报,2020,15(3):468-474.[doi:10.11992/tis.201803043]
GAO Qiang,XU Xinhe,WANG Hao,et al.System architecture of Texas Hold’em based on experience[J].CAAI Transactions on Intelligent Systems,2020,15(3):468-474.[doi:10.11992/tis.201803043]
点击复制
《智能系统学报》[ISSN 1673-4785/CN 23-1538/TP] 卷:
15
期数:
2020年第3期
页码:
468-474
栏目:
学术论文—智能系统
出版日期:
2020-05-05
- Title:
-
System architecture of Texas Hold’em based on experience
- 作者:
-
高强1, 徐心和2, 王昊3, 白国力3, 曹瑞珉3
-
1. 沈阳大学 辽宁省装备制造综合自动化重点实验室,辽宁 沈阳 110044;
2. 东北大学 信息科学与工程学院,辽宁 沈阳 110819;
3. 东北大学 机械工程与自动化学院,辽宁 沈阳 110819
- Author(s):
-
GAO Qiang1, XU Xinhe2, WANG Hao3, BAI Guoli3, CAO Ruimin3
-
1. Key Laboratory of Manufacturing Industrial Integrated Automation, Shenyang University, Shenyang 110044, China;
2. College of Information Science and Engineering, Northeastern University, Shenyang 110819, China;
3. School of Mechanical Engineering and Automation, Northeastern University, Shenyang 110819, China
-
- 关键词:
-
二人赌注无上限德州扑克; 计算机博弈; 非完全信息动态博弈; 博弈树; 深度学习; 专家库; 哈希表; 博弈策略
- Keywords:
-
Heads-up no-limit Texas Hold’em; computer games; dynamic game with imperfect information; game tree; deep learning; expert database; Hash table; game strategy
- 分类号:
-
TP301.5
- DOI:
-
10.11992/tis.201803043
- 摘要:
-
为了利用历史经验知识提高德州扑克博弈水平,提出一种二人赌注无上限的德州扑克博弈系统架构:对于知识库模块,利用海量历史牌局训练得到基于CNN的深度学习网络模型并构建了一个专家经验库;在系统的搜索模块中,构建了一种分阶段的德州扑克博弈树,利用专家经验和历史经验引导德州扑克博弈树的展开;对于系统的估值核心模块,构建了一种基于哈希技术的牌型对照表,以提高系统判定胜负的效率。实验结果表明本文提出的博弈系统架构具有更高的对弈水平。
- Abstract:
-
To improve the level of Texas Hold’em through historical experience, this paper proposes a system architecture of heads-up no-limit Texas Hold’em for the knowledge base module. Mass historic games are used to train the deep learning network based on convolutional neural network, and an expert database is constructed for the search module of the system. Texas Hold’em structured game tree is developed and extended, and it is applied in terms of the expertise and historical experience to the core module for evaluation. A hand-ranking hash-based table is built to reduce the time required to evaluate hand rankings. The experimental result shows a higher playing level for the proposed system architecture.
更新日期/Last Update:
1900-01-01