[1]沈 晶,顾国昌,刘海波.基于多智能体的Option自动生成算法[J].智能系统学报,2006,1(1):84-87.
SHEN Jing,GU Guo-chang,LIU Hai-bo.Algorithm for automatic constructing Option based on multi-agent[J].CAAI Transactions on Intelligent Systems,2006,1(1):84-87.
点击复制
《智能系统学报》[ISSN 1673-4785/CN 23-1538/TP] 卷:
1
期数:
2006年第1期
页码:
84-87
栏目:
学术论文—人工智能基础
出版日期:
2006-03-25
- Title:
-
Algorithm for automatic constructing Option based on multi-agent
- 文章编号:
-
1673-4785(2006)01-0084-04
- 作者:
-
沈 晶, 顾国昌, 刘海波
-
哈尔滨工程大学计算机科学与技术学院,黑龙江哈尔滨150001
- Author(s):
-
SHEN Jing,GU Guo-chang,LIU Hai-bo
-
School of Computer Science and Technology, Harbin Engineering University, Harbin 150001, China
-
- 关键词:
-
分层强化学习; 自动分层; 多智能体系统; Option; aiNet
- Keywords:
-
hierarchical reinforcement learning; automatic hierarchy; multi-agent system; Option; aiNet
- 分类号:
-
TP18
- 文献标志码:
-
A
- 摘要:
-
目前分层强化学习中的任务自动分层都是采用基于单智能体的串行学习算法,为解决串行算法学习速度较慢的问题,以Sutton的Option分层强化学习方法为基础框架,提出了一种基于多智能体的Option自动生成算法,该算法由多智能体合作对状态空间进行并行探测并集中应用aiNet实现免疫聚类产生状态子空间,然后并行学习生成各子空间上的内部策略,最终生成Option. 以二维有障碍栅格空间内2点间最短路径规划为任务背景给出了算法并进行了仿真实验和分析.结果表明,基于多智能体的Option自动生成算法速度明显快于基于单智能体的算法.
- Abstract:
-
In current hierarchical reinforcement learning, the automatic task hie rarchies are constructed by low speed serial learning algorithm based on single agent. A multi-agent based algorithm for constructing Options aut omatically was presented for speeding up the learning algorithm. The algorithm was developed on the basis of the Option HRL framework proposed by Sutton. Firstly, multiple agents cooperated in parallel exploring the state space. Then the stat e space was partitioned into several sub-spaces via immune clustering based on a iN et. Next, the agents learned the local strategies of the different subspace co ncu rrently. Consequently, the Options were constructed. The theoretical analyses an d experiments with shortest path planning in a twodimensional grid space wit h obstacles show that the speed of multiagent based algorithm for automaticall y con structing Options was obviously faster than that of singleagent based algorith ms.
备注/Memo
收稿日期:2005-12-28.
基金项目:哈尔滨工程大学基础研究基金资助项目(HEUFT05021,HEUFT05068).
作者简介:
沈??? 晶,女,1969年生,哈尔滨工程大学在读博士生.主要从事分层强化学习、人工免疫理论的研究.在国内外会议、期刊发表学术论文30余篇,参加翻译出版译著1部.
顾国昌,男,1946年生,教授,博士生导师.主要从事智能控制、智能机器人技术以及嵌入式系统研究,发表论文100余篇,并有多篇被EI、ISTP等收录.任中国人工智能学会智能机器人学会理事、黑龙江省计算机学会副理事长.
刘海波,男,1976年生,博士,IEEE专业会员,IAIA会员,中国计算机学会会员.主要从事神经心理学理论、多智能体技术与智能机器人体系结构相融合的研究,发表学术论文50余篇,出版编著3部、译著1部.
更新日期/Last Update:
2009-04-07