[1]沈 晶,顾国昌,刘海波.基于多智能体的Option自动生成算法[J].智能系统学报,2006,1(1):84-87.
 SHEN Jing,GU Guo-chang,LIU Hai-bo.Algorithm for automatic constructing Option based on multi-agent[J].CAAI Transactions on Intelligent Systems,2006,1(1):84-87.
点击复制

基于多智能体的Option自动生成算法

参考文献/References:
[1] BARTO A G, MAHADEVAN S. Recent advances in hierarchical reinforcement le arni ng[J]. Discrete Event Dynamic Systems: Theory and Applications, 2003,13(4): 41-77.
[2] SUTTON R S, PRECUP D, SINGH S P. Between MDPs and semi-MDPs: a framew ork for temporal abstraction in reinforcement learning[J]. Artificial Intelligence, 1 999,112(1): 181-211.
[3] PARR R. Hierarchical control and learning for Markov decision processes [D]. Berkeley: University of California, 1998.
[4] DIETTERICH T G. Hierarchical reinforcement learning with the MAXQ value func tion decomposition[J]. Journal of Artificial Intelligence Research, 2000,13(1) : 227-303.
[5] DIGNEY B L. Learning hierarchical control structures for multiple tas ks and changing environments[A]. Proc of the 5th International Conference on Simulat ion of Adaptive Behavior[C]. Zurich, Switzerland, 1998.
[6] MCGOVERN A, BARTO A. Autonomous discovery of subgoals in reinforcem ent learn ing using diverse density[A]. Proc of the 8th International Conference on Mac hine Learning[C]. San Fransisco: Morgan Kaufmann, 2001.
[7] MENACHE I, MANNOR S, SHIMKIN N. Qcut: dynamic discovery of sub-goal s in rei nforcement learning[A]. Proc the 13th European Conference on Machine Learning [C]. Helsinki, Finland, 2002.
[8] MANNOR S, MENACHE I, HOZE A, et al. Dynamic abstraction in reinforce ment lea rning via clustering[A]. Proc of the 21th International Conference on Machine Learning[C]. Banff, Canada, 2004.
[9] DE CASTRO L N, VON ZUBEN F N. An evolutionary immune network for data cluste ring[A]. Proc of the IEEE Brazilian Symposium on Artificial Neural Networks[ C]. Rio de Janeiro, Brazil, 2000.
相似文献/References:
[1]周文吉,俞扬.分层强化学习综述[J].智能系统学报,2017,12(5):590.[doi:10.11992/tis.201706031]
 ZHOU Wenji,YU Yang.Summarize of hierarchical reinforcement learning[J].CAAI Transactions on Intelligent Systems,2017,12(1):590.[doi:10.11992/tis.201706031]
[2]殷昌盛,杨若鹏,朱巍,等.多智能体分层强化学习综述[J].智能系统学报,2020,15(4):646.[doi:10.11992/tis.201909027]
 YIN Changsheng,YANG Ruopeng,ZHU Wei,et al.A survey on multi-agent hierarchical reinforcement learning[J].CAAI Transactions on Intelligent Systems,2020,15(1):646.[doi:10.11992/tis.201909027]

备注/Memo

收稿日期:2005-12-28.
基金项目:哈尔滨工程大学基础研究基金资助项目(HEUFT05021,HEUFT05068).
作者简介:
沈    晶,女,1969年生,哈尔滨工程大学在读博士生.主要从事分层强化学习、人工免疫理论的研究.在国内外会议、期刊发表学术论文30余篇,参加翻译出版译著1部.
顾国昌,男,1946年生,教授,博士生导师.主要从事智能控制、智能机器人技术以及嵌入式系统研究,发表论文100余篇,并有多篇被EI、ISTP等收录.任中国人工智能学会智能机器人学会理事、黑龙江省计算机学会副理事长.
刘海波,男,1976年生,博士,IEEE专业会员,IAIA会员,中国计算机学会会员.主要从事神经心理学理论、多智能体技术与智能机器人体系结构相融合的研究,发表学术论文50余篇,出版编著3部、译著1部.

更新日期/Last Update: 2009-04-07
Copyright @ 《 智能系统学报》 编辑部
地址:(150001)黑龙江省哈尔滨市南岗区南通大街145-1号楼 电话:0451- 82534001、82518134