References:
[1] BARTO A G, MAHADEVAN S. Recent advances in hierarchical reinforcement learning[J]. Discrete Event Dynamic Systems: Theory and Applications, 2003, 13(4): 41-77.
[2] SUTTON R S, PRECUP D, SINGH S P. Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning[J]. Artificial Intelligence, 1999, 112(1): 181-211.
[3] PARR R. Hierarchical control and learning for Markov decision processes[D]. Berkeley: University of California, 1998.
[4] DIETTERICH T G. Hierarchical reinforcement learning with the MAXQ value function decomposition[J]. Journal of Artificial Intelligence Research, 2000, 13(1): 227-303.
[5] DIGNEY B L. Learning hierarchical control structures for multiple tasks and changing environments[A]. Proc of the 5th International Conference on Simulation of Adaptive Behavior[C]. Zurich, Switzerland, 1998.
[6] MCGOVERN A, BARTO A. Autonomous discovery of subgoals in reinforcement learning using diverse density[A]. Proc of the 18th International Conference on Machine Learning[C]. San Francisco: Morgan Kaufmann, 2001.
[7] MENACHE I, MANNOR S, SHIMKIN N. Qcut: dynamic discovery of sub-goals in reinforcement learning[A]. Proc of the 13th European Conference on Machine Learning[C]. Helsinki, Finland, 2002.
[8] MANNOR S, MENACHE I, HOZE A, et al. Dynamic abstraction in reinforcement learning via clustering[A]. Proc of the 21st International Conference on Machine Learning[C]. Banff, Canada, 2004.
[9] DE CASTRO L N, VON ZUBEN F N. An evolutionary immune network for data clustering[A]. Proc of the IEEE Brazilian Symposium on Artificial Neural Networks[C]. Rio de Janeiro, Brazil, 2000.
Related articles:
[1] ZHOU Wenji, YU Yang. Summarize of hierarchical reinforcement learning[J]. CAAI Transactions on Intelligent Systems, 2017, 12(05): 590. [doi:10.11992/tis.201706031]
[2] YIN Changsheng, YANG Ruopeng, ZHU Wei, et al. A survey on multi-agent hierarchical reinforcement learning[J]. CAAI Transactions on Intelligent Systems, 2020, 15(04): 646. [doi:10.11992/tis.201909027]