[1]SHEN Jing,GU Guo-chang,LIU Hai-bo.Algorithm for automatic constructing Option based on multi-agent[J].CAAI Transactions on Intelligent Systems,2006,1(1):84-87.
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
1
Issue:
2006, No. 1
Page number:
84-87
Column:
Academic Papers: Fundamentals of Artificial Intelligence
Publication date:
2006-03-25
- Title:
-
Algorithm for automatic constructing Option based on multi-agent
- Author(s):
-
SHEN Jing; GU Guo-chang; LIU Hai-bo
-
School of Computer Science and Technology, Harbin Engineering University, Harbin 150001, China
-
- Keywords:
-
hierarchical reinforcement learning; automatic hierarchy; multi-agent system; Option; aiNet
- CLC:
-
TP18
- DOI:
-
-
- Abstract:
-
In current hierarchical reinforcement learning (HRL), task hierarchies are constructed automatically by slow, serial learning algorithms based on a single agent. To speed up learning, a multi-agent algorithm for constructing Options automatically is presented. The algorithm builds on the Option HRL framework proposed by Sutton. First, multiple agents cooperate to explore the state space in parallel. The state space is then partitioned into several subspaces by immune clustering based on aiNet. Next, the agents learn the local strategies of the different subspaces concurrently, and from these the Options are constructed. Theoretical analysis and experiments on shortest-path planning in a two-dimensional grid space with obstacles show that the multi-agent algorithm constructs Options clearly faster than single-agent algorithms.
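The pipeline outlined in the abstract (explore, partition, build one Option per subspace) might be sketched as follows. This is an illustrative sketch only, not the authors' implementation: plain k-means on grid coordinates is swapped in for aiNet immune clustering, the agents are simulated one after another rather than in parallel, and the local strategy of each Option is left empty instead of being learned. All names (`Option`, `explore`, `partition`, `build_options`), the grid size, and the obstacle layout are hypothetical.

```python
import random
from dataclasses import dataclass, field

MOVES = [(0, 1), (0, -1), (1, 0), (-1, 0)]


@dataclass
class Option:
    """Option with an initiation set, a local policy, and a termination rule."""
    initiation: frozenset                       # states where the Option may start
    policy: dict = field(default_factory=dict)  # state -> move; left unlearned here

    def terminates(self, state) -> bool:
        # Assumption: the Option ends once the agent leaves its subspace.
        return state not in self.initiation


def explore(size, obstacles, n_agents, steps, seed=0):
    """Random-walk exploration by several agents (simulated sequentially)."""
    rng = random.Random(seed)
    free = [(x, y) for x in range(size) for y in range(size)
            if (x, y) not in obstacles]
    visited = set()
    for _ in range(n_agents):
        s = rng.choice(free)
        visited.add(s)
        for _ in range(steps):
            dx, dy = rng.choice(MOVES)
            nxt = (s[0] + dx, s[1] + dy)
            inside = 0 <= nxt[0] < size and 0 <= nxt[1] < size
            if inside and nxt not in obstacles:
                s = nxt
            visited.add(s)
    return visited


def partition(states, k, iters=20):
    """Plain k-means on coordinates: a stand-in for aiNet immune clustering."""
    ordered = sorted(states)
    centers = [ordered[i * len(ordered) // k] for i in range(k)]
    clusters = []
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for s in ordered:
            nearest = min(
                range(k),
                key=lambda i: (s[0] - centers[i][0]) ** 2
                + (s[1] - centers[i][1]) ** 2,
            )
            clusters[nearest].append(s)
        # Recompute each center as the mean of its cluster; keep it if empty.
        centers = [
            (sum(x for x, _ in c) / len(c), sum(y for _, y in c) / len(c))
            if c else centers[i]
            for i, c in enumerate(clusters)
        ]
    return [frozenset(c) for c in clusters if c]


def build_options(size=8, obstacles=frozenset({(3, 3), (3, 4), (4, 3)}),
                  n_agents=4, steps=200, k=2):
    """Explore the grid, partition the visited states, wrap each part as an Option."""
    visited = explore(size, obstacles, n_agents, steps)
    return [Option(initiation=c) for c in partition(visited, k)]
```

Calling `build_options()` returns a list of `Option` objects whose initiation sets are disjoint subspaces of the explored grid. In a fuller version, each agent would then run a learner such as Q-learning restricted to one subspace to fill in `policy`.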