[1]YIN Changsheng,YANG Ruopeng,ZHU Wei,et al.A survey on multi-agent hierarchical reinforcement learning[J].CAAI Transactions on Intelligent Systems,2020,15(4):646-655.[doi:10.11992/tis.201909027]
                                
                                Copy
                                
                                
                             
                            
                                
                                    CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
                                    15
                                    Number of periods:
                                    2020 4
                                    Page number:
                                    646-655
                                    Column:
                                    综述
                                    Public date:
                                    2020-07-05
                                
                                
                                    - Title:
 
                                    - 
                                        A survey on multi-agent hierarchical reinforcement learning
 
                                
                                
                                
                                    - Author(s):
 
                                    - 
                                        YIN Changsheng;  YANG Ruopeng;  ZHU Wei;  ZOU Xiaofei;  LI Feng
 
                                    - 
                                        School of Information and Communication, National University of Defense Technology, Wuhan 430010, China
 
                                    - 
                                
 
                                
                                    - Keywords:
 
                                    - 
                                        artificial intelligence; machine learning; reinforcement learning; multi-agent; summary; reinforcement learning; hierarchical reinforcement learning; application status
 
                                
                                
                                    - CLC:
 
                                    - 
                                        TP18
 
                                
                                
                                    - DOI:
 
                                    - 
                                        10.11992/tis.201909027
 
                                
                                
                                
                                    - Abstract:
 
                                    - 
                                        As an important research area in the field of machine learning and artificial intelligence, multi-agent hierarchical reinforcement learning (MAHRL) integrates the advantages of the collaboration of multi-agent system (MAS) and the decision making of reinforcement learning (RL) in a general-purpose form, and decomposes the RL problem into sub-problems and solves each of them to overcome the so-called curse of dimensionality. So MAHRL offers a potential way to solve large-scale and complex decision problem. In this paper, we systematically describe three key technologies of MAHRL: reinforcement learning (RL), Semi Markov Decision Process (SMDP), multi-agent reinforcement learning (MARL). We then systematically describe four main categories of the MAHRL method from the angle of hierarchical learning, which includes Option, HAM, MAXQ and End-to-End. Finally, we end up with summarizing the application status of MAHRL in robot control, game decision making and mission planning.