[1]YU Kui,WANG Hao,YAO Hong-liang.A dynamic influence diagram for dynamic decision processes[J].CAAI Transactions on Intelligent Systems,2008,3(2):159-166.
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
3
Number of periods:
2008 2
Page number:
159-166
Column:
学术论文—人工智能基础
Public date:
2008-04-25
- Title:
-
A dynamic influence diagram for dynamic decision processes
- Author(s):
-
YU Kui1; 2; WANG Hao2; YAO Hong-liang2
-
1.Department of Computer Science, Institute of Textile and Garment of Changzho u, Changzhou 213164, China;
2. School of Computer and Information, Hefei Univers ity of Technology, Hefei 230009, China
-
- Keywords:
-
dynamic Bayesian networks; influence diagrams; Markov decision process; partially observable Markov decision process; dynamic influence diagram
- CLC:
-
TP181
- DOI:
-
-
- Abstract:
-
Computational complexities in strategy space and state space make the partially observable Markov decision process (POMDP) an NPhard problem. Therefore, in this paper, a dynamic influence diagram is proposed to model the decisionmaking problem with a single agent, in which a directed acyclic diagram is used to express the complex relationships between systematic variables. Firstly, a dynamic Bayesian network is used to represent the transition and observation models so as to reduce the state space of the system. Secondly, in order to reduce the representational complexity of the utility function, it is expressed in terms of utility nodes. Finally, the actions of the system are represented with decision nodes to simplify the strategy space. The dynamic influence diagram is compared with the POMDP using these three aspects. Our research indicates that a dynamic influence diagram provides a simple way to express POMDP problems. Experiments in the Robocup environment verified the effectiveness of the proposed model.