[1]WANG Guo-lei,ZHONG Shi-sheng,LIN Lin.Bilevel Qlearning algorithm for dynamic multimachinescheduling problems[J].CAAI Transactions on Intelligent Systems,2009,4(3):239-244.
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
4
Number of periods:
2009 3
Page number:
239-244
Column:
学术论文—智能系统
Public date:
2009-06-25
- Title:
-
Bilevel Qlearning algorithm for dynamic multimachinescheduling problems
- Author(s):
-
WANG Guo-lei; ZHONG Shi-sheng; LIN Lin
-
School of Mechanical Engineering, Harbin Institute of Technology, Harbin 150001, China
-
- Keywords:
-
dynamic multimachine scheduling; Qlearning; action set; state space division; reward function
- CLC:
-
TP273
- DOI:
-
-
- Abstract:
-
Traditional Qlearning is very effective in dynamic singlemachine scheduling problems, yet sometimes it cannot get optimal results for dynamic multimachine scheduling problems due to its lack of global vision. To resolve this, a twolayer Qlearning algorithm was put forward. The bottomlevel of Qlearning was focused on localized targets in order to learn the optimal scheduling policy which can minimize machine idleness and the mean flow time of single machines. On the other hand, the toplevel of Qlearning was focused on global targets in order to find the dispatching policy which can balance machine loads and minimize the overall tardiness of all jobs. The scheduling and dispatching rules of agents, the method for dividing state space and the reward functions were all examined. Simulation results showed that the proposed twolayer Qlearning algorithm can improve the results of dynamic multimachine scheduling problems.