<-Previous Article Next Article->

[1]LIAN Chuanqiang,XU Xin,WU Jun,et al.Q-CF multiAgent reinforcement learningfor resource allocation problems[J].CAAI Transactions on Intelligent Systems,2011,6(2):95-100.

Copy

Q-CF multiAgent reinforcement learningfor resource allocation problems

PDF Download HTML

CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume: 6 Number of periods: 2011 2 Page number: 95-100 Column: 学术论文—机器学习 Public date: 2011-04-25

Title:: Q-CF multiAgent reinforcement learningfor resource allocation problems

Author(s):: LIAN Chuanqiang; XU Xin; WU Jun; LI Zhaobin; College of Mechatronics and Automation, National University of Defense Technology, Changsha 410073, China

Keywords:: multiAgent system; reinforcement learning; resource allocation; cooperation control

CLC:: TP391.1

DOI:: -

Abstract:: When a multiAgent reinforcement learning algorithm is used in complex distributed systems, problems such as huge state space and low learning efficiency arise. In this paper, a multiAgent reinforcement learning algorithm was studied for the resource allocation problem in a network environment. By combining the Qlearning algorithm and the chain feedback learning mechanism, a novel QCF multiAgent reinforcement learning algorithm was presented. In the QCF algorithm, multiAgent cooperation was realized based on the mechanism of information chain feedback. Simulation results show that compared with the multiAgent Qlearning algorithm in existence, the proposed algorithm in this paper has a faster convergence speed while at the same time ensures the performance optimization of cooperation policy.

References:: ［1］CHONGJIE Z, LESSER V, SHENOY P. A multiAgent learning approach to resource sharing across computing clusters［R］.Computer Science Department， University of Massachusetts Computer Science Amherst UMass, UMCS2008035, 2008.
［2］KO P C, LIN P C, YOU J A, et al. Multilayer allocated learning based neural network for resource allocation optimization［C］// Proceedings of the 9th Joint Conference on Information Sciences(JCIS 2006). Taibei, China， 2006: 3541.
［3］TESAURO G. Online resource allocation using decompositional reinforcement learning［C］//Proceedings of AAAI 2005. Pittsburgh, USA, 2005： 886891.
［4］LITTMAN M L, STONE P. Leading bestresponse strategies in repeated games［C］//The 17th Annual International Joint Conference on Artificial Intelligence Workshop on Economic Agents, Models, and Mechanism. Seattle, Washington, USA, 2001: 745756.
［5］HU J, WELLMAN M P. Multiagent reinforcement learning in stochastic games［OL］. Citeseer. ist. psu. edu/hu99multiagent. Html, 1999.
［6］BUSONIU L, De SCHUTTER B, BABUSKA R. Multiagent reinforcement learning with adaptive state focus［C］//Proceedings of the 17th BelgiumNetherlands Conference on Artificial Intelligence. Brussels, Belgium, 2005: 3542.
［7］KOK J R, VLASSIS N. Collaborative multiagent reinforcement learning by payoff propagation［J］. Journal of Machine Learning Research, 2006, 7: 17891828.
［8］杨佩，陈兆乾，陈世福. 机器学习在RoboCup中的应用研究［J］.计算机科学, 2003, 30(6): 118121. YANG Pei， CHEN Zhaoqian, CHEN Shifu. RoboCup multiAgent system machinelearning［J］.Computer Sciences, 2003, 30(6): 118121.
［9］王醒策，张汝波，顾国昌. 基于强化学习的多机器人编队方法研究［J］.计算机工程, 2002, 28(6)： 1516. WANG Xingce, ZHANG Rubo, GU Guochang. Research on multiAgent team formation based on reinforcement learning［J］.Computer Engineering, 2002, 28(6): 1516.
［10］HU J, WELLMAN M P. Nash Qlearning for generalsum stochastic games［J］. Journal of Machine Learning Research, 2003, 4: 10391069.
［11］ALPAYDM E. 机器学习导论［M］. 范明，等译. 北京:北京工业出版社, 2009: 244255.
?［12］LAGOUDAKIS M G, PARR R. Leastsquares policy iteration［J］. Journal of Machine Learning Research, 2003 (4): 11071149.
［13］XU X, HU D W, LU X C. Kernel based leastsquares policy iteration［J］. IEEE Transactions on Neural Networks, 2007, 18(4): 973992.

Similar References:

Memo

Last Update: 2011-05-19

Q-CF multiAgent reinforcement learningfor resource allocation problems PDF DownloadHTML

Memo

Q-CF multiAgent reinforcement learningfor resource allocation problems

PDF Download HTML