[1]LIAN Chuanqiang,XU Xin,WU Jun,et al.Q-CF multiAgent reinforcement learningfor resource allocation problems[J].CAAI Transactions on Intelligent Systems,2011,6(2):95-100.
Copy

Q-CF multiAgent reinforcement learningfor resource allocation problems

References:
[1]CHONGJIE Z, LESSER V, SHENOY P. A multiAgent learning approach to resource sharing across computing clusters[R].Computer Science Department, University of Massachusetts Computer Science Amherst UMass, UMCS2008035, 2008.
[2]KO P C, LIN P C, YOU J A, et al. Multilayer allocated learning based neural network for resource allocation optimization[C]// Proceedings of the 9th Joint Conference on Information Sciences(JCIS 2006). Taibei, China, 2006: 3541.
[3]TESAURO G. Online resource allocation using decompositional reinforcement learning[C]//Proceedings of AAAI 2005. Pittsburgh, USA, 2005: 886891.
[4]LITTMAN M L, STONE P. Leading bestresponse strategies in repeated games[C]//The 17th Annual International Joint Conference on Artificial Intelligence Workshop on Economic Agents, Models, and Mechanism. Seattle, Washington, USA, 2001: 745756.
[5]HU J, WELLMAN M P. Multiagent reinforcement learning in stochastic games[OL]. Citeseer. ist. psu. edu/hu99multiagent. Html, 1999.
[6]BUSONIU L, De SCHUTTER B, BABUSKA R. Multiagent reinforcement learning with adaptive state focus[C]//Proceedings of the 17th BelgiumNetherlands Conference on Artificial Intelligence. Brussels, Belgium, 2005: 3542.
[7]KOK J R, VLASSIS N. Collaborative multiagent reinforcement learning by payoff propagation[J]. Journal of Machine Learning Research, 2006, 7: 17891828.
[8]杨佩,陈兆乾,陈世福. 机器学习在RoboCup中的应用研究[J].计算机科学, 2003, 30(6): 118121. YANG Pei, CHEN Zhaoqian, CHEN Shifu. RoboCup multiAgent system machinelearning[J].Computer Sciences, 2003, 30(6): 118121.
[9]王醒策,张汝波,顾国昌. 基于强化学习的多机器人编队方法研究[J].计算机工程, 2002, 28(6): 1516. WANG Xingce, ZHANG Rubo, GU Guochang. Research on multiAgent team formation based on reinforcement learning[J].Computer Engineering, 2002, 28(6): 1516.
[10]HU J, WELLMAN M P. Nash Qlearning for generalsum stochastic games[J]. Journal of Machine Learning Research, 2003, 4: 10391069.
[11]ALPAYDM E. 机器学习导论[M]. 范明,等译. 北京:北京工业出版社, 2009: 244255.
?[12]LAGOUDAKIS M G, PARR R. Leastsquares policy iteration[J]. Journal of Machine Learning Research, 2003 (4): 11071149.
[13]XU X, HU D W, LU X C. Kernel based leastsquares policy iteration[J]. IEEE Transactions on Neural Networks, 2007, 18(4): 973992.
Similar References:

Memo

-

Last Update: 2011-05-19

Copyright © CAAI Transactions on Intelligent Systems