The Cooperative Multi-agent Learning with Random Reward Values

Source: Journal of Shanghai Jiaotong University

Abstract: This paper investigated how to learn the optimal action policies in cooperative multiagent systems when the agents' rewards are random variables, and proposed a general two-stage learning algorithm for cooperative multiagent decision processes.

Authors: Zhang Huaxiang (张化祥), Huang Shangteng (黄上腾)

Institution: Dept. of Computer Science and Eng.

Published in: Journal of Shanghai Jiaotong University

Publication date: 2005, Issue 2

Keywords: reinforcement learning; random reward; multi-agent; Markov decision; game


Excerpt from the paper

This paper investigated how to learn the optimal action policies in cooperative multiagent systems if the agents' rewards are random variables, and proposed a general two-stage learning algorithm for cooperative multiagent decision processes. The algo
