Multiagent Reinforcement Learning:Rollout and Policy Iteration

来源 :自动化学报:英文版 | 被引量 : 0次 | 上传用户:wenshicai2009
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
We discuss the solution of complex multistage decision problems using methods that are based on the idea of policy iteration(PI),i.e.,start from some base policy and generate an improved policy.Rollout is the simplest method of this type,where just one im
其他文献
The control of battery energy storage systems(BESSs)plays an important role in the management of microgrids.In this paper,the problem of balancing the state-ofc
This paper investigates the distributed fault-tolerant containment control(FTCC)problem of nonlinear multi-agent systems(MASs)under a directed network topology.
In this paper,an adaptive dynamic programming(ADP)strategy is investigated for discrete-time nonlinear systems with unknown nonlinear dynamics subject to input
Embedded systems have numerous applications in everyday life.Petri-net-based representation for embedded systems(PRES+)is an important methodology for the model
This paper proposes an adaptive sliding mode observer(ASMO)-based approach for wind turbines subject to simultaneous faults in sensors and actuators.The propose
In this paper,we elaborate on residual-driven Fuzzy C-Means(FCM)for image segmentation,which is the first approach that realizes accurate residual(noise/outlier
The rise of multi-cloud systems has been spurred.For safety-critical missions,it is important to guarantee their security and reliability.To address trust const
Configuration evaluation is a key technology to be considered in the design of multiple aircrafts formation(MAF)configurations with high dynamic properties in e
Multi-agent systems(MASs)are typically composed of multiple smart entities with independent sensing,communication,computing,and decision-making capabilities.Now
This paper aims at eliminating the asymmetric and saturated hysteresis nonlinearities by designing hysteresis pseudo inverse compensator and robust adaptive dyn