Adaptive dynamic programming for online solution of a zero-sum differential game

来源 :Journal of Control Theory and Applications | 被引量 : 0次 | 上传用户:qq664374004
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
This paper will present an approximate/adaptive dynamic programming(ADP) algorithm,that uses the idea of integral reinforcement learning(IRL),to determine online the Nash equilibrium solution for the two-player zerosum differential game with linear dynamics and infinite horizon quadratic cost.The algorithm is built around an iterative method that has been developed in the control engineering community for solving the continuous-time game algebraic Riccati equation(CT-GARE),which underlies the game problem.We here show how the ADP techniques will enhance the capabilities of the offline method allowing an online solution without the requirement of complete knowledge of the system dynamics.The feasibility of the ADP scheme is demonstrated in simulation for a power system control application.The adaptation goal is the best control policy that will face in an optimal manner the highest load disturbance. This paper will present an approximate / adaptive dynamic programming (ADP) algorithm, that uses the idea of ​​integral reinforcement learning (IRL), to determine online the Nash equilibrium solution for the two-player zerosum differential game with linear dynamics and infinite horizon quadratic cost The algorithm is built around an iterative method that has been developed in the control engineering community for solving the continuous-time game algebraic Riccati equation (CT-GARE), which underlies the game problem.We show show the the ADP techniques will enhance the capabilities of the offline method allow an online solution without the requirement of complete knowledge of the system dynamics. the feasibility of the ADP scheme is demonstrated in simulation for a power system control application. adaptation goal is the best control policy that will face in an optimal manner the highest load disturbance.
其他文献
  介绍了辊式挤压成型机的工作原理和产品特点,重点阐述了机架设计中的要点与细节问题,并以中800×340挤压成型机为例计算得到侧臂支架的应力与变形分布。验证了机架结构设计
近年来,重症支气管哮喘的发病率和病死率都呈上升趋势,从其发作特点来看,该病发作突然,及时处理即可缓解,早期的干预可缩短发作时间和患者的痛苦,防止症状恶化和致死性发作,
本文以经济学、文化学的角度,对内蒙古自治区苏木、嘎查地区开展体育旅游的条件进行研究和分析。在新的时期下,如何充分利用内蒙古自治区苏木、嘎查地区现有的资源优势,对这
本文在介绍了RSS的含义、功能、特点、使用方法基础上,重点分析介绍了RSS在高校图书馆个性化信息服务中的应用.
介绍了Ad Hoc网络路由协议(Ad Hoc On-Demand Distance Vector Routing,AODV)的路由机制和特点,提出了一种基于AODV的改进协议V-AODV。V-AODV改进了AODV的路由测度,用量化的节点移动性代替跳数作为路由代价的度量,在动态网络中能够选取到更加稳定的路由。仿真表明,V-AODV比AODV在时延和重传次数等性能上都有一定程度的提升。
文以载道,情为心声。一篇好的课文,必定洋溢着感人的激情。即使是议论体裁的文章,也会蕴含着对人生、对社会、对自然、对真理的一往深情。中学语文教学中,应该始终贯穿情感教
高中物理“培养学生运用数学处理物理问题的能力”的要求是:学生能理解公式和图像的物理意义,能运用数学进行逻辑推理,得出物理结论,要学会用图像表达和处理问题;能进行定量
本文通过研究,指出如何实现后勤系统市场化社会化转型发展,对于促进后勤系统服务质量的提升,提升后勤系统健康快速发展有着积极的意义。 This paper points out that how to
企业社会责任在本质上是企业文化中的价值观念,是企业文化的重要组成部分。企业要永续经营,就必须在企业文化中体现和渗透出企业要肩负的社会责任。本文就从我国企业社会责任入
本文阐述了公共数学教学在我校教学工作中的重要性,提出了构建公共数学教学和学习平台的一些设想.