Adaptive dynamic programming for online solution of a zero-sum differential game

Source: Journal of Control Theory and Applications


Authors: Draguna VRABIE, Frank LEWIS

Affiliations: United Technologies Research Center, East Hartford; Automation and Robotics Research Institute, Univers

Published in: Journal of Control Theory and Applications

Publication date: 2011, Issue 03

Keywords: Approximate/Adaptive dynamic programming; Game algebraic Riccati equation; Zero-su


Abstract

This paper presents an approximate/adaptive dynamic programming (ADP) algorithm that uses the idea of integral reinforcement learning (IRL) to determine online the Nash equilibrium solution for the two-player zero-sum differential game with linear dynamics and infinite-horizon quadratic cost. The algorithm is built around an iterative method that has been developed in the control engineering community for solving the continuous-time game algebraic Riccati equation (CT-GARE), which underlies the game problem. We show here how the ADP techniques enhance the capabilities of the offline method, allowing an online solution without requiring complete knowledge of the system dynamics. The feasibility of the ADP scheme is demonstrated in simulation for a power system control application. The adaptation goal is the best control policy that will face, in an optimal manner, the highest load disturbance.
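To make the idea of an iterative CT-GARE solver concrete, the following is a minimal offline sketch for a scalar game. It is an illustration under stated assumptions, not the paper's algorithm: the scalar dynamics dx/dt = a*x + b2*u + b1*w, the cost integrand q*x^2 + u^2 - w^2, the function names, and the numeric values are all hypothetical. Under those assumptions the GARE reduces to 2*a*P + q - (b2^2 - b1^2)*P^2 = 0 with b2^2 > b1^2. The iteration starts from P = 0 and repeatedly adds a correction Z obtained from a standard single-player Riccati equation, which is one Newton-like scheme of the kind the abstract alludes to. Unlike the online ADP method, this sketch uses full knowledge of the drift coefficient a.

```python
import math


def gare_residual(p, a, b1, b2, q):
    """Left-hand side of the scalar GARE: 2*a*p + q - (b2**2 - b1**2) * p**2."""
    return 2.0 * a * p + q - (b2**2 - b1**2) * p**2


def solve_scalar_gare(a, b1, b2, q, tol=1e-12, max_iter=50):
    """Iterate P <- P + Z, where Z solves a standard single-player Riccati equation."""
    p = 0.0  # initial guess P = 0
    for _ in range(max_iter):
        residual = gare_residual(p, a, b1, b2, q)
        # Closed-loop coefficient when both players act on the current P.
        a_cl = a - (b2**2 - b1**2) * p
        # Z is the non-negative root of 2*a_cl*Z - b2**2 * Z**2 + residual = 0,
        # written in a form that avoids cancellation when a_cl < 0.
        z = residual / (math.sqrt(a_cl**2 + b2**2 * residual) - a_cl)
        p += z
        if abs(z) < tol:
            break
    return p


# Hypothetical example values, not taken from the paper.
a, b1, b2, q = -1.0, 0.5, 1.0, 1.0
p_iter = solve_scalar_gare(a, b1, b2, q)

# Closed-form stabilizing root of the scalar quadratic, for comparison.
s = b2**2 - b1**2
p_exact = (a + math.sqrt(a**2 + q * s)) / s
print(p_iter, p_exact)
```

In this scalar setting the corresponding saddle-point policies would be u = -b2*P*x for the controller and w = b1*P*x for the disturbance. The matrix case replaces each scalar solve with a matrix Riccati solve.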
