Multiresolution state-space discretization for Q-Learning with pseudorandomized discretization

来源 :Journal of Control Theory and Applications | 被引量 : 0次 | 上传用户：ssskstar

【摘要】

：

A multiresolution state-space discretization method with pseudorandom gridding is developed for the episodic unsupervised learning method of Q-learning.It is us

【作者】

：

Amanda LAMPTON John VALASEK Mrinal KUMAR

【机构】

：

Systems Technology,Inc.,13766 S.Hawthorne Blvd,Hawthorne,CA 90250,U.S.A.,Department of Aerospace Eng

【出处】

：

Journal of Control Theory and Applications

【发表日期】

：

2011年03期

【关键词】

：

Reinforcement learning Morphing Random grid

下载到本地 , 更方便阅读

下载此文赞助VIP

声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架

论文部分内容阅读

A multiresolution state-space discretization method with pseudorandom gridding is developed for the episodic unsupervised learning method of Q-learning.It is used as the learning agent for closed-loop control of morphing or highly reconfigurable systems.This paper develops a method whereby a state-space is adaptively discretized by progressively finer pseudorandom grids around the regions of interest within the state or learning space in an effort to break the Curse of Dimensionality.Utility of the method is demonstrated with application to the problem of a morphing airfoil,which is simulated by a computationally intensive computational fiuid dynamics model.By setting the multiresolution method to define the region of interest by the goal the agent seeks,it is shown that this method with the pseudorandom grid can learn a specific goal within ±0.001 while reducing the total number of state-action pairs needed to achieve this level of specificity to less than 3000. A multiresolution state-space discretization method with pseudorandom gridding is developed for the episodic unsupervised learning method of Q-learning. It is used as the learning agent for closed-loop control of morphing or highly reconfigurable systems. This paper develops a method whereby a state -space is adaptively discretized by progressively finer pseudorandom grids around the regions of interest within the state or learning space in an effort to break the Curse of Dimensionality. Utility of the method is demonstrated with application to the problem of a morphing airfoil, which is simulated by a computationally intensive computational fiuid dynamics model. By setting the multiresolution method to define the region of interest by the goal the agent seeks, it is shown that this method with the pseudorandom grid can learn a specific goal within ± 0.001 while reducing the total number of state-action pairs needed to achieve this level of specificity to less than 3000.

其他文献

介入治疗肺结核大咯血的护理

目的：总结经导管行部分支气管动脉造影栓塞术{BAE}治疗肺结核大咯血的护理。方法采用Seldinger技术对27例行部分支气管动脉栓塞术治疗的肺结核大咯血患者术前进行必要的检查及

期刊

肺结核大咯血介入治疗护理

高性能混凝土在现代建筑施工中的应用

高性能混凝土是近期混凝土技术发展的主要方向,高性能混凝土是具有某些性能要求的匀质混凝土,必须采用严格的施工工艺,采用优质材料配制,便于浇捣、不离析、力学性能稳定、早

期刊

高性能混凝土建筑工程应用设计

急诊ESWL治疗输尿管结石214临床分析

目的：对我院2003年5月～2013年5月共10年急诊体外震波碎石术(ESWL)治疗输尿管结石214的临床分析，了解治疗过程中的失误，提高该治疗的效率。方法对457例疑诊输尿管结石的患者行急诊

期刊

体外震波碎石术输尿管结石临床分析

胰管结石慢性胰腺炎的诊断和外科治疗（30例）

目的：探讨胰管结石慢性胰腺的诊断和外科治疗。方法收集我院1997年9月至2012年10月间经手术治疗的胰管结石慢性胰腺炎患者30例的临床资料进行回顾性分析。结果全组病例均经B超

期刊

胰管结石慢性胰腺炎胆道疾病B超CT磁共振胆胰管成像

营养膳食护理在糖尿病患者的应用

糖尿病是一种常见的内分泌疾病，病因复杂，病理基础主要是由于胰岛素绝对或相对分泌不足而引起的血糖及尿糖升高。有一些患者出现多饮、多尿、多食的“三多”症状，重症患者可伴有

期刊

糖尿病营养膳食护理

采血工作人员的职业危害及防护研究

目的：探讨采血工作人员职业危害的防护方法，提高对职业危害的防护意识。方法加强宣教，对采血工作人员进行健康教育，配备防护用具，执行标准化、科学化、制度化的管理体系。结果通过

期刊

采血工作人员职业危害防护研究

参松养心胶囊联合缬沙坦治疗阵发性房颤疗效观察

目的：探讨参松养心胶囊联合缬沙坦治疗阵发性房颤维持窦性心律的有效性、安全性。方法将140例阵发性房颤患者随机分为2组，实验组70例，采用参松养心胶囊联合颉沙坦，对照组70例采用

期刊

心房颤动阵发性参松养心胶囊缬沙坦胺碘酮Atrial fibrillationParoxysmalShensongyangxin capsuleV

中风患者的护理体会

中风是致残率、致死率较高的一种疾病。其起病急，病情发展迅速，病程长，恢复慢。本文采用中西医结合治疗和护理80例中风病人，取得了良好的效果，降低了患者的致残率，提高了患者的生活

期刊

中风护理体会

建筑屋面防水技术及质量控制综述

本文结合长期的实践经验,分别从设计和施工两个方面对屋面防水工程质量控制进行了阐述,供大家参考.

期刊

屋面防水问题施工方法

纤维桩对上前牙龈下牙折行牙合向牵引后全冠修复的临床观察

目的：探讨上前牙龈下牙折，根管治疗后，利用纤维桩对患牙根正畸牵引牙合向伸长后行全冠修复的临床应用。方法对15例上前牙折达龈下2-3m m的患者，根管治疗后，在根管内置纤维桩，形成树

期刊

纤维桩前牙冠折正畸牵引全冠修复

Multiresolution state-space discretization for Q-Learning with pseudorandomized discretization

与本文相关的学术论文