【Abstract】: A new algorithm is proposed that may sacrifice the optimality of control policies in order to obtain robust solutions.
【Affiliation】: College of Computer and Communicational Engineering, College of Sciences
Excerpt from the paper
A new algorithm is proposed that may sacrifice the optimality of control policies in order to obtain robust solutions. Robustness can become a very important property of a learning system when the theoretical model does not match the actual physical system, when the actual system is not static, or when the availability of a control action changes over time. The main contribution is a set of approximation algorithms together with their convergence results. A generalized average operator is used in place of the usual optimality operator max (or min) to study an important class of learning algorithms, namely dynamic programming algorithms, and their convergence is discussed from a theoretical point of view. The aim of this research is to improve the robustness of reinforcement learning algorithms theoretically.
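To make the idea of replacing max with a generalized average concrete, the following is a minimal sketch and not the paper's algorithm. The abstract does not define its operator, so the log-mean-exp average used here (which tends to the arithmetic mean for small `omega` and to max for large `omega`) is an assumption, as are the toy two-state model `P`, `R`, the discount `gamma`, and all function names.

```python
import math


def generalized_average(values, omega):
    """Log-mean-exp average (an assumed stand-in for the paper's operator).

    Tends to the arithmetic mean as omega -> 0 and to max as omega -> infinity.
    """
    m = max(values)
    # Subtract the maximum before exponentiating for numerical stability.
    mean_exp = sum(math.exp(omega * (v - m)) for v in values) / len(values)
    return m + math.log(mean_exp) / omega


def value_iteration(P, R, gamma, backup, tol=1e-10, max_iter=10000):
    """Value iteration where `backup` aggregates the action values of a state.

    P[s][a] is a list of next-state probabilities, R[s][a] an immediate reward.
    Passing backup=max gives the standard dynamic programming recursion.
    """
    V = [0.0] * len(P)
    for _ in range(max_iter):
        new_V = []
        for s in range(len(P)):
            q = [
                R[s][a] + gamma * sum(p * V[t] for t, p in enumerate(P[s][a]))
                for a in range(len(P[s]))
            ]
            new_V.append(backup(q))
        delta = max(abs(x - y) for x, y in zip(new_V, V))
        V = new_V
        if delta < tol:
            break
    return V


# Hypothetical two-state, two-action model used only for illustration.
P = [
    [[0.9, 0.1], [0.2, 0.8]],
    [[0.5, 0.5], [0.0, 1.0]],
]
R = [[1.0, 0.0], [0.0, 2.0]]
gamma = 0.9

v_max = value_iteration(P, R, gamma, max)
v_avg = value_iteration(P, R, gamma, lambda q: generalized_average(q, 5.0))
print("max backup:", v_max)
print("generalized average backup:", v_avg)
```

In this sketch the averaged backup never exceeds the max backup, so the resulting values are at most the optimal ones. This is the sense in which some optimality is given up; whether and how this yields robustness depends on the operator actually analyzed in the paper.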