Bionic autonomous learning control of a two-wheeled self-balancing flexible robot

Source: Journal of Control Theory and Applications
This paper presents an operant conditioning probabilistic automaton (OCPA), a bionic autonomous learning system based on Skinner's operant conditioning theory, for solving the balance control problem of a two-wheeled flexible robot. Learning in the OCPA system proceeds in two stages: in the first stage, an operant action is selected stochastically from a set of operant actions and applied as the input of the control system; in the second stage, the learning system gathers the orientation information of the system and uses it for optimization until the control target is achieved. During learning, the size of the operant action set is reduced automatically so that low-probability events are avoided. A theoretical analysis of the OCPA learning system proves the convergence of its operant conditioning learning mechanism: the operant action entropy converges to a minimum as learning proceeds. The OCPA learning system is then applied to the posture balance control of a two-wheeled flexible self-balancing robot. In the initial state the robot has no posture-balancing skill, and every operant in the operant set has an equal selection probability. As learning proceeds, the selection probability of the optimal operant gradually tends to one and the operant action entropy gradually tends to its minimum, so the robot gradually acquires the posture-balancing skill.
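The two-stage loop described in the abstract can be illustrated with a small sketch. The abstract does not give the paper's actual update equations, so everything below is an assumption made for illustration: the probability update is the standard linear reward-inaction rule from learning-automata theory (swapped in as a stand-in, not the authors' rule), the "orientation" feedback is a toy reward that favours the action closest to a scalar `target`, and the names `learn`, `reinforce`, `prune`, `rate`, and `floor` are hypothetical.

```python
import math
import random


def entropy(probs):
    """Shannon entropy (in nats) of the action-selection distribution."""
    return sum(-p * math.log(p) for p in probs if p > 0.0)


def select(probs, rng):
    """Stage 1: draw an action index stochastically according to probs."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


def reinforce(probs, chosen, rate):
    """Linear reward-inaction update (assumed stand-in, not the paper's rule):
    move probability mass toward the rewarded action; the sum stays 1."""
    return [p + rate * (1.0 - p) if i == chosen else p * (1.0 - rate)
            for i, p in enumerate(probs)]


def prune(actions, probs, floor):
    """Shrink the action set: drop actions whose probability fell below
    `floor`, then renormalise. With floor <= 1/len(actions) at least one
    action always survives."""
    kept = [(a, p) for a, p in zip(actions, probs) if p >= floor]
    total = sum(p for _, p in kept)
    return [a for a, _ in kept], [p / total for _, p in kept]


def learn(actions, target, steps=500, rate=0.1, floor=0.01, seed=0):
    """Run the toy loop and return (remaining actions, probabilities,
    entropy history). All parameters are illustrative assumptions."""
    rng = random.Random(seed)
    actions = list(actions)
    probs = [1.0 / len(actions)] * len(actions)  # equal initial probabilities
    history = [entropy(probs)]
    for _ in range(steps):
        i = select(probs, rng)
        # Stage 2 (toy feedback): reward the chosen action only if no
        # remaining action lies closer to the target.
        best_error = min(abs(a - target) for a in actions)
        if abs(actions[i] - target) == best_error:
            probs = reinforce(probs, i, rate)
        actions, probs = prune(actions, probs, floor)
        history.append(entropy(probs))
    return actions, probs, history


final_actions, final_probs, history = learn([-2.0, -1.0, 0.0, 1.0, 2.0], target=0.3)
print(final_actions, final_probs, history[0], history[-1])
```

In this toy setting the entropy starts at its maximum for five equally likely actions and falls as probability mass concentrates on the rewarded action, which mirrors the qualitative behaviour the abstract reports. It says nothing about the robot dynamics or the convergence proof in the paper itself.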