Incorporation of Perception-based Information in Robot Learning Using Fuzzy Reinforcement Learning A

来源 :青岛海洋大学学报 | 被引量 : 0次 | 上传用户:tangyajun1314
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Robot learning in unstructured environments has been proved to be an extremely challenging problem, mainly because of many uncertainties always present in the real world. Human beings, on the other hand, seem to cope very well with uncertain and unpredictable environments, often relying on perception-based information. Furthermore, humans beings can also utilize perceptions to guide their learning on those parts of the perception-action space that are actually relevant to the task. Therefore, we conduct a research aimed at improving robot learning through the incorporation of both perceptionbased and measurement-based information. For this reason, a fuzzy reinforcement learning (FRL) agent is proposed in this paper. Based on a neural-fuzzy architecture, different kinds of information can be incorporated into the FRL agent to initialise its action network, critic network and evaluation feedback module so as to accelerate its learning. By making use of the global optimisation capability of GAs (genetic algorithms), a GA-based FRL (GAFRL) agent is presented to solve the local minima problem in traditional actor-critic reinforcement learning. On the other hand, with the prediction capability of the critic network, GAs can perform a more effective global search. Different GAFRL agents are constructed and verified by using the simulation model of a physical biped robot. The simulation analysis shows that the biped learning rate for dynamic balance can be improved by incorporating perception-based information on biped balancing and walking evaluation.The biped robot can find its application in ocean exploration, detection or sea rescue activity, as well as military maritime activity.
其他文献
以阻尼乳胶IPNs为成膜物质,多聚磷酸铵(APP)、三聚氰铵(MEL)和季戊四醇(PE)为阻燃添加剂,采用正交实验设计,找出了本体系中APP、MEL、PE三者的最佳配比.通过改变乳液与阻燃添
基于派生式 CAPP专家系统的 GT分类特点 ,论述了ISODATA方法的基本概念、重要性、主要步骤及算法 ,重点介绍了聚类判据与准则 .实例表明了该方法的有效性 ,为自由锻 CAPP专家
对烟草中无机氯含量的自动电位滴定法分析的全过程进行了研究,确定该项分析的最佳条件为:2 g烟样,用1%HNO3溶液50 mL浸取5 min,然后加入0.1000 molL-1 NaCl溶液5.00 mL,固定
The prime function of the Philippine National Collection of Microorganisms (PNCM), being the national repository of microbial strains, is to collect and preserv
以熔融缩聚方法合成了具有苯烷基侧链取代的全芳香液晶聚酯,用TG、DSC、热台偏光显微镜研究了聚酯的热性能,并讨论了取代基对聚酯热性能的影响,所有合成的聚酯均为热致型液晶
矩技术因具有数学上的简明性及多用性 ,而得到了广泛应用。本文介绍矩技术在图像处理、计算机视觉和模式识别领域内的应用 ,主要包括景物匹配、直方图匹配、图像重建、图像压缩、对称性检测、图像规格化、纹理分割、边缘检测、目标识别和图像检索。
词汇教学是大学英语教学的重要部分.针对学生在学习英语单词中存在的困难,结合第1册教材中的词汇,提出在教学中应注重词汇的实际运用、词汇知识的关联性、多义性、扩展性、构
法国自1996年开始更新课程,新课程中明确提出,技术要真正整合到数学教学中去,并且声明了这种整合是必需的.技术整合的意义在于,通过新技术所提供的各种可能性来支持、完善和
作者在对Win16和Win32进行比较的基础上,结合作者在移植过程中的实际经验,对遇到的一些问题和解决办法以及移植步骤作了详细的探讨和总结.
干法造纸非织造布广泛应用于卫生用品和擦拭用品。本文介绍了该产品的生产工艺和物理性能,分析了中国的生产状况和消费环境,认为中国有一个正在兴起的干法造纸产品市场。