Adaptive dynamic programming for linear impulse systems

Source: Journal of Zhejiang University-Science C(Computers and Elect

Authors: Xiao-hua WANG, Juan-juan YU, Yao HUANG, Hua WANG, Zhong-hua MIAO

Affiliation: School of Mechatronics Engineering and Automation, Shanghai University, Shanghai Key Laboratory of Po

Publication date: 2014, Issue 01

Keywords: impulse, dimensionality, quadric, singularity, quadratic, trained, converge, iterative

Abstract:

We investigate the optimization of linear impulse systems using the reinforcement-learning-based adaptive dynamic programming (ADP) method. For linear impulse systems, the optimal objective function is shown to be a quadratic form of the pre-impulse states. The ADP method provides solutions that iteratively converge to the optimal objective function: if the initial guess of the pre-impulse objective function is selected as a quadratic form of the pre-impulse states, the objective function iteratively converges to the optimal one through ADP. Although direct use of the quadratic objective function of the states within the ADP method is theoretically possible, a numerical singularity problem may occur as the system dimensionality increases, owing to the matrix inversion involved. A neural-network-based ADP method can circumvent this problem. A neural network with polynomial activation functions is selected to approximate the pre-impulse objective function and is trained iteratively using the ADP method to achieve optimal control. After successful training, the optimal impulse control can be derived. Simulations are presented for illustrative purposes.
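The abstract does not give the system equations, so the following is only a minimal sketch of the direct quadratic-form iteration it describes, under an assumed model that is not taken from the paper. The state is assumed to jump as x+ = x + B u at each impulse instant and then to flow linearly as x_next = Phi x+ until the next impulse, with a cost of x^T Q x + u^T R u accumulated at each pre-impulse state. Under those assumptions the pre-impulse value function V_j(x) = x^T P_j x can be iterated from P_0 = 0. The names `Phi`, `B`, `Q`, `R`, and `adp_value_iteration` are hypothetical, and the numbers are illustrative only.

```python
import numpy as np


def adp_value_iteration(Phi, B, Q, R, iters=5000, tol=1e-10):
    """Iterate P_j for V_j(x) = x^T P_j x at pre-impulse states.

    Assumed (hypothetical) model: x_{k+1} = Phi (x_k + B u_k),
    i.e. a discrete map Ad x_k + Bd u_k between pre-impulse states.
    """
    Ad = Phi
    Bd = Phi @ B
    P = np.zeros_like(Q, dtype=float)  # initial quadratic guess P_0 = 0
    for _ in range(iters):
        # Linear solve with S: the step where ill-conditioning can appear
        # as the state dimension grows.
        S = R + Bd.T @ P @ Bd
        K = np.linalg.solve(S, Bd.T @ P @ Ad)  # greedy impulse u = -K x
        P_next = Q + Ad.T @ P @ (Ad - Bd @ K)  # Riccati-type update
        P_next = 0.5 * (P_next + P_next.T)     # keep P symmetric
        converged = np.max(np.abs(P_next - P)) < tol
        P = P_next
        if converged:
            break
    # Impulse gain associated with the final P
    K = np.linalg.solve(R + Bd.T @ P @ Bd, Bd.T @ P @ Ad)
    return P, K


# Illustrative data (assumed): double-integrator flow over 0.1 time units,
# with the impulse acting on the velocity component only.
Phi = np.array([[1.0, 0.1],
                [0.0, 1.0]])
B = np.array([[0.0],
              [1.0]])
Q = np.eye(2)
R = np.array([[1.0]])

P, K = adp_value_iteration(Phi, B, Q, R)
print("P =\n", P)
print("K =", K)
```

In this sketch the only matrix inversion is the linear solve with `R + Bd.T @ P @ Bd`, which corresponds to the step the abstract identifies as a possible source of numerical singularity. The neural-network variant described in the abstract would instead fit the weights of polynomial features of the pre-impulse state, and is not shown here.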
