Adaptive dynamic programming for linear impulse systems

来源 :Journal of Zhejiang University-Science C(Computers and Elect | 被引量 : 0次 | 上传用户:qiuenqiuen
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
We investigate the optimization of linear impulse systems with the reinforcement learning based adaptive dynamic programming(ADP)method.For linear impulse systems,the optimal objective function is shown to be a quadric form of the pre-impulse states.The ADP method provides solutions that iteratively converge to the optimal objective function.If an initial guess of the pre-impulse objective function is selected as a quadratic form of the pre-impulse states,the objective function iteratively converges to the optimal one through ADP.Though direct use of the quadratic objective function of the states within the ADP method is theoretically possible,the numerical singularity problem may occur due to the matrix inversion therein when the system dimensionality increases.A neural network based ADP method can circumvent this problem.A neural network with polynomial activation functions is selected to approximate the pre-impulse objective function and trained iteratively using the ADP method to achieve optimal control.After a successful training,optimal impulse control can be derived.Simulations are presented for illustrative purposes. We investigate the optimization of linear impulse systems with the reinforcement learning based adaptive dynamic programming (ADP) method. For linear impulse systems, the optimal objective function is shown to be a quadric form of the pre-impulse states. The ADP method provides solutions that iteratively converge to the optimal objective function. If an initial guess of the pre-impulse objective function is selected as a quadratic form of the pre-impulse states, the objective function iteratively converges to the optimal one through ADP.Though direct use of the quadratic objective function of the states within the ADP method is theoretically possible, the numerical singularity problem may occur due to the matrix inversion incorporated when the system dimensionality increases. A neural network based ADP method can circumvent this problem. A neural network with polynomial activation functions is selected to approximate the pre-impulse objective function and trained iteratively using the ADP method to a chieve optimal control. After a successful training, optimal impulse control can be derived. Simulations are presented for illustrative purposes.
其他文献
文人书画是当代书画界一个独特的文化现象。文人的书画作品相对于职业书画家的作品,不以书画技法的奇巧高妙取胜,却以深刻的文化内涵和丰厚的人文情怀打动人心,成为当代书画
中国南方广泛分布的红土堆积是热带-亚热带地区长期湿热气候条件下的风化产物,是中国南方重要的第四纪地层,蕴含了大量北亚热带-热带地区第四纪古气候变迁和古环境演化信息,
党的十五大确立了面向新世纪的党建新目标:“要把我们党建设成为用邓小平理论武装起来、全心全意为人民服务、思想上政治上组织上完全巩固、能够经受住各种风险、始终走在时
本文利用联苯菊酯、氰戊菊酯及氯氰菊酯的吸收光谱特性及其主要光降解产物的荧光光谱特性、质谱特性,较为系统、全面研究了几种典型拟除虫菊酯农药的光降解反应动力学特性及光
一、中国画的书法线条能够使中国画产生内涵的抽象美如果说“笔”是中国画的骨,“墨”即是中国画的肉,笔墨就构成了中国画的灵魂。一幅好的中国画包括画家的学养、画面的意境
所谓“第二课堂”教育是相对于正常的课堂教学而言的,它主要是指在教学过程中,结合课堂所讲授的理论问题,有目的地组织学员走出教室,走向社会,到实践中去搞专题研究、典型剖
蜕皮甾类激素除了能够调节甲壳动物的生长、繁殖和周期性蜕皮外,还在卵黄发生及胚胎发生中有着重要的生理作用。蜕皮激素受体(EcR)属于核受体超家族成员,为蜕皮甾类激素的作用受
学位
“矮纸斜行闲作草,晴窗细乳戏分茶”。清明前后,各地春茶陆续上市,虽然部分茶区受到低温天气侵扰,多个产地面临采工短缺和人力成本上涨的困惑,2011年的春茶还是来了,在期盼之
摄影画刊曾是纪实摄影的摇篮,在种族隔离时代,南非《鼓》培养和训练了一代图片编辑和纪实摄影家,他们的工作在表征殖民治理和种族和解进程中起重要作用,但相关研究寥寥。本文