A controller which is locally optimal near the origin and globally inverse optimal for the nonlinear system is proposed for path following of over actuated marine crafts with actuator dynamics. The motivation is the e...A controller which is locally optimal near the origin and globally inverse optimal for the nonlinear system is proposed for path following of over actuated marine crafts with actuator dynamics. The motivation is the existence of undesired signals sent to the actuators, which can result in bad behavior in path following. To attenuate the oscillation of the control signal and obtain smooth thrust outputs, the actuator dynamics are added into the ship maneuvering model. Instead of modifying the Line-of-Sight (LOS) guidance law, this proposed controller can easily adjust the vessel speed to minimize the large cross-track error caused by the high vessel speed when it is turning. Numerical simulations demonstrate the validity of this proposed controller.展开更多
For most firms,especially the small-and medium-sized ones,the operational decisions are affected by their internal capital and ability to obtain external capital.However,the majority of the current studies on dynamic ...For most firms,especially the small-and medium-sized ones,the operational decisions are affected by their internal capital and ability to obtain external capital.However,the majority of the current studies on dynamic inventory control ignore the firm’s financial status and financing issues completely.An important question that arises is:what are the dynamic optimal inventory and financing policies for firms with limited capital and limited access to external capital?In this paper,we review some of the latest developments in this area.After a brief review of single period models,we focus on multi-period dynamic control of the firm who aims to optimize its xpected terminal wealth.Two cases are discussed in detail:self-finance and short term finance.In the first case,the firm has to rely on its own capital for all ordering decisions,while in the second,the firm can borrow short term loan from lenders.A detailed characterization of the optimal policy is presented and its managerial insights are discussed.Several possible extensions are suggested.展开更多
In this paper, we consider a class of optimal control problem for the singularly perturbed hybrid dynamical systems. By means of variational method, we obtain the necessary conditions of the hybrid dynamical systems. ...In this paper, we consider a class of optimal control problem for the singularly perturbed hybrid dynamical systems. By means of variational method, we obtain the necessary conditions of the hybrid dynamical systems. Meanwhile, the existence of solution for the hybrid dynamical system is proved by the sewing method and the uniformly valid asymptotic expansion of the optimal trajectory is constructed by the boundary function method. Finally,an example is presented to illustrate the result.展开更多
This paper deals with both the leading train and the following train in a train tracking under a four-aspect fixed autoblock system in order to study the optimum operating strategy for energy saving. After analyzing t...This paper deals with both the leading train and the following train in a train tracking under a four-aspect fixed autoblock system in order to study the optimum operating strategy for energy saving. After analyzing the working principle of the four-aspect fixed autoblock system, an energy-saving control model is created based on the dynamics equation of the Wains. In addition to safety, energy consumption and time error are the main concerns of the model. Based on this model, dynamic speed constraints of the following train are proposed, defined by the leading gain dynamically. At the same time, the static speed constraints defined by the line conditions are also taken into account. The parallel genetic algorithm is used to search the optimum operating strategy. In order to simplify the solving process, the external punishment function is adopted to transform this problem with constraints to the one without constraints. By using the real number coding and the strategy of dividing ramps into three parts, the convergence of GA is accelerated and the length of chromosomes is shortened. The simulation result from a four-aspect fixed autoblock system simulation platform shows that the method can reduce the energy consumption effectively in the premise of ensuring safety and punctuality.展开更多
This paper estimates an off-policy integral reinforcement learning(IRL) algorithm to obtain the optimal tracking control of unknown chaotic systems. Off-policy IRL can learn the solution of the HJB equation from the...This paper estimates an off-policy integral reinforcement learning(IRL) algorithm to obtain the optimal tracking control of unknown chaotic systems. Off-policy IRL can learn the solution of the HJB equation from the system data generated by an arbitrary control. Moreover, off-policy IRL can be regarded as a direct learning method, which avoids the identification of system dynamics. In this paper, the performance index function is first given based on the system tracking error and control error. For solving the Hamilton–Jacobi–Bellman(HJB) equation, an off-policy IRL algorithm is proposed.It is proven that the iterative control makes the tracking error system asymptotically stable, and the iterative performance index function is convergent. Simulation study demonstrates the effectiveness of the developed tracking control method.展开更多
In this paper, an optimal tracking control scheme is proposed for a class of discrete-time chaotic systems using the approximation-error-based adaptive dynamic programming (ADP) algorithm. Via the system transformat...In this paper, an optimal tracking control scheme is proposed for a class of discrete-time chaotic systems using the approximation-error-based adaptive dynamic programming (ADP) algorithm. Via the system transformation, the optimal tracking problem is transformed into an optimal regulation problem, and then the novel optimal tracking control method is proposed. It is shown that for the iterative ADP algorithm with finite approximation error, the iterative performance index functions can converge to a finite neighborhood of the greatest lower bound of all performance index functions under some convergence conditions. Two examples are given to demonstrate the validity of the proposed optimal tracking control scheme for chaotic systems.展开更多
We develop an online adaptive dynamic programming (ADP) based optimal control scheme for continuous-time chaotic systems. The idea is to use the ADP algorithm to obtain the optimal control input that makes the perfo...We develop an online adaptive dynamic programming (ADP) based optimal control scheme for continuous-time chaotic systems. The idea is to use the ADP algorithm to obtain the optimal control input that makes the performance index function reach an optimum. The expression of the performance index function for the chaotic system is first presented. The online ADP algorithm is presented to achieve optimal control. In the ADP structure, neural networks are used to construct a critic network and an action network, which can obtain an approximate performance index function and the control input, respectively. It is proven that the critic parameter error dynamics and the closed-loop chaotic systems are uniformly ultimately bounded exponentially. Our simulation results illustrate the performance of the established optimal control method.展开更多
The q-profile control problem in the ramp-up phase of plasma discharges is consid- ered in this work. The magnetic diffusion partial differential equation (PDE) models the dynamics of the poloidal magnetic flux prof...The q-profile control problem in the ramp-up phase of plasma discharges is consid- ered in this work. The magnetic diffusion partial differential equation (PDE) models the dynamics of the poloidal magnetic flux profile, which is used in this work to formulate a PDE-constrained op-timization problem under a quasi-static assumption. The minimum surface theory and constrained numeric optimization are then applied to achieve suboptimal solutions. Since the transient dy- namics is pre-given by the minimum surface theory, then this method can dramatically accelerate the solution process. In order to be robust under external uncertainties in real implementations, PID (proportional-integral-derivative) controllers are used to force the actuators to follow the computational input trajectories. It has the potential to implement in real-time for long time discharges by combining this method with the magnetic equilibrium update.展开更多
The reduction of energy consumption is an increasingly important topic of the railway system.Energy-efficient train control(EETC)is one solution,which refers to mathematically computing when to accelerate,which cruisi...The reduction of energy consumption is an increasingly important topic of the railway system.Energy-efficient train control(EETC)is one solution,which refers to mathematically computing when to accelerate,which cruising speed to hold,how long one should coast over a suitable space,and when to brake.Most approaches in literature and industry greatly simplify a lot of nonlinear effects,such that they ignore mostly the losses due to energy conversion in traction components and auxiliaries.To fill this research gap,a series of increasingly detailed nonlinear losses is described and modelled.We categorize an increasing detail in this representation as four levels.We study the impact of those levels of detail on the energy optimal speed trajectory.To do this,a standard approach based on dynamic programming is used,given constraints on total travel time.This evaluation of multiple test cases highlights the influence of the dynamic losses and the power consumption of auxiliary components on railway trajectories,also compared to multiple benchmarks.The results show how the losses can make up 50%of the total energy consumption for an exemplary trip.Ignoring them would though result in consistent but limited errors in the optimal trajectory.Overall,more complex trajectories can result in less energy consumption when including the complexity of nonlinear losses than when a simpler model is considered.Those effects are stronger when the trajectory includes many acceleration and braking phases.展开更多
Magnetorheological (MR) dampers are one of the most promising new devices for civil infrastructural vibration control applications. However, due to their highly nonlinear dynamic behavior, it is very difficult to obta...Magnetorheological (MR) dampers are one of the most promising new devices for civil infrastructural vibration control applications. However, due to their highly nonlinear dynamic behavior, it is very difficult to obtain of a mathematical model of inverse MR damper that has an explicit relationship between the desired damper force and the command signal (voltage). This force voltage relationship is especially required for the structural vibration control design and simulation using MR dampers. This paper focuses on using a neural network (NN) technique to emulate the inverse MR damper model. The output of the neural network can be used to command the MR damper for generating desired forces. Numerical simulations are also presented to illustrate the effectiveness of this inverse model in semi active vibration control using MR dampers.展开更多
随着下一代通信网的发展,传统网络架构已无法满足日益增长的灵活性、可扩展性及管理需求。软件定义网络(Software Defined Network,SDN)作为一种新型网络架构,为6G网络提供了新的研究方向。文章分析SDN的基本架构和工作原理,并总结SDN...随着下一代通信网的发展,传统网络架构已无法满足日益增长的灵活性、可扩展性及管理需求。软件定义网络(Software Defined Network,SDN)作为一种新型网络架构,为6G网络提供了新的研究方向。文章分析SDN的基本架构和工作原理,并总结SDN技术的优化方法。在此基础上,结合Mininet仿真平台对SDN与传统网络架构在6G应用场景下的性能进行对比实验。结果表明,SDN在网络延迟、丢包率及资源利用率等关键性能指标上显著优于传统网络架构,为6G网络的部署提供了重要理论依据和实践指导。展开更多
基金Supported by the National Natural Science Foundation of China under Grant Nos. 61301279, 51479158 and the Fundamental Research Funds for the Central Universities under Grant No. WUT: 163102006
文摘A controller which is locally optimal near the origin and globally inverse optimal for the nonlinear system is proposed for path following of over actuated marine crafts with actuator dynamics. The motivation is the existence of undesired signals sent to the actuators, which can result in bad behavior in path following. To attenuate the oscillation of the control signal and obtain smooth thrust outputs, the actuator dynamics are added into the ship maneuvering model. Instead of modifying the Line-of-Sight (LOS) guidance law, this proposed controller can easily adjust the vessel speed to minimize the large cross-track error caused by the high vessel speed when it is turning. Numerical simulations demonstrate the validity of this proposed controller.
基金Supported by National Natural Science Foundation of China(Grant No.71390330)
文摘For most firms,especially the small-and medium-sized ones,the operational decisions are affected by their internal capital and ability to obtain external capital.However,the majority of the current studies on dynamic inventory control ignore the firm’s financial status and financing issues completely.An important question that arises is:what are the dynamic optimal inventory and financing policies for firms with limited capital and limited access to external capital?In this paper,we review some of the latest developments in this area.After a brief review of single period models,we focus on multi-period dynamic control of the firm who aims to optimize its xpected terminal wealth.Two cases are discussed in detail:self-finance and short term finance.In the first case,the firm has to rely on its own capital for all ordering decisions,while in the second,the firm can borrow short term loan from lenders.A detailed characterization of the optimal policy is presented and its managerial insights are discussed.Several possible extensions are suggested.
基金supported by the National Natural Science Foundation of China(11471118,11401385 and 11371140)Natural Science Foundation of Hebei Province(A2015407063)Doctoral Foundation of Hebei Normal University of Science and Technology(2013YB008)
文摘In this paper, we consider a class of optimal control problem for the singularly perturbed hybrid dynamical systems. By means of variational method, we obtain the necessary conditions of the hybrid dynamical systems. Meanwhile, the existence of solution for the hybrid dynamical system is proved by the sewing method and the uniformly valid asymptotic expansion of the optimal trajectory is constructed by the boundary function method. Finally,an example is presented to illustrate the result.
基金supported by the National Science & Technology Pillar Program during the Eleventh Five-Year Plan Period of China (No.2009BAG12A05)
文摘This paper deals with both the leading train and the following train in a train tracking under a four-aspect fixed autoblock system in order to study the optimum operating strategy for energy saving. After analyzing the working principle of the four-aspect fixed autoblock system, an energy-saving control model is created based on the dynamics equation of the Wains. In addition to safety, energy consumption and time error are the main concerns of the model. Based on this model, dynamic speed constraints of the following train are proposed, defined by the leading gain dynamically. At the same time, the static speed constraints defined by the line conditions are also taken into account. The parallel genetic algorithm is used to search the optimum operating strategy. In order to simplify the solving process, the external punishment function is adopted to transform this problem with constraints to the one without constraints. By using the real number coding and the strategy of dividing ramps into three parts, the convergence of GA is accelerated and the length of chromosomes is shortened. The simulation result from a four-aspect fixed autoblock system simulation platform shows that the method can reduce the energy consumption effectively in the premise of ensuring safety and punctuality.
基金Project supported by the National Natural Science Foundation of China(Grant Nos.61304079 and 61374105)the Beijing Natural Science Foundation,China(Grant Nos.4132078 and 4143065)+2 种基金the China Postdoctoral Science Foundation(Grant No.2013M530527)the Fundamental Research Funds for the Central Universities,China(Grant No.FRF-TP-14-119A2)the Open Research Project from State Key Laboratory of Management and Control for Complex Systems,China(Grant No.20150104)
文摘This paper estimates an off-policy integral reinforcement learning(IRL) algorithm to obtain the optimal tracking control of unknown chaotic systems. Off-policy IRL can learn the solution of the HJB equation from the system data generated by an arbitrary control. Moreover, off-policy IRL can be regarded as a direct learning method, which avoids the identification of system dynamics. In this paper, the performance index function is first given based on the system tracking error and control error. For solving the Hamilton–Jacobi–Bellman(HJB) equation, an off-policy IRL algorithm is proposed.It is proven that the iterative control makes the tracking error system asymptotically stable, and the iterative performance index function is convergent. Simulation study demonstrates the effectiveness of the developed tracking control method.
基金supported by the Open Research Project from SKLMCCS (Grant No. 20120106)the Fundamental Research Funds for the Central Universities of China (Grant No. FRF-TP-13-018A)+1 种基金the Postdoctoral Science Foundation of China (Grant No. 2013M530527)the National Natural Science Foundation of China (Grant Nos. 61304079, 61125306, and 61034002)
文摘In this paper, an optimal tracking control scheme is proposed for a class of discrete-time chaotic systems using the approximation-error-based adaptive dynamic programming (ADP) algorithm. Via the system transformation, the optimal tracking problem is transformed into an optimal regulation problem, and then the novel optimal tracking control method is proposed. It is shown that for the iterative ADP algorithm with finite approximation error, the iterative performance index functions can converge to a finite neighborhood of the greatest lower bound of all performance index functions under some convergence conditions. Two examples are given to demonstrate the validity of the proposed optimal tracking control scheme for chaotic systems.
基金Project supported by the Open Research Project from the SKLMCCS(Grant No.20120106)the Fundamental Research Funds for the Central Universities of China(Grant No.FRF-TP-13-018A)+2 种基金the Postdoctoral Science Foundation of China(Grant No.2013M530527)the National Natural Science Foundation of China(Grant Nos.61304079 and 61374105)the Natural Science Foundation of Beijing,China(Grant No.4132078 and 4143065)
文摘We develop an online adaptive dynamic programming (ADP) based optimal control scheme for continuous-time chaotic systems. The idea is to use the ADP algorithm to obtain the optimal control input that makes the performance index function reach an optimum. The expression of the performance index function for the chaotic system is first presented. The online ADP algorithm is presented to achieve optimal control. In the ADP structure, neural networks are used to construct a critic network and an action network, which can obtain an approximate performance index function and the control input, respectively. It is proven that the critic parameter error dynamics and the closed-loop chaotic systems are uniformly ultimately bounded exponentially. Our simulation results illustrate the performance of the established optimal control method.
基金supported partially by the US NSF CAREER award program (ECCS-0645086)National Natural Science Foundation of China (No.F030119)+2 种基金Zhejiang Provincial Natural Science Foundation of China (Nos.Y1110354, Y6110751)the Fundamental Research Funds for the Central Universities of China (No.1A5000-172210101)the Natural Science Foundation of Ningbo (No.2010A610096)
文摘The q-profile control problem in the ramp-up phase of plasma discharges is consid- ered in this work. The magnetic diffusion partial differential equation (PDE) models the dynamics of the poloidal magnetic flux profile, which is used in this work to formulate a PDE-constrained op-timization problem under a quasi-static assumption. The minimum surface theory and constrained numeric optimization are then applied to achieve suboptimal solutions. Since the transient dy- namics is pre-given by the minimum surface theory, then this method can dramatically accelerate the solution process. In order to be robust under external uncertainties in real implementations, PID (proportional-integral-derivative) controllers are used to force the actuators to follow the computational input trajectories. It has the potential to implement in real-time for long time discharges by combining this method with the magnetic equilibrium update.
基金supported by Swiss Federal Office of Transport,the ETH foundation and via the grant RAILPOWER.
文摘The reduction of energy consumption is an increasingly important topic of the railway system.Energy-efficient train control(EETC)is one solution,which refers to mathematically computing when to accelerate,which cruising speed to hold,how long one should coast over a suitable space,and when to brake.Most approaches in literature and industry greatly simplify a lot of nonlinear effects,such that they ignore mostly the losses due to energy conversion in traction components and auxiliaries.To fill this research gap,a series of increasingly detailed nonlinear losses is described and modelled.We categorize an increasing detail in this representation as four levels.We study the impact of those levels of detail on the energy optimal speed trajectory.To do this,a standard approach based on dynamic programming is used,given constraints on total travel time.This evaluation of multiple test cases highlights the influence of the dynamic losses and the power consumption of auxiliary components on railway trajectories,also compared to multiple benchmarks.The results show how the losses can make up 50%of the total energy consumption for an exemplary trip.Ignoring them would though result in consistent but limited errors in the optimal trajectory.Overall,more complex trajectories can result in less energy consumption when including the complexity of nonlinear losses than when a simpler model is considered.Those effects are stronger when the trajectory includes many acceleration and braking phases.
文摘Magnetorheological (MR) dampers are one of the most promising new devices for civil infrastructural vibration control applications. However, due to their highly nonlinear dynamic behavior, it is very difficult to obtain of a mathematical model of inverse MR damper that has an explicit relationship between the desired damper force and the command signal (voltage). This force voltage relationship is especially required for the structural vibration control design and simulation using MR dampers. This paper focuses on using a neural network (NN) technique to emulate the inverse MR damper model. The output of the neural network can be used to command the MR damper for generating desired forces. Numerical simulations are also presented to illustrate the effectiveness of this inverse model in semi active vibration control using MR dampers.
文摘随着下一代通信网的发展,传统网络架构已无法满足日益增长的灵活性、可扩展性及管理需求。软件定义网络(Software Defined Network,SDN)作为一种新型网络架构,为6G网络提供了新的研究方向。文章分析SDN的基本架构和工作原理,并总结SDN技术的优化方法。在此基础上,结合Mininet仿真平台对SDN与传统网络架构在6G应用场景下的性能进行对比实验。结果表明,SDN在网络延迟、丢包率及资源利用率等关键性能指标上显著优于传统网络架构,为6G网络的部署提供了重要理论依据和实践指导。