In order to accurately forecast the load of power system and enhance the stability of the power network, a novel unascertained mathematics based recurrent neural network (UMRNN) for power intelligence center (PIC) was...In order to accurately forecast the load of power system and enhance the stability of the power network, a novel unascertained mathematics based recurrent neural network (UMRNN) for power intelligence center (PIC) was created through three steps. First, by combining with the general project uncertain element transmission theory (GPUET), the basic definitions of stochastic, fuzzy, and grey uncertain elements were given based on the principal types of uncertain information. Second, a power dynamic alliance including four sectors: generation sector, transmission sector, distribution sector and customers was established. The key factors were amended according to the four transmission topologies of uncertain elements, thus the new factors entered the power intelligence center as the input elements. Finally, in the intelligence handing background of PIC, by performing uncertain and recursive process to the input values of network, and combining unascertained mathematics, the novel load forecasting model was built. Three different approaches were put forward to forecast an eastern regional power grid load in China. The root mean square error (ERMS) demonstrates that the forecasting accuracy of the proposed model UMRNN is 3% higher than that of BP neural network (BPNN), and 5% higher than that of autoregressive integrated moving average (ARIMA). Besides, an example also shows that the average relative error of the first quarter of 2008 forecasted by UMRNN is only 2.59%, which has high precision.展开更多
For vehicle integrated navigation systems, real-time estimating states of the dead reckoning (DR) unit is much more difficult than that of the other measuring sensors under indefinite noises and nonlinear characteri...For vehicle integrated navigation systems, real-time estimating states of the dead reckoning (DR) unit is much more difficult than that of the other measuring sensors under indefinite noises and nonlinear characteristics. Compared with the well known, extended Kalman filter (EKF), a recurrent neural network is proposed for the solution, which not only improves the location precision and the adaptive ability of resisting disturbances, but also avoids calculating the analytic derivation and Jacobian matrices of the nonlinear system model. To test the performances of the recurrent neural network, these two methods are used to estimate the state of the vehicle's DR navigation system. Simulation results show that the recurrent neural network is superior to the EKF and is a more ideal filtering method for vehicle DR navigation.展开更多
After a recursive multi-step-ahead predictor for nonlinear systems based on local recurrent neural networks is introduced, an intelligent FID controller is adopted to correct the errors including identified model erro...After a recursive multi-step-ahead predictor for nonlinear systems based on local recurrent neural networks is introduced, an intelligent FID controller is adopted to correct the errors including identified model errors and accumulated errors produced in the recursive process. Characterized by predictive control, this method can achieve a good control accuracy and has good robustness. A simulation study shows that this control algorithm is very effective.展开更多
Nonlinear model predictive controllers(NMPC)can predict the future behavior of the under-controlled system using a nonlinear predictive model.Here,an array of hyper chaotic diagonal recurrent neural network(HCDRNN)was...Nonlinear model predictive controllers(NMPC)can predict the future behavior of the under-controlled system using a nonlinear predictive model.Here,an array of hyper chaotic diagonal recurrent neural network(HCDRNN)was proposed for modeling and predicting the behavior of the under-controller nonlinear system in a moving forward window.In order to improve the convergence of the parameters of the HCDRNN to improve system’s modeling,the extent of chaos is adjusted using a logistic map in the hidden layer.A novel NMPC based on the HCDRNN array(HCDRNN-NMPC)was proposed that the control signal with the help of an improved gradient descent method was obtained.The controller was used to control a continuous stirred tank reactor(CSTR)with hard-nonlinearities and input constraints,in the presence of uncertainties including external disturbance.The results of the simulations show the superior performance of the proposed method in trajectory tracking and disturbance rejection.Parameter convergence and neglectable prediction error of the neural network(NN),guaranteed stability and high tracking performance are the most significant advantages of the proposed scheme.展开更多
To avoid unstable learning, a stable adaptive learning algorithm was proposed for discrete-time recurrent neural networks. Unlike the dynamic gradient methods, such as the backpropagation through time and the real tim...To avoid unstable learning, a stable adaptive learning algorithm was proposed for discrete-time recurrent neural networks. Unlike the dynamic gradient methods, such as the backpropagation through time and the real time recurrent learning, the weights of the recurrent neural networks were updated online in terms of Lyapunov stability theory in the proposed learning algorithm, so the learning stability was guaranteed. With the inversion of the activation function of the recurrent neural networks, the proposed learning algorithm can be easily implemented for solving varying nonlinear adaptive learning problems and fast convergence of the adaptive learning process can be achieved. Simulation experiments in pattern recognition show that only 5 iterations are needed for the storage of a 15×15 binary image pattern and only 9 iterations are needed for the perfect realization of an analog vector by an equilibrium state with the proposed learning algorithm.展开更多
This work proposes a recorded recurrent twin delayed deep deterministic(RRTD3)policy gradient algorithm to solve the challenge of constructing guidance laws for intercepting endoatmospheric maneuvering missiles with u...This work proposes a recorded recurrent twin delayed deep deterministic(RRTD3)policy gradient algorithm to solve the challenge of constructing guidance laws for intercepting endoatmospheric maneuvering missiles with uncertainties and observation noise.The attack-defense engagement scenario is modeled as a partially observable Markov decision process(POMDP).Given the benefits of recurrent neural networks(RNNs)in processing sequence information,an RNN layer is incorporated into the agent’s policy network to alleviate the bottleneck of traditional deep reinforcement learning methods while dealing with POMDPs.The measurements from the interceptor’s seeker during each guidance cycle are combined into one sequence as the input to the policy network since the detection frequency of an interceptor is usually higher than its guidance frequency.During training,the hidden states of the RNN layer in the policy network are recorded to overcome the partially observable problem that this RNN layer causes inside the agent.The training curves show that the proposed RRTD3 successfully enhances data efficiency,training speed,and training stability.The test results confirm the advantages of the RRTD3-based guidance laws over some conventional guidance laws.展开更多
In order to improve the autonomous ability of unmanned aerial vehicles(UAV)to implement air combat mission,many artificial intelligence-based autonomous air combat maneuver decision-making studies have been carried ou...In order to improve the autonomous ability of unmanned aerial vehicles(UAV)to implement air combat mission,many artificial intelligence-based autonomous air combat maneuver decision-making studies have been carried out,but these studies are often aimed at individual decision-making in 1 v1 scenarios which rarely happen in actual air combat.Based on the research of the 1 v1 autonomous air combat maneuver decision,this paper builds a multi-UAV cooperative air combat maneuver decision model based on multi-agent reinforcement learning.Firstly,a bidirectional recurrent neural network(BRNN)is used to achieve communication between UAV individuals,and the multi-UAV cooperative air combat maneuver decision model under the actor-critic architecture is established.Secondly,through combining with target allocation and air combat situation assessment,the tactical goal of the formation is merged with the reinforcement learning goal of every UAV,and a cooperative tactical maneuver policy is generated.The simulation results prove that the multi-UAV cooperative air combat maneuver decision model established in this paper can obtain the cooperative maneuver policy through reinforcement learning,the cooperative maneuver policy can guide UAVs to obtain the overall situational advantage and defeat the opponents under tactical cooperation.展开更多
A neuromorphic continuous-time state space pole assignment adaptive controller is proposed, which is particularlyappropriate for controlling a large-scale time-variant state-space model due to the parallely distribute...A neuromorphic continuous-time state space pole assignment adaptive controller is proposed, which is particularlyappropriate for controlling a large-scale time-variant state-space model due to the parallely distributed nature ofneurocomputing. In our approach, Hopfield neural network is exploited to identify the parameters of a continuous-timestate-space model, and a dedicated recurrent neural network is designed to compute pole placement feedback control law inreal time. Thus the identification and the control computation are incorporated in the closed-loop, adaptive, real-timecontrol system. The merit of this approach is that the neural networks converge to their solutions very quickly andsimultaneously.展开更多
We are engaged in solving two difficult problems in adaptive control of the large-scale time-variant aerospace system. One is parameter identification of time-variant continuous-time state-space modei; the other is ho...We are engaged in solving two difficult problems in adaptive control of the large-scale time-variant aerospace system. One is parameter identification of time-variant continuous-time state-space modei; the other is how to solve algebraic Riccati equation (ARE) of large order efficiently. In our approach, two neural networks are employed to independently solve both the system identification problem and the ARE associated with the optimal control problem. Thus the identification and the control computation are combined in closed-loop, adaptive, real-time control system . The advantage of this approach is that the neural networks converge to their solutions very quickly and simultaneously.展开更多
Deep learning based recommendation methods, such as the recurrent neural network based recommendation method(RNNRec) and the gated recurrent unit(GRU) based recommendation method(GRURec), are proposed to solve the pro...Deep learning based recommendation methods, such as the recurrent neural network based recommendation method(RNNRec) and the gated recurrent unit(GRU) based recommendation method(GRURec), are proposed to solve the problem of time heterogeneous feedback recommendation. These methods out-perform several state-of-the-art methods. However, in RNNRec and GRURec, action vectors and item vectors are shared among users. The different meanings of the same action for different users are not considered. Similarly, different user preference for the same item is also ignored. To address this problem, the models of RNNRec and GRURec are modified in this paper. In the proposed methods, action vectors and item vectors are transformed into the user space for each user firstly, and then the transformed vectors are fed into the original neural networks of RNNRec and GRURec. The transformed action vectors and item vectors represent the user specified meaning of actions and the preference for items, which makes the proposed method obtain more accurate recommendation results. The experimental results on two real-life datasets indicate that the proposed method outperforms RNNRec and GRURec as well as other state-of-the-art approaches in most cases.展开更多
Based on consideration of the differential relations between the immeasurable variables and measurable variables in electro-hydraulic servo system,adaptive dynamic recurrent fuzzy neural networks(ADRFNNs) were employe...Based on consideration of the differential relations between the immeasurable variables and measurable variables in electro-hydraulic servo system,adaptive dynamic recurrent fuzzy neural networks(ADRFNNs) were employed to identify the primary uncertainty and the mathematic model of the system was turned into an equivalent linear model with terms of secondary uncertainty.At the same time,gain adaptive sliding mode variable structure control(GASMVSC) was employed to synthesize the control effort.The results show that the unrealization problem caused by some system's immeasurable state variables in traditional fuzzy neural networks(TFNN) taking all state variables as its inputs is overcome.On the other hand,the identification by the ADRFNNs online with high accuracy and the adaptive function of the correction term's gain in the GASMVSC make the system possess strong robustness and improved steady accuracy,and the chattering phenomenon of the control effort is also suppressed effectively.展开更多
基金Projects(70572090, 70373017) supported by the National Natural Science Foundation of China
文摘In order to accurately forecast the load of power system and enhance the stability of the power network, a novel unascertained mathematics based recurrent neural network (UMRNN) for power intelligence center (PIC) was created through three steps. First, by combining with the general project uncertain element transmission theory (GPUET), the basic definitions of stochastic, fuzzy, and grey uncertain elements were given based on the principal types of uncertain information. Second, a power dynamic alliance including four sectors: generation sector, transmission sector, distribution sector and customers was established. The key factors were amended according to the four transmission topologies of uncertain elements, thus the new factors entered the power intelligence center as the input elements. Finally, in the intelligence handing background of PIC, by performing uncertain and recursive process to the input values of network, and combining unascertained mathematics, the novel load forecasting model was built. Three different approaches were put forward to forecast an eastern regional power grid load in China. The root mean square error (ERMS) demonstrates that the forecasting accuracy of the proposed model UMRNN is 3% higher than that of BP neural network (BPNN), and 5% higher than that of autoregressive integrated moving average (ARIMA). Besides, an example also shows that the average relative error of the first quarter of 2008 forecasted by UMRNN is only 2.59%, which has high precision.
文摘For vehicle integrated navigation systems, real-time estimating states of the dead reckoning (DR) unit is much more difficult than that of the other measuring sensors under indefinite noises and nonlinear characteristics. Compared with the well known, extended Kalman filter (EKF), a recurrent neural network is proposed for the solution, which not only improves the location precision and the adaptive ability of resisting disturbances, but also avoids calculating the analytic derivation and Jacobian matrices of the nonlinear system model. To test the performances of the recurrent neural network, these two methods are used to estimate the state of the vehicle's DR navigation system. Simulation results show that the recurrent neural network is superior to the EKF and is a more ideal filtering method for vehicle DR navigation.
基金This project was supported by the National Natural Science Foundation of China(60174021)Natural Science Foundation Key Project of Tianjin(013800711).
文摘After a recursive multi-step-ahead predictor for nonlinear systems based on local recurrent neural networks is introduced, an intelligent FID controller is adopted to correct the errors including identified model errors and accumulated errors produced in the recursive process. Characterized by predictive control, this method can achieve a good control accuracy and has good robustness. A simulation study shows that this control algorithm is very effective.
文摘Nonlinear model predictive controllers(NMPC)can predict the future behavior of the under-controlled system using a nonlinear predictive model.Here,an array of hyper chaotic diagonal recurrent neural network(HCDRNN)was proposed for modeling and predicting the behavior of the under-controller nonlinear system in a moving forward window.In order to improve the convergence of the parameters of the HCDRNN to improve system’s modeling,the extent of chaos is adjusted using a logistic map in the hidden layer.A novel NMPC based on the HCDRNN array(HCDRNN-NMPC)was proposed that the control signal with the help of an improved gradient descent method was obtained.The controller was used to control a continuous stirred tank reactor(CSTR)with hard-nonlinearities and input constraints,in the presence of uncertainties including external disturbance.The results of the simulations show the superior performance of the proposed method in trajectory tracking and disturbance rejection.Parameter convergence and neglectable prediction error of the neural network(NN),guaranteed stability and high tracking performance are the most significant advantages of the proposed scheme.
基金Project(50276005) supported by the National Natural Science Foundation of China Projects (2006CB705400, 2003CB716206) supported by National Basic Research Program of China
文摘To avoid unstable learning, a stable adaptive learning algorithm was proposed for discrete-time recurrent neural networks. Unlike the dynamic gradient methods, such as the backpropagation through time and the real time recurrent learning, the weights of the recurrent neural networks were updated online in terms of Lyapunov stability theory in the proposed learning algorithm, so the learning stability was guaranteed. With the inversion of the activation function of the recurrent neural networks, the proposed learning algorithm can be easily implemented for solving varying nonlinear adaptive learning problems and fast convergence of the adaptive learning process can be achieved. Simulation experiments in pattern recognition show that only 5 iterations are needed for the storage of a 15×15 binary image pattern and only 9 iterations are needed for the perfect realization of an analog vector by an equilibrium state with the proposed learning algorithm.
基金supported by the National Natural Science Foundation of China(Grant No.12072090)。
文摘This work proposes a recorded recurrent twin delayed deep deterministic(RRTD3)policy gradient algorithm to solve the challenge of constructing guidance laws for intercepting endoatmospheric maneuvering missiles with uncertainties and observation noise.The attack-defense engagement scenario is modeled as a partially observable Markov decision process(POMDP).Given the benefits of recurrent neural networks(RNNs)in processing sequence information,an RNN layer is incorporated into the agent’s policy network to alleviate the bottleneck of traditional deep reinforcement learning methods while dealing with POMDPs.The measurements from the interceptor’s seeker during each guidance cycle are combined into one sequence as the input to the policy network since the detection frequency of an interceptor is usually higher than its guidance frequency.During training,the hidden states of the RNN layer in the policy network are recorded to overcome the partially observable problem that this RNN layer causes inside the agent.The training curves show that the proposed RRTD3 successfully enhances data efficiency,training speed,and training stability.The test results confirm the advantages of the RRTD3-based guidance laws over some conventional guidance laws.
基金supported by the Aeronautical Science Foundation of China(2017ZC53033)the Seed Foundation of Innovation and Creation for Graduate Students in Northwestern Polytechnical University(CX2020156)。
文摘In order to improve the autonomous ability of unmanned aerial vehicles(UAV)to implement air combat mission,many artificial intelligence-based autonomous air combat maneuver decision-making studies have been carried out,but these studies are often aimed at individual decision-making in 1 v1 scenarios which rarely happen in actual air combat.Based on the research of the 1 v1 autonomous air combat maneuver decision,this paper builds a multi-UAV cooperative air combat maneuver decision model based on multi-agent reinforcement learning.Firstly,a bidirectional recurrent neural network(BRNN)is used to achieve communication between UAV individuals,and the multi-UAV cooperative air combat maneuver decision model under the actor-critic architecture is established.Secondly,through combining with target allocation and air combat situation assessment,the tactical goal of the formation is merged with the reinforcement learning goal of every UAV,and a cooperative tactical maneuver policy is generated.The simulation results prove that the multi-UAV cooperative air combat maneuver decision model established in this paper can obtain the cooperative maneuver policy through reinforcement learning,the cooperative maneuver policy can guide UAVs to obtain the overall situational advantage and defeat the opponents under tactical cooperation.
文摘A neuromorphic continuous-time state space pole assignment adaptive controller is proposed, which is particularlyappropriate for controlling a large-scale time-variant state-space model due to the parallely distributed nature ofneurocomputing. In our approach, Hopfield neural network is exploited to identify the parameters of a continuous-timestate-space model, and a dedicated recurrent neural network is designed to compute pole placement feedback control law inreal time. Thus the identification and the control computation are incorporated in the closed-loop, adaptive, real-timecontrol system. The merit of this approach is that the neural networks converge to their solutions very quickly andsimultaneously.
文摘We are engaged in solving two difficult problems in adaptive control of the large-scale time-variant aerospace system. One is parameter identification of time-variant continuous-time state-space modei; the other is how to solve algebraic Riccati equation (ARE) of large order efficiently. In our approach, two neural networks are employed to independently solve both the system identification problem and the ARE associated with the optimal control problem. Thus the identification and the control computation are combined in closed-loop, adaptive, real-time control system . The advantage of this approach is that the neural networks converge to their solutions very quickly and simultaneously.
基金supported by the National Natural Science Foundation of China(61403350)。
文摘Deep learning based recommendation methods, such as the recurrent neural network based recommendation method(RNNRec) and the gated recurrent unit(GRU) based recommendation method(GRURec), are proposed to solve the problem of time heterogeneous feedback recommendation. These methods out-perform several state-of-the-art methods. However, in RNNRec and GRURec, action vectors and item vectors are shared among users. The different meanings of the same action for different users are not considered. Similarly, different user preference for the same item is also ignored. To address this problem, the models of RNNRec and GRURec are modified in this paper. In the proposed methods, action vectors and item vectors are transformed into the user space for each user firstly, and then the transformed vectors are fed into the original neural networks of RNNRec and GRURec. The transformed action vectors and item vectors represent the user specified meaning of actions and the preference for items, which makes the proposed method obtain more accurate recommendation results. The experimental results on two real-life datasets indicate that the proposed method outperforms RNNRec and GRURec as well as other state-of-the-art approaches in most cases.
基金Project(60634020) supported by the National Natural Science Foundation of China
文摘Based on consideration of the differential relations between the immeasurable variables and measurable variables in electro-hydraulic servo system,adaptive dynamic recurrent fuzzy neural networks(ADRFNNs) were employed to identify the primary uncertainty and the mathematic model of the system was turned into an equivalent linear model with terms of secondary uncertainty.At the same time,gain adaptive sliding mode variable structure control(GASMVSC) was employed to synthesize the control effort.The results show that the unrealization problem caused by some system's immeasurable state variables in traditional fuzzy neural networks(TFNN) taking all state variables as its inputs is overcome.On the other hand,the identification by the ADRFNNs online with high accuracy and the adaptive function of the correction term's gain in the GASMVSC make the system possess strong robustness and improved steady accuracy,and the chattering phenomenon of the control effort is also suppressed effectively.