The time dependent vehicle routing problem with time windows(TDVRPTW) is considered. A multi-type ant system(MTAS) algorithm hybridized with the ant colony system(ACS)and the max-min ant system(MMAS) algorithm...The time dependent vehicle routing problem with time windows(TDVRPTW) is considered. A multi-type ant system(MTAS) algorithm hybridized with the ant colony system(ACS)and the max-min ant system(MMAS) algorithms is proposed. This combination absorbs the merits of the two algorithms in solutions construction and optimization separately. In order to improve the efficiency of the insertion procedure, a nearest neighbor selection(NNS) mechanism, an insertion local search procedure and a local optimization procedure are specified in detail. And in order to find a balance between good scouting performance and fast convergence rate, an adaptive pheromone updating strategy is proposed in the MTAS. Computational results confirm the MTAS algorithm's good performance with all these strategies on classic vehicle routing problem with time windows(VRPTW) benchmark instances and the TDVRPTW instances, and some better results especially for the number of vehicles and travel times of the best solutions are obtained in comparison with the previous research.展开更多
Based on the reliability budget and percentile travel time(PTT) concept, a new travel time index named combined mean travel time(CMTT) under stochastic traffic network was proposed. CMTT here was defined as the convex...Based on the reliability budget and percentile travel time(PTT) concept, a new travel time index named combined mean travel time(CMTT) under stochastic traffic network was proposed. CMTT here was defined as the convex combination of the conditional expectations of PTT-below and PTT-excess travel times. The former was designed as a risk-optimistic travel time index, and the latter was a risk-pessimistic one. Hence, CMTT was able to describe various routing risk-attitudes. The central idea of CMTT was comprehensively illustrated and the difference among the existing travel time indices was analyzed. The Wardropian combined mean traffic equilibrium(CMTE) model was formulated as a variational inequality and solved via an alternating direction algorithm nesting extra-gradient projection process. Some mathematical properties of CMTT and CMTE model were rigorously proved. Finally, a numerical example was performed to characterize the CMTE network. It is founded that that risk-pessimism is of more benefit to a modest(or low) congestion and risk network, however, it changes to be risk-optimism for a high congestion and risk network.展开更多
For the position tracking control of hydraulic manipulators,a novel method of time delay control(TDC) with continuous nonsingular terminal sliding mode(CNTSM) was proposed in this work.Complex dynamics of the hydrauli...For the position tracking control of hydraulic manipulators,a novel method of time delay control(TDC) with continuous nonsingular terminal sliding mode(CNTSM) was proposed in this work.Complex dynamics of the hydraulic manipulator is approximately canceled by time delay estimation(TDE),which means the proposed method is model-free and no prior knowledge of the dynamics is required.Moreover,the CNTSM term with a fast-TSM-type reaching law ensures fast convergence and high-precision tracking control performance under heavy lumped uncertainties.Despite its considerable robustness against lumped uncertainties,the proposed control scheme is continuous and chattering-free and no pressure sensors are required in practical applications.Theoretical analysis and experimental results show that faster and higher-precision position tracking performance is achieved compared with the traditional CNTSM-based TDC method using boundary layers.展开更多
This paper studies the problem of the space station short-term mission planning, which aims to allocate the executing time of missions effectively, schedule the corresponding resources reasonably and arrange the time ...This paper studies the problem of the space station short-term mission planning, which aims to allocate the executing time of missions effectively, schedule the corresponding resources reasonably and arrange the time of the astronauts properly. A domain model is developed by using the ontology theory to describe the concepts, constraints and relations of the planning domain formally, abstractly and normatively. A method based on time iteration is adopted to solve the short-term planning problem. Meanwhile, the resolving strategies are proposed to resolve different kinds of conflicts induced by the constraints of power, heat, resource, astronaut and relationship. The proposed approach is evaluated in a test case with fifteen missions, thirteen resources and three astronauts. The results show that the developed domain ontology model is reasonable, and the time iteration method using the proposed resolving strategies can successfully obtain the plan satisfying all considered constraints.展开更多
A pre-selection space time model was proposed to estimate the traffic condition at poor-data-detector,especially non-detector locations.The space time model is better to integrate the spatial and temporal information ...A pre-selection space time model was proposed to estimate the traffic condition at poor-data-detector,especially non-detector locations.The space time model is better to integrate the spatial and temporal information comprehensibly.Firstly,the influencing factors of the "cause nodes" were studied,and then the pre-selection "cause nodes" procedure which utilizes the Pearson correlation coefficient to evaluate the relevancy of the traffic data was introduced.Finally,only the most relevant data were collected to compose the space time model.The experimental results with the actual data demonstrate that the model performs better than other three models.展开更多
As one of the basic inventory cost models, the (Q, τ)inventory cost model of dual suppliers with random procurement lead time is mostly formulated by using the concepts of "effective lead time" and "lead time de...As one of the basic inventory cost models, the (Q, τ)inventory cost model of dual suppliers with random procurement lead time is mostly formulated by using the concepts of "effective lead time" and "lead time demand", which may lead to an imprecise inventory cost. Through the real-time statistic of the inventory quantities, this paper considers the precise (Q, τ) inventory cost model of dual supplier procurement by using an infinitesimal dividing method. The traditional modeling method of the inventory cost for dual supplier procurement includes complex procedures. To reduce the complexity effectively, the presented method investigates the statistics properties in real-time of the inventory quantities with the application of the infinitesimal dividing method. It is proved that the optimal holding and shortage costs of dual supplier procurement are less than those of single supplier procurement respectively. With the assumption that both suppliers have the same distribution of lead times, the convexity of the cost function per unit time is proved. So the optimal solution can be easily obtained by applying the classical convex optimization methods. The numerical examples are given to verify the main conclusions.展开更多
To solve the scheduling problem of dual-armed cluster tools for wafer fabrications with residency time and reentrant constraints,a heuristic scheduling algorithm was developed.Firstly,on the basis of formulating sched...To solve the scheduling problem of dual-armed cluster tools for wafer fabrications with residency time and reentrant constraints,a heuristic scheduling algorithm was developed.Firstly,on the basis of formulating scheduling problems domain of dual-armed cluster tools,a non-integer programming model was set up with a minimizing objective function of the makespan.Combining characteristics of residency time and reentrant constraints,a scheduling algorithm of searching the optimal operation path of dual-armed transport module was presented under many kinds of robotic scheduling paths for dual-armed cluster tools.Finally,the experiments were designed to evaluate the proposed algorithm.The results show that the proposed algorithm is feasible and efficient for obtaining an optimal scheduling solution of dual-armed cluster tools with residency time and reentrant constraints.展开更多
Chronic hepatitis B infection is a major health problem,with approximately 350 million virus carriers worldwide.In Africa,about 30%-60% of children and 60%-100% of adults have
The mixing time of impact zone in liquid-continuous impinging streams reactor(LISR) is theoretically calculated by empirical model and modern micromixing model of the fluid mixing process, and the variation laws of ma...The mixing time of impact zone in liquid-continuous impinging streams reactor(LISR) is theoretically calculated by empirical model and modern micromixing model of the fluid mixing process, and the variation laws of macromixing time and micromixing time are quantitatively discussed. The results show that under a continuous and stable operating condition, as the paddle speed increases, the macromixing time and micromixing time calculated by the two models both decrease, even in a linkage equilibrium state. Simultaneously, as the paddle speed increases, the results figured by the two models tend to be consistent. It indicates that two models both are more suitable for calculation of mixing time in high paddle speed. Compared with the existing experimental results of this type of reactor, the mixing time computed in the speed of 1500 r/min is closer to it. These conclusions can provide an important reference for systematically studying the strengthening mechanism of LISR under continuous mixing conditions.展开更多
The decision-making and optimization of two-echelon inventory coordination were analyzed with service level constraint and controllable lead time sensitive to order quantity.First,the basic model of this problem was e...The decision-making and optimization of two-echelon inventory coordination were analyzed with service level constraint and controllable lead time sensitive to order quantity.First,the basic model of this problem was established and based on relevant analysis,the original model could be transformed by minimax method.Then,the optimal order quantity and production quantity influenced by service level constraint were analyzed and the boundary of optimal order quantity and production quantity was given.According to this boundary,the effective method and tactics were put forward to solve the transformed model.In case analysis,the optimal expected total cost of two-echelon inventory can be obtained and it was analyzed how service level constraint and safety factor influence the optimal expected total cost of two-echelon inventory.The results show that the optimal expected total cost of two-echelon inventory is constrained by the higher constraint between service level constraint and safety factor.展开更多
The reconstruction control of modular self-reconfigurable spacecraft (MSRS) is addressed using an adaptive sliding mode control (ASMC) scheme based on time-delay estimation (TDE) technology. In contrast to the ground,...The reconstruction control of modular self-reconfigurable spacecraft (MSRS) is addressed using an adaptive sliding mode control (ASMC) scheme based on time-delay estimation (TDE) technology. In contrast to the ground, the base of the MSRS is floating when assembled in orbit, resulting in a strong dynamic coupling effect. A TED-based ASMC technique with exponential reaching law is designed to achieve high-precision coordinated control between the spacecraft base and the robotic arm. TDE technology is used by the controller to compensate for coupling terms and uncertainties, while ASMC can augment and improve TDE’s robustness. To suppress TDE errors and eliminate chattering, a new adaptive law is created to modify gain parameters online, ensuring quick dynamic response and high tracking accuracy. The Lyapunov approach shows that the tracking errors are uniformly ultimately bounded (UUB). Finally, the on-orbit assembly process of MSRS is simulated to validate the efficacy of the proposed control scheme. The simulation results show that the proposed control method can accurately complete the target module’s on-orbit assembly, with minimal perturbations to the spacecraft’s attitude. Meanwhile, it has a high level of robustness and can effectively eliminate chattering.展开更多
Time series analysis is a key technology for medical diagnosis,weather forecasting and financial prediction systems.However,missing data frequently occur during data recording,posing a great challenge to data mining t...Time series analysis is a key technology for medical diagnosis,weather forecasting and financial prediction systems.However,missing data frequently occur during data recording,posing a great challenge to data mining tasks.In this study,we propose a novel time series data representation-based denoising autoencoder(DAE)for the reconstruction of missing values.Two data representation methods,namely,recurrence plot(RP)and Gramian angular field(GAF),are used to transform the raw time series to a 2D matrix for establishing the temporal correlations between different time intervals and extracting the structural patterns from the time series.Then an improved DAE is proposed to reconstruct the missing values from the 2D representation of time series.A comprehensive comparison is conducted amongst the different representations on standard datasets.Results show that the 2D representations have a lower reconstruction error than the raw time series,and the RP representation provides the best outcome.This work provides useful insights into the better reconstruction of missing values in time series analysis to considerably improve the reliability of timevarying system.展开更多
The authors propose a new persistent transmission based real time Ethernet MAC protocol that provides a predictable upper bound for the delivery delay of real time frames. Moreover, it is compatible with the protocol ...The authors propose a new persistent transmission based real time Ethernet MAC protocol that provides a predictable upper bound for the delivery delay of real time frames. Moreover, it is compatible with the protocol used by the existing Ethernet controllers for conventional datagram traffic and thus standard Ethernet stations can be used in the system without any modification. The paper describes the protocol in detail and analyses the maximum delivery delay for real time traffic and the efficiency of the channel.展开更多
Robust stabilization for a class of singular systems was studied by a new method based on time discretization, and a sufficient condition of the robust stabilization was obtained. Firstly, an approximate system of the...Robust stabilization for a class of singular systems was studied by a new method based on time discretization, and a sufficient condition of the robust stabilization was obtained. Firstly, an approximate system of the closed-loop system of the singular system with time-delays was obtained. The approximate system is a singular system in standard state space. Then, the robust stabilization of the singular system was investigated with time-delays by researching the stability of the approximate system using all the exiting analysis method. Finally, a numerical example was presented to verify the effectiveness of the new method.展开更多
In consideration of the field-of-view(FOV)angle con-straint,this study focuses on the guidance problem with impact time control.A deep reinforcement learning guidance method is given for the missile to obtain the desi...In consideration of the field-of-view(FOV)angle con-straint,this study focuses on the guidance problem with impact time control.A deep reinforcement learning guidance method is given for the missile to obtain the desired impact time and meet the demand of FOV angle constraint.On basis of the framework of the proportional navigation guidance,an auxiliary control term is supplemented by the distributed deep deterministic policy gradient algorithm,in which the reward functions are developed to decrease the time-to-go error and improve the terminal guid-ance accuracy.The numerical simulation demonstrates that the missile governed by the presented deep reinforcement learning guidance law can hit the target successfully at appointed arrival time.展开更多
文摘The time dependent vehicle routing problem with time windows(TDVRPTW) is considered. A multi-type ant system(MTAS) algorithm hybridized with the ant colony system(ACS)and the max-min ant system(MMAS) algorithms is proposed. This combination absorbs the merits of the two algorithms in solutions construction and optimization separately. In order to improve the efficiency of the insertion procedure, a nearest neighbor selection(NNS) mechanism, an insertion local search procedure and a local optimization procedure are specified in detail. And in order to find a balance between good scouting performance and fast convergence rate, an adaptive pheromone updating strategy is proposed in the MTAS. Computational results confirm the MTAS algorithm's good performance with all these strategies on classic vehicle routing problem with time windows(VRPTW) benchmark instances and the TDVRPTW instances, and some better results especially for the number of vehicles and travel times of the best solutions are obtained in comparison with the previous research.
基金Project(2012CB725403-5)supported by National Basic Research Program of ChinaProject(71131001-2)supported by National Natural Science Foundation of China+1 种基金Projects(2012JBZ005)supported by Fundamental Research Funds for the Central Universities,ChinaProject(201170)supported by the Foundation for National Excellent Doctoral Dissertation of China
文摘Based on the reliability budget and percentile travel time(PTT) concept, a new travel time index named combined mean travel time(CMTT) under stochastic traffic network was proposed. CMTT here was defined as the convex combination of the conditional expectations of PTT-below and PTT-excess travel times. The former was designed as a risk-optimistic travel time index, and the latter was a risk-pessimistic one. Hence, CMTT was able to describe various routing risk-attitudes. The central idea of CMTT was comprehensively illustrated and the difference among the existing travel time indices was analyzed. The Wardropian combined mean traffic equilibrium(CMTE) model was formulated as a variational inequality and solved via an alternating direction algorithm nesting extra-gradient projection process. Some mathematical properties of CMTT and CMTE model were rigorously proved. Finally, a numerical example was performed to characterize the CMTE network. It is founded that that risk-pessimism is of more benefit to a modest(or low) congestion and risk network, however, it changes to be risk-optimism for a high congestion and risk network.
基金Project(51004085)supported by the National Natural Science Foundation of China
文摘For the position tracking control of hydraulic manipulators,a novel method of time delay control(TDC) with continuous nonsingular terminal sliding mode(CNTSM) was proposed in this work.Complex dynamics of the hydraulic manipulator is approximately canceled by time delay estimation(TDE),which means the proposed method is model-free and no prior knowledge of the dynamics is required.Moreover,the CNTSM term with a fast-TSM-type reaching law ensures fast convergence and high-precision tracking control performance under heavy lumped uncertainties.Despite its considerable robustness against lumped uncertainties,the proposed control scheme is continuous and chattering-free and no pressure sensors are required in practical applications.Theoretical analysis and experimental results show that faster and higher-precision position tracking performance is achieved compared with the traditional CNTSM-based TDC method using boundary layers.
基金supported by the National Natural Science Foundation of China(11402295)the Science Project of National University of Defense Technology(JC14-01-05)the Hunan Provincial Natural Science Foundation of China(2015JJ3020)
文摘This paper studies the problem of the space station short-term mission planning, which aims to allocate the executing time of missions effectively, schedule the corresponding resources reasonably and arrange the time of the astronauts properly. A domain model is developed by using the ontology theory to describe the concepts, constraints and relations of the planning domain formally, abstractly and normatively. A method based on time iteration is adopted to solve the short-term planning problem. Meanwhile, the resolving strategies are proposed to resolve different kinds of conflicts induced by the constraints of power, heat, resource, astronaut and relationship. The proposed approach is evaluated in a test case with fifteen missions, thirteen resources and three astronauts. The results show that the developed domain ontology model is reasonable, and the time iteration method using the proposed resolving strategies can successfully obtain the plan satisfying all considered constraints.
基金Project(D101106049710005) supported by the Beijing Science Foundation Program,ChinaProject(61104164) supported by the National Natural Science Foundation,China
文摘A pre-selection space time model was proposed to estimate the traffic condition at poor-data-detector,especially non-detector locations.The space time model is better to integrate the spatial and temporal information comprehensibly.Firstly,the influencing factors of the "cause nodes" were studied,and then the pre-selection "cause nodes" procedure which utilizes the Pearson correlation coefficient to evaluate the relevancy of the traffic data was introduced.Finally,only the most relevant data were collected to compose the space time model.The experimental results with the actual data demonstrate that the model performs better than other three models.
基金supported by the National High Technology Research and Development Program of China(863 Program)(2007AA04Z102)the National Natural Science Foundation of China(6087407160574077).
文摘As one of the basic inventory cost models, the (Q, τ)inventory cost model of dual suppliers with random procurement lead time is mostly formulated by using the concepts of "effective lead time" and "lead time demand", which may lead to an imprecise inventory cost. Through the real-time statistic of the inventory quantities, this paper considers the precise (Q, τ) inventory cost model of dual supplier procurement by using an infinitesimal dividing method. The traditional modeling method of the inventory cost for dual supplier procurement includes complex procedures. To reduce the complexity effectively, the presented method investigates the statistics properties in real-time of the inventory quantities with the application of the infinitesimal dividing method. It is proved that the optimal holding and shortage costs of dual supplier procurement are less than those of single supplier procurement respectively. With the assumption that both suppliers have the same distribution of lead times, the convexity of the cost function per unit time is proved. So the optimal solution can be easily obtained by applying the classical convex optimization methods. The numerical examples are given to verify the main conclusions.
基金Projects(7107111561273035)supported by the National Natural Science Foundation of China
文摘To solve the scheduling problem of dual-armed cluster tools for wafer fabrications with residency time and reentrant constraints,a heuristic scheduling algorithm was developed.Firstly,on the basis of formulating scheduling problems domain of dual-armed cluster tools,a non-integer programming model was set up with a minimizing objective function of the makespan.Combining characteristics of residency time and reentrant constraints,a scheduling algorithm of searching the optimal operation path of dual-armed transport module was presented under many kinds of robotic scheduling paths for dual-armed cluster tools.Finally,the experiments were designed to evaluate the proposed algorithm.The results show that the proposed algorithm is feasible and efficient for obtaining an optimal scheduling solution of dual-armed cluster tools with residency time and reentrant constraints.
基金supported by the National Natural Science Foundationof China,No.60774036the NSF of Hubei Province 2008CDA063
文摘Chronic hepatitis B infection is a major health problem,with approximately 350 million virus carriers worldwide.In Africa,about 30%-60% of children and 60%-100% of adults have
基金Project(51276131)supported by the National Natural Science Foundation of ChinaProject(ZRZ0316)supported by the Natural Science Foundation of Hubei Province,ChinaProject(2013070104010025)supported by the Morning Glory Project of Wuhan Science and Technology Bureau,China
文摘The mixing time of impact zone in liquid-continuous impinging streams reactor(LISR) is theoretically calculated by empirical model and modern micromixing model of the fluid mixing process, and the variation laws of macromixing time and micromixing time are quantitatively discussed. The results show that under a continuous and stable operating condition, as the paddle speed increases, the macromixing time and micromixing time calculated by the two models both decrease, even in a linkage equilibrium state. Simultaneously, as the paddle speed increases, the results figured by the two models tend to be consistent. It indicates that two models both are more suitable for calculation of mixing time in high paddle speed. Compared with the existing experimental results of this type of reactor, the mixing time computed in the speed of 1500 r/min is closer to it. These conclusions can provide an important reference for systematically studying the strengthening mechanism of LISR under continuous mixing conditions.
基金Project(71102174,71372019)supported by the National Natural Science Foundation of ChinaProject(9123028)supported by the Beijing Natural Science Foundation of China+3 种基金Project(20111101120019)supported by the Specialized Research Fund for Doctoral Program of Higher Education of ChinaProject(11JGC106)supported by the Beijing Philosophy&Social Science Foundation of ChinaProjects(NCET-10-0048,NCET-10-0043)supported by the Program for New Century Excellent Talents in University of ChinaProject(2010YC1307)supported by Excellent Young Teacher in Beijing Institute of Technology of China
文摘The decision-making and optimization of two-echelon inventory coordination were analyzed with service level constraint and controllable lead time sensitive to order quantity.First,the basic model of this problem was established and based on relevant analysis,the original model could be transformed by minimax method.Then,the optimal order quantity and production quantity influenced by service level constraint were analyzed and the boundary of optimal order quantity and production quantity was given.According to this boundary,the effective method and tactics were put forward to solve the transformed model.In case analysis,the optimal expected total cost of two-echelon inventory can be obtained and it was analyzed how service level constraint and safety factor influence the optimal expected total cost of two-echelon inventory.The results show that the optimal expected total cost of two-echelon inventory is constrained by the higher constraint between service level constraint and safety factor.
基金This study was supported by the National Defense Science and Technology Innovation Zone of China(Grant No.00205501).
文摘The reconstruction control of modular self-reconfigurable spacecraft (MSRS) is addressed using an adaptive sliding mode control (ASMC) scheme based on time-delay estimation (TDE) technology. In contrast to the ground, the base of the MSRS is floating when assembled in orbit, resulting in a strong dynamic coupling effect. A TED-based ASMC technique with exponential reaching law is designed to achieve high-precision coordinated control between the spacecraft base and the robotic arm. TDE technology is used by the controller to compensate for coupling terms and uncertainties, while ASMC can augment and improve TDE’s robustness. To suppress TDE errors and eliminate chattering, a new adaptive law is created to modify gain parameters online, ensuring quick dynamic response and high tracking accuracy. The Lyapunov approach shows that the tracking errors are uniformly ultimately bounded (UUB). Finally, the on-orbit assembly process of MSRS is simulated to validate the efficacy of the proposed control scheme. The simulation results show that the proposed control method can accurately complete the target module’s on-orbit assembly, with minimal perturbations to the spacecraft’s attitude. Meanwhile, it has a high level of robustness and can effectively eliminate chattering.
文摘Time series analysis is a key technology for medical diagnosis,weather forecasting and financial prediction systems.However,missing data frequently occur during data recording,posing a great challenge to data mining tasks.In this study,we propose a novel time series data representation-based denoising autoencoder(DAE)for the reconstruction of missing values.Two data representation methods,namely,recurrence plot(RP)and Gramian angular field(GAF),are used to transform the raw time series to a 2D matrix for establishing the temporal correlations between different time intervals and extracting the structural patterns from the time series.Then an improved DAE is proposed to reconstruct the missing values from the 2D representation of time series.A comprehensive comparison is conducted amongst the different representations on standard datasets.Results show that the 2D representations have a lower reconstruction error than the raw time series,and the RP representation provides the best outcome.This work provides useful insights into the better reconstruction of missing values in time series analysis to considerably improve the reliability of timevarying system.
基金TheNationalNaturalScienceFoundationofChina (No .6 9984 0 0 3)
文摘The authors propose a new persistent transmission based real time Ethernet MAC protocol that provides a predictable upper bound for the delivery delay of real time frames. Moreover, it is compatible with the protocol used by the existing Ethernet controllers for conventional datagram traffic and thus standard Ethernet stations can be used in the system without any modification. The paper describes the protocol in detail and analyses the maximum delivery delay for real time traffic and the efficiency of the channel.
文摘Robust stabilization for a class of singular systems was studied by a new method based on time discretization, and a sufficient condition of the robust stabilization was obtained. Firstly, an approximate system of the closed-loop system of the singular system with time-delays was obtained. The approximate system is a singular system in standard state space. Then, the robust stabilization of the singular system was investigated with time-delays by researching the stability of the approximate system using all the exiting analysis method. Finally, a numerical example was presented to verify the effectiveness of the new method.
基金supported by the National Natural Science Foundation of China(62003021,62373304)Industry-University-Research Innovation Fund for Chinese Universities(2021ZYA02009)+2 种基金Shaanxi Qinchuangyuan High-level Innovation and Entrepreneurship Talent Project(OCYRCXM-2022-136)Shaanxi Association for Science and Technology Youth Talent Support Program(XXJS202218)the Fundamental Research Funds for the Central Universities(D5000210830).
文摘In consideration of the field-of-view(FOV)angle con-straint,this study focuses on the guidance problem with impact time control.A deep reinforcement learning guidance method is given for the missile to obtain the desired impact time and meet the demand of FOV angle constraint.On basis of the framework of the proportional navigation guidance,an auxiliary control term is supplemented by the distributed deep deterministic policy gradient algorithm,in which the reward functions are developed to decrease the time-to-go error and improve the terminal guid-ance accuracy.The numerical simulation demonstrates that the missile governed by the presented deep reinforcement learning guidance law can hit the target successfully at appointed arrival time.