A new file assignment strategy of parallel I/O, which is named heuristic file sorted assignment algorithm was proposed on cluster computing system. Based on the load balancing, it assigns the files to the same disk ac...A new file assignment strategy of parallel I/O, which is named heuristic file sorted assignment algorithm was proposed on cluster computing system. Based on the load balancing, it assigns the files to the same disk according to the similar service time. Firstly, the files were sorted and stored at the set I in descending order in terms of their service time, then one disk of cluster node was selected randomly when the files were to be assigned, and at last the continuous files were taken orderly from the set I to the disk until the disk reached its load maximum. The experimental results show that the new strategy improves the performance by 20.2% when the load of the system is light and by 31.6% when the load is heavy. And the higher the data access rate, the more evident the improvement of the performance obtained by the heuristic file sorted assignment algorithm.展开更多
In this paper,a new technique is introduced to construct higher-order iterative methods for solving nonlinear systems.The order of convergence of some iterative methods can be improved by three at the cost of introduc...In this paper,a new technique is introduced to construct higher-order iterative methods for solving nonlinear systems.The order of convergence of some iterative methods can be improved by three at the cost of introducing only one additional evaluation of the function in each step.Furthermore,some new efficient methods with a higher-order of convergence are obtained by using only a single matrix inversion in each iteration.Analyses of convergence properties and computational efficiency of these new methods are made and testified by several numerical problems.By comparison,the new schemes are more efficient than the corresponding existing ones,particularly for large problem sizes.展开更多
Federated learning(FL)is a distributed machine learning paradigm for edge cloud computing.FL can facilitate data-driven decision-making in tactical scenarios,effectively addressing both data volume and infrastructure ...Federated learning(FL)is a distributed machine learning paradigm for edge cloud computing.FL can facilitate data-driven decision-making in tactical scenarios,effectively addressing both data volume and infrastructure challenges in edge environments.However,the diversity of clients in edge cloud computing presents significant challenges for FL.Personalized federated learning(pFL)received considerable attention in recent years.One example of pFL involves exploiting the global and local information in the local model.Current pFL algorithms experience limitations such as slow convergence speed,catastrophic forgetting,and poor performance in complex tasks,which still have significant shortcomings compared to the centralized learning.To achieve high pFL performance,we propose FedCLCC:Federated Contrastive Learning and Conditional Computing.The core of FedCLCC is the use of contrastive learning and conditional computing.Contrastive learning determines the feature representation similarity to adjust the local model.Conditional computing separates the global and local information and feeds it to their corresponding heads for global and local handling.Our comprehensive experiments demonstrate that FedCLCC outperforms other state-of-the-art FL algorithms.展开更多
Fog computing has emerged as an important technology which can improve the performance of computation-intensive and latency-critical communication networks.Nevertheless,the fog computing Internet-of-Things(IoT)systems...Fog computing has emerged as an important technology which can improve the performance of computation-intensive and latency-critical communication networks.Nevertheless,the fog computing Internet-of-Things(IoT)systems are susceptible to malicious eavesdropping attacks during the information transmission,and this issue has not been adequately addressed.In this paper,we propose a physical-layer secure fog computing IoT system model,which is able to improve the physical layer security of fog computing IoT networks against the malicious eavesdropping of multiple eavesdroppers.The secrecy rate of the proposed model is analyzed,and the quantum galaxy–based search algorithm(QGSA)is proposed to solve the hybrid task scheduling and resource management problem of the network.The computational complexity and convergence of the proposed algorithm are analyzed.Simulation results validate the efficiency of the proposed model and reveal the influence of various environmental parameters on fog computing IoT networks.Moreover,the simulation results demonstrate that the proposed hybrid task scheduling and resource management scheme can effectively enhance secrecy performance across different communication scenarios.展开更多
This paper presents a kind of artificial intelligent system-generalized computing system (GCS for short), and introduces its mathematical description, implement problem and learning problem.
Peta-scale high-perfomlance computing systems are increasingly built with heterogeneous CPU and GPU nodes to achieve higher power efficiency and computation throughput. While providing unprecedented capabilities to co...Peta-scale high-perfomlance computing systems are increasingly built with heterogeneous CPU and GPU nodes to achieve higher power efficiency and computation throughput. While providing unprecedented capabilities to conduct computational experiments of historic significance, these systems are presently difficult to program. The users, who are domain experts rather than computer experts, prefer to use programming models closer to their domains (e.g., physics and biology) rather than MPI and OpenME This has led the development of domain-specific programming that provides domain-specific programming interfaces but abstracts away some performance-critical architecture details. Based on experience in designing large-scale computing systems, a hybrid programming framework for scientific computing on heterogeneous architectures is proposed in this work. Its design philosophy is to provide a collaborative mechanism for domain experts and computer experts so that both domain-specific knowledge and performance-critical architecture details can be adequately exploited. Two real-world scientific applications have been evaluated on TH-IA, a peta-scale CPU-GPU heterogeneous system that is currently the 5th fastest supercomputer in the world. The experimental results show that the proposed framework is well suited for developing large-scale scientific computing applications on peta-scale heterogeneous CPU/GPU systems.展开更多
Objective To establish a body composition analysis system based on chest CT,and to observe its value for evaluating content of chest muscle and adipose.Methods T7—T8 layer CT images of 108 pneumonia patients were col...Objective To establish a body composition analysis system based on chest CT,and to observe its value for evaluating content of chest muscle and adipose.Methods T7—T8 layer CT images of 108 pneumonia patients were collected(segmented dataset),and chest CT data of 984 patients were screened from the COVID 19-CT dataset(10 cases were randomly selected as whole test dataset,the remaining 974 cases were selected as layer selection dataset).T7—T8 layer was classified based on convolutional neural network(CNN)derived networks,including ResNet,ResNeXt,MobileNet,ShuffleNet,DenseNet,EfficientNet and ConvNeXt,then the accuracy,precision,recall and specificity were used to evaluate the performance of layer selection dataset.The skeletal muscle(SM),subcutaneous adipose tissue(SAT),intermuscular adipose tissue(IMAT)and visceral adipose tissue(VAT)were segmented using classical fully CNN(FCN)derived network,including FCN,SegNet,UNet,Attention UNet,UNET++,nnUNet,UNeXt and CMUNeXt,then Dice similarity coefficient(DSC),intersection over union(IoU)and 95 Hausdorff distance(HD)were used to evaluate the performance of segmented dataset.The automatic body composition analysis system was constructed based on optimal layer selection network and segmentation network,the mean absolute error(MAE),root mean squared error(RMSE)and standard deviation(SD)of MAE were used to evaluate the performance of automatic system for testing the whole test dataset.Results The accuracy,precision,recall and specificity of DenseNet network for automatically classifying T7—T8 layer from chest CT images was 95.06%,84.83%,92.27%and 95.78%,respectively,which were all higher than those of the other layer selection networks.In segmentation of SM,SAT,IMAT and overall,DSC and IoU of UNet++network were all higher,while 95HD of UNet++network were all lower than those of the other segmentation networks.Using DenseNet as the layer selection network and UNet++as the segmentation network,MAE of the automatic body composition analysis system for predicting SM,SAT,IMAT,VAT and MAE was 27.09,6.95,6.65 and 3.35 cm 2,respectively.Conclusion The body composition analysis system based on chest CT could be used to assess content of chest muscle and adipose.Among them,the UNet++network had better segmentation performance in adipose tissue than SM.展开更多
Quorum systems have been used to solve the problem of data consistency in distributed fault-tolerance systems. But when intrusions occur, traditional quorum systems have some disadvantages. For example, synchronous qu...Quorum systems have been used to solve the problem of data consistency in distributed fault-tolerance systems. But when intrusions occur, traditional quorum systems have some disadvantages. For example, synchronous quorum systems are subject to DOS attacks, while asynchronous quorum systems need a larger system size (at least 3f+1 for generic data, and f fewer for self-verifying data). In order to solve the problems above, an intrusion-tolerance quorum system (ITQS) of hybrid time model based on trust timely computing base is presented (TTCB). The TTCB is a trust secure real-time component inside the server with a well defined interface and separated from the operation system. It is in the synchronous communication environment while the application layer in the server deals with read-write requests and executes update-copy protocols asynchronously. The architectural hybridization of synchrony and asynchrony can achieve the data consistency and availability correctly. We also build two kinds of ITQSes based on TTCB, i.e., the symmetrical and the asymmetrical TTCB quorum systems. In the performance evaluations, we show that TTCB quorum systems are of smaller size, lower load and higher availability.展开更多
A deep understanding of the internal ballistic process and the factors affecting it is of primary importance to efficiently design a gun system and ensure its safe management. One of the main goals of internal ballist...A deep understanding of the internal ballistic process and the factors affecting it is of primary importance to efficiently design a gun system and ensure its safe management. One of the main goals of internal ballistics is to estimate the gas pressure into the combustion chamber and the projectile muzzle velocity in order to use the propellant to its higher efficiency while avoiding over-pressure phenomena. Dealing with the internal ballistic problem is a complex undertaking since it requires handling the interaction between different constituents during a transient time lapse with very steep rise of pressure and temperature. Several approaches have been proposed in the literature, based on different assumptions and techniques. Generally, depending on the used mathematical framework, they can be classified into two categories: computational fluid dynamics-based models and lumped-parameter ones. By focusing on gun systems, this paper offers a review of the main contributions in the field by mentioning their advantages and drawbacks. An insight into the limitations of the currently available modelling strategies is provided,as well as some considerations on the choice of one model over another. Lumped-parameter models, for example, are a good candidate for performing parametric analysis and optimisation processes of gun systems, given their minimum requirements of computer resources. Conversely, CFD-based models have a better capacity to address more sophisticated phenomena like pressure waves and turbulent flow effects. The performed review also reveals that too little attention has been given to small calibre guns since the majority of currently available models are conceived for medium and large calibre gun systems.Similarly, aspects like wear phenomena, bore deformations or projectile-barrel interactions still need to be adequately addressed and our suggestion is to dedicate more effort on it.展开更多
A dynamic multi-beam resource allocation algorithm for large low Earth orbit(LEO)constellation based on on-board distributed computing is proposed in this paper.The allocation is a combinatorial optimization process u...A dynamic multi-beam resource allocation algorithm for large low Earth orbit(LEO)constellation based on on-board distributed computing is proposed in this paper.The allocation is a combinatorial optimization process under a series of complex constraints,which is important for enhancing the matching between resources and requirements.A complex algorithm is not available because that the LEO on-board resources is limi-ted.The proposed genetic algorithm(GA)based on two-dimen-sional individual model and uncorrelated single paternal inheri-tance method is designed to support distributed computation to enhance the feasibility of on-board application.A distributed system composed of eight embedded devices is built to verify the algorithm.A typical scenario is built in the system to evalu-ate the resource allocation process,algorithm mathematical model,trigger strategy,and distributed computation architec-ture.According to the simulation and measurement results,the proposed algorithm can provide an allocation result for more than 1500 tasks in 14 s and the success rate is more than 91%in a typical scene.The response time is decreased by 40%com-pared with the conditional GA.展开更多
[Objective]Real-time monitoring of cow ruminant behavior is of paramount importance for promptly obtaining relevant information about cow health and predicting cow diseases.Currently,various strategies have been propo...[Objective]Real-time monitoring of cow ruminant behavior is of paramount importance for promptly obtaining relevant information about cow health and predicting cow diseases.Currently,various strategies have been proposed for monitoring cow ruminant behavior,including video surveillance,sound recognition,and sensor monitoring methods.How‐ever,the application of edge device gives rise to the issue of inadequate real-time performance.To reduce the volume of data transmission and cloud computing workload while achieving real-time monitoring of dairy cow rumination behavior,a real-time monitoring method was proposed for cow ruminant behavior based on edge computing.[Methods]Autono‐mously designed edge devices were utilized to collect and process six-axis acceleration signals from cows in real-time.Based on these six-axis data,two distinct strategies,federated edge intelligence and split edge intelligence,were investigat‐ed for the real-time recognition of cow ruminant behavior.Focused on the real-time recognition method for cow ruminant behavior leveraging federated edge intelligence,the CA-MobileNet v3 network was proposed by enhancing the MobileNet v3 network with a collaborative attention mechanism.Additionally,a federated edge intelligence model was designed uti‐lizing the CA-MobileNet v3 network and the FedAvg federated aggregation algorithm.In the study on split edge intelli‐gence,a split edge intelligence model named MobileNet-LSTM was designed by integrating the MobileNet v3 network with a fusion collaborative attention mechanism and the Bi-LSTM network.[Results and Discussions]Through compara‐tive experiments with MobileNet v3 and MobileNet-LSTM,the federated edge intelligence model based on CA-Mo‐bileNet v3 achieved an average Precision rate,Recall rate,F1-Score,Specificity,and Accuracy of 97.1%,97.9%,97.5%,98.3%,and 98.2%,respectively,yielding the best recognition performance.[Conclusions]It is provided a real-time and effective method for monitoring cow ruminant behavior,and the proposed federated edge intelligence model can be ap‐plied in practical settings.展开更多
Risk management often plays an important role in decision making un-der uncertainty.In quantitative risk management,assessing and optimizing risk metrics requires eficient computing techniques and reliable theoretical...Risk management often plays an important role in decision making un-der uncertainty.In quantitative risk management,assessing and optimizing risk metrics requires eficient computing techniques and reliable theoretical guarantees.In this pa-per,we introduce several topics on quantitative risk management and review some of the recent studies and advancements on the topics.We consider several risk metrics and study decision models that involve the metrics,with a main focus on the related com-puting techniques and theoretical properties.We show that stochastic optimization,as a powerful tool,can be leveraged to effectively address these problems.展开更多
In order to improve the efficiency of cloud-based web services,an improved plant growth simulation algorithm scheduling model.This model first used mathematical methods to describe the relationships between cloud-base...In order to improve the efficiency of cloud-based web services,an improved plant growth simulation algorithm scheduling model.This model first used mathematical methods to describe the relationships between cloud-based web services and the constraints of system resources.Then,a light-induced plant growth simulation algorithm was established.The performance of the algorithm was compared through several plant types,and the best plant model was selected as the setting for the system.Experimental results show that when the number of test cloud-based web services reaches 2048,the model being 2.14 times faster than PSO,2.8 times faster than the ant colony algorithm,2.9 times faster than the bee colony algorithm,and a remarkable 8.38 times faster than the genetic algorithm.展开更多
As a non-contact ultra-precision machining method,abrasive water jet polishing(AWJP)has signi-ficant application in optical elements processing due to its stable tool influence function(TIF),no subsurface damage and s...As a non-contact ultra-precision machining method,abrasive water jet polishing(AWJP)has signi-ficant application in optical elements processing due to its stable tool influence function(TIF),no subsurface damage and strong adaptability to workpiece shapes.In this study,the effects of jet pressure,nozzle diameter and impinging angle on the distribution of pressure,velocity and wall shear stress in the polishing flow field were systematically analyzed by computational fluid dynamics(CFD)simulation.Based on the Box-Behnken experimental design,a response surface regression model was constructed to investigate the influence mech-anism of process parameters on material removal rate(MRR)and surface roughness(Ra)of fused silica.And experimental results showed that increasing jet pressure and nozzle diameter significantly improved MRR,consistent with shear stress distribution revealed by CFD simulations.However,increasing jet pressure and impinging angle caused higher Ra values,which was unfavorable for surface quality improvement.Genetic algorithm(GA)was used for multi-objective optimization to establish Pareto solutions,achieving concurrent optimization of polishing efficiency and surface quality.A parameter combination of 2 MPa jet pressure,0.3 mm nozzle diameter,and 30°impinging angle achieved MRR of 169.05μm^(3)/s and Ra of 0.50 nm.Exper-imental verification showed prediction errors of 4.4%(MRR)and 3.8%(Ra),confirming the model’s reliabil-ity.This parameter optimization system provides theoretical basis and technical support for ultra-precision polishing of complex curved optical components.展开更多
Fourier Ptychographic Microscopy(FPM)is a high-throughput computational optical imaging technology reported in 2013.It effectively breaks through the trade-off between high-resolution imaging and wide-field imaging.In...Fourier Ptychographic Microscopy(FPM)is a high-throughput computational optical imaging technology reported in 2013.It effectively breaks through the trade-off between high-resolution imaging and wide-field imaging.In recent years,it has been found that FPM is not only a tool to break through the trade-off between field of view and spatial resolution,but also a paradigm to break through those trade-off problems,thus attracting extensive attention.Compared with previous reviews,this review does not introduce its concept,basic principles,optical system and series of applications once again,but focuses on elaborating the three major difficulties faced by FPM technology in the process from“looking good”in the laboratory to“working well”in practical applications:mismatch between numerical model and physical reality,long reconstruction time and high computing power demand,and lack of multi-modal expansion.It introduces how to achieve key technological innovations in FPM through the dual drive of Artificial Intelligence(AI)and physics,including intelligent reconstruction algorithms introducing machine learning concepts,optical-algorithm co-design,fusion of frequency domain extrapolation methods and generative adversarial networks,multi-modal imaging schemes and data fusion enhancement,etc.,gradually solving the difficulties of FPM technology.Conversely,this review deeply considers the unique value of FPM technology in potentially feeding back to the development of“AI+optics”,such as providing AI benchmark tests under physical constraints,inspirations for the balance of computing power and bandwidth in miniaturized intelligent microscopes,and photoelectric hybrid architectures.Finally,it introduces the industrialization path and frontier directions of FPM technology,pointing out that with the promotion of the dual drive of AI and physics,it will generate a large number of industrial application case,and looks forward to the possibilities of future application scenarios and expansions,for instance,body fluid biopsy and point-of-care testing at the grassroots level represent the expansion of the growth market.展开更多
Trans-medium flight vehicles can combine high aerial maneuverability and underwater concealment ability,which have attracted much attention recently.As the most crucial procedure,the trajectory design generally determ...Trans-medium flight vehicles can combine high aerial maneuverability and underwater concealment ability,which have attracted much attention recently.As the most crucial procedure,the trajectory design generally determines the trans-medium flight vehicle performance.To quantitatively analyze the flight vehicle performance,an entire aerial-aquatic trajectory model is developed in this paper.Different from modeling a trajectory purely for the water entry process,the constructed entire trajectory model has integrated aerial,water entry,and underwater trajectories together,which can consider the influence of the connected trajectories.As for the aerial and underwater trajectories,explicit dynamic models are established to obtain the trajectory parameters.Due to the complicated fluid force during high-velocity water entry,a computational fluid dynamics model is investigated to analyze this phase.The compu-tational domain size is adaptively refined according to the final aerial trajectory state,where the redundant computational domain is removed.An entire trajectory optimization problem is then formulated to maximize the total flight range via tuning the joint states of different trajectories.Simultaneously,several constraints,i.e.,the max impact load,trajectory height,etc.,are involved in the optimization problem.Rather than directly optimizing by a heuristic algorithm,a multi-surrogate cooperative sampling-based optimization method is proposed to alleviate the computational complexity of the entire trajectory optimization problem.In this method,various surrogates coopera-tively generate infill sample points,thereby preventing the poor approximation.After optimization,the total flight range can be improved by 20%,while all the constraints are satisfied.The result demonstrates the effectiveness and practicability of the developed model and optimization framework.展开更多
Installing the splitter plates is a passive aerodynamic solution for eliminating vortex-induced vibration (VIV). However, the influences of splitter plates on the VIV and aerostatic performances are more complicated d...Installing the splitter plates is a passive aerodynamic solution for eliminating vortex-induced vibration (VIV). However, the influences of splitter plates on the VIV and aerostatic performances are more complicated due to aerodynamic interference between highway and railway decks. To study the effects of splitter plates, wind tunnel experiments for measuring VIV and aerostatic forces of twin decks under two opposite flow directions were conducted, while the surrounding flow and wind pressure of static twin decks with and without splitter plates are numerically simulated. The results showed that the incoming flow direction affects the VIV response and aerostatic coefficients. The highway deck has poor vertical and torsional VIV, and the VIV region and amplitude are different under different directions. While the railway deck only has vertical VIV when located upstream. The splitter plates can impede the process of vortex generation, shedding and impinging at the gap between twin deck, and significantly reducing the surface fluctuating pressure coefficient, thus effectively suppressing the VIV of twin decks. While, the splitter plates hurt the upstream deck regarding static wind stability and have little effect on the downstream deck. The splitter plates of appropriate width are recommended to improve VIV performances in twin parallel bridges.展开更多
Equipment systems-of-systems (SOS) effectiveness evaluation can provide important reference for construction and optimization of the equipment SoS. After discussing the basic theory and methods of parallel experimen...Equipment systems-of-systems (SOS) effectiveness evaluation can provide important reference for construction and optimization of the equipment SoS. After discussing the basic theory and methods of parallel experiments, we depict an SoS effectiveness analysis and evaluation method using parallel expe- riments theory in detail. A case study is carried out which takes the missile defense system as an example. An artificial system of the missile defense system is constructed with the multi-agent modeling method. Then, single factor, multiple factors and defense position deployment computational experiments are carried out and evaluated with the statistical analysis method. Experiment re- sults show that the altitude of the secondary interception missile is not the key factor which affects SoS effectiveness and putting the defense position ahead will increase defense effectiveness. The case study demonstrates the feasibility of the proposed method.展开更多
文摘A new file assignment strategy of parallel I/O, which is named heuristic file sorted assignment algorithm was proposed on cluster computing system. Based on the load balancing, it assigns the files to the same disk according to the similar service time. Firstly, the files were sorted and stored at the set I in descending order in terms of their service time, then one disk of cluster node was selected randomly when the files were to be assigned, and at last the continuous files were taken orderly from the set I to the disk until the disk reached its load maximum. The experimental results show that the new strategy improves the performance by 20.2% when the load of the system is light and by 31.6% when the load is heavy. And the higher the data access rate, the more evident the improvement of the performance obtained by the heuristic file sorted assignment algorithm.
基金Supported by the National Natural Science Foundation of China(12061048)NSF of Jiangxi Province(20232BAB201026,20232BAB201018)。
文摘In this paper,a new technique is introduced to construct higher-order iterative methods for solving nonlinear systems.The order of convergence of some iterative methods can be improved by three at the cost of introducing only one additional evaluation of the function in each step.Furthermore,some new efficient methods with a higher-order of convergence are obtained by using only a single matrix inversion in each iteration.Analyses of convergence properties and computational efficiency of these new methods are made and testified by several numerical problems.By comparison,the new schemes are more efficient than the corresponding existing ones,particularly for large problem sizes.
基金supported by the Natural Science Foundation of Xinjiang Uygur Autonomous Region(Grant No.2022D01B 187)。
文摘Federated learning(FL)is a distributed machine learning paradigm for edge cloud computing.FL can facilitate data-driven decision-making in tactical scenarios,effectively addressing both data volume and infrastructure challenges in edge environments.However,the diversity of clients in edge cloud computing presents significant challenges for FL.Personalized federated learning(pFL)received considerable attention in recent years.One example of pFL involves exploiting the global and local information in the local model.Current pFL algorithms experience limitations such as slow convergence speed,catastrophic forgetting,and poor performance in complex tasks,which still have significant shortcomings compared to the centralized learning.To achieve high pFL performance,we propose FedCLCC:Federated Contrastive Learning and Conditional Computing.The core of FedCLCC is the use of contrastive learning and conditional computing.Contrastive learning determines the feature representation similarity to adjust the local model.Conditional computing separates the global and local information and feeds it to their corresponding heads for global and local handling.Our comprehensive experiments demonstrate that FedCLCC outperforms other state-of-the-art FL algorithms.
基金supported by the National Natural Science Foundation of China(61571149,62001139)the Initiation Fund for Postdoctoral Research in Heilongjiang Province(LBH-Q19098)the Natural Science Foundation of Heilongjiang Province(LH2020F0178).
文摘Fog computing has emerged as an important technology which can improve the performance of computation-intensive and latency-critical communication networks.Nevertheless,the fog computing Internet-of-Things(IoT)systems are susceptible to malicious eavesdropping attacks during the information transmission,and this issue has not been adequately addressed.In this paper,we propose a physical-layer secure fog computing IoT system model,which is able to improve the physical layer security of fog computing IoT networks against the malicious eavesdropping of multiple eavesdroppers.The secrecy rate of the proposed model is analyzed,and the quantum galaxy–based search algorithm(QGSA)is proposed to solve the hybrid task scheduling and resource management problem of the network.The computational complexity and convergence of the proposed algorithm are analyzed.Simulation results validate the efficiency of the proposed model and reveal the influence of various environmental parameters on fog computing IoT networks.Moreover,the simulation results demonstrate that the proposed hybrid task scheduling and resource management scheme can effectively enhance secrecy performance across different communication scenarios.
文摘This paper presents a kind of artificial intelligent system-generalized computing system (GCS for short), and introduces its mathematical description, implement problem and learning problem.
基金Project(61170049) supported by the National Natural Science Foundation of ChinaProject(2012AA010903) supported by the National High Technology Research and Development Program of China
文摘Peta-scale high-perfomlance computing systems are increasingly built with heterogeneous CPU and GPU nodes to achieve higher power efficiency and computation throughput. While providing unprecedented capabilities to conduct computational experiments of historic significance, these systems are presently difficult to program. The users, who are domain experts rather than computer experts, prefer to use programming models closer to their domains (e.g., physics and biology) rather than MPI and OpenME This has led the development of domain-specific programming that provides domain-specific programming interfaces but abstracts away some performance-critical architecture details. Based on experience in designing large-scale computing systems, a hybrid programming framework for scientific computing on heterogeneous architectures is proposed in this work. Its design philosophy is to provide a collaborative mechanism for domain experts and computer experts so that both domain-specific knowledge and performance-critical architecture details can be adequately exploited. Two real-world scientific applications have been evaluated on TH-IA, a peta-scale CPU-GPU heterogeneous system that is currently the 5th fastest supercomputer in the world. The experimental results show that the proposed framework is well suited for developing large-scale scientific computing applications on peta-scale heterogeneous CPU/GPU systems.
文摘Objective To establish a body composition analysis system based on chest CT,and to observe its value for evaluating content of chest muscle and adipose.Methods T7—T8 layer CT images of 108 pneumonia patients were collected(segmented dataset),and chest CT data of 984 patients were screened from the COVID 19-CT dataset(10 cases were randomly selected as whole test dataset,the remaining 974 cases were selected as layer selection dataset).T7—T8 layer was classified based on convolutional neural network(CNN)derived networks,including ResNet,ResNeXt,MobileNet,ShuffleNet,DenseNet,EfficientNet and ConvNeXt,then the accuracy,precision,recall and specificity were used to evaluate the performance of layer selection dataset.The skeletal muscle(SM),subcutaneous adipose tissue(SAT),intermuscular adipose tissue(IMAT)and visceral adipose tissue(VAT)were segmented using classical fully CNN(FCN)derived network,including FCN,SegNet,UNet,Attention UNet,UNET++,nnUNet,UNeXt and CMUNeXt,then Dice similarity coefficient(DSC),intersection over union(IoU)and 95 Hausdorff distance(HD)were used to evaluate the performance of segmented dataset.The automatic body composition analysis system was constructed based on optimal layer selection network and segmentation network,the mean absolute error(MAE),root mean squared error(RMSE)and standard deviation(SD)of MAE were used to evaluate the performance of automatic system for testing the whole test dataset.Results The accuracy,precision,recall and specificity of DenseNet network for automatically classifying T7—T8 layer from chest CT images was 95.06%,84.83%,92.27%and 95.78%,respectively,which were all higher than those of the other layer selection networks.In segmentation of SM,SAT,IMAT and overall,DSC and IoU of UNet++network were all higher,while 95HD of UNet++network were all lower than those of the other segmentation networks.Using DenseNet as the layer selection network and UNet++as the segmentation network,MAE of the automatic body composition analysis system for predicting SM,SAT,IMAT,VAT and MAE was 27.09,6.95,6.65 and 3.35 cm 2,respectively.Conclusion The body composition analysis system based on chest CT could be used to assess content of chest muscle and adipose.Among them,the UNet++network had better segmentation performance in adipose tissue than SM.
基金supported by the National Natural Science Foundation of China (60774091)
文摘Quorum systems have been used to solve the problem of data consistency in distributed fault-tolerance systems. But when intrusions occur, traditional quorum systems have some disadvantages. For example, synchronous quorum systems are subject to DOS attacks, while asynchronous quorum systems need a larger system size (at least 3f+1 for generic data, and f fewer for self-verifying data). In order to solve the problems above, an intrusion-tolerance quorum system (ITQS) of hybrid time model based on trust timely computing base is presented (TTCB). The TTCB is a trust secure real-time component inside the server with a well defined interface and separated from the operation system. It is in the synchronous communication environment while the application layer in the server deals with read-write requests and executes update-copy protocols asynchronously. The architectural hybridization of synchrony and asynchrony can achieve the data consistency and availability correctly. We also build two kinds of ITQSes based on TTCB, i.e., the symmetrical and the asymmetrical TTCB quorum systems. In the performance evaluations, we show that TTCB quorum systems are of smaller size, lower load and higher availability.
基金the support provided by the Royal Higher Institute for Defence (RHID) of the Belgian Defence, which has contributed to the progress of this ongoing research.
文摘A deep understanding of the internal ballistic process and the factors affecting it is of primary importance to efficiently design a gun system and ensure its safe management. One of the main goals of internal ballistics is to estimate the gas pressure into the combustion chamber and the projectile muzzle velocity in order to use the propellant to its higher efficiency while avoiding over-pressure phenomena. Dealing with the internal ballistic problem is a complex undertaking since it requires handling the interaction between different constituents during a transient time lapse with very steep rise of pressure and temperature. Several approaches have been proposed in the literature, based on different assumptions and techniques. Generally, depending on the used mathematical framework, they can be classified into two categories: computational fluid dynamics-based models and lumped-parameter ones. By focusing on gun systems, this paper offers a review of the main contributions in the field by mentioning their advantages and drawbacks. An insight into the limitations of the currently available modelling strategies is provided,as well as some considerations on the choice of one model over another. Lumped-parameter models, for example, are a good candidate for performing parametric analysis and optimisation processes of gun systems, given their minimum requirements of computer resources. Conversely, CFD-based models have a better capacity to address more sophisticated phenomena like pressure waves and turbulent flow effects. The performed review also reveals that too little attention has been given to small calibre guns since the majority of currently available models are conceived for medium and large calibre gun systems.Similarly, aspects like wear phenomena, bore deformations or projectile-barrel interactions still need to be adequately addressed and our suggestion is to dedicate more effort on it.
基金This work was supported by the National Key Research and Development Program of China(2021YFB2900603)the National Natural Science Foundation of China(61831008).
文摘A dynamic multi-beam resource allocation algorithm for large low Earth orbit(LEO)constellation based on on-board distributed computing is proposed in this paper.The allocation is a combinatorial optimization process under a series of complex constraints,which is important for enhancing the matching between resources and requirements.A complex algorithm is not available because that the LEO on-board resources is limi-ted.The proposed genetic algorithm(GA)based on two-dimen-sional individual model and uncorrelated single paternal inheri-tance method is designed to support distributed computation to enhance the feasibility of on-board application.A distributed system composed of eight embedded devices is built to verify the algorithm.A typical scenario is built in the system to evalu-ate the resource allocation process,algorithm mathematical model,trigger strategy,and distributed computation architec-ture.According to the simulation and measurement results,the proposed algorithm can provide an allocation result for more than 1500 tasks in 14 s and the success rate is more than 91%in a typical scene.The response time is decreased by 40%com-pared with the conditional GA.
文摘[Objective]Real-time monitoring of cow ruminant behavior is of paramount importance for promptly obtaining relevant information about cow health and predicting cow diseases.Currently,various strategies have been proposed for monitoring cow ruminant behavior,including video surveillance,sound recognition,and sensor monitoring methods.How‐ever,the application of edge device gives rise to the issue of inadequate real-time performance.To reduce the volume of data transmission and cloud computing workload while achieving real-time monitoring of dairy cow rumination behavior,a real-time monitoring method was proposed for cow ruminant behavior based on edge computing.[Methods]Autono‐mously designed edge devices were utilized to collect and process six-axis acceleration signals from cows in real-time.Based on these six-axis data,two distinct strategies,federated edge intelligence and split edge intelligence,were investigat‐ed for the real-time recognition of cow ruminant behavior.Focused on the real-time recognition method for cow ruminant behavior leveraging federated edge intelligence,the CA-MobileNet v3 network was proposed by enhancing the MobileNet v3 network with a collaborative attention mechanism.Additionally,a federated edge intelligence model was designed uti‐lizing the CA-MobileNet v3 network and the FedAvg federated aggregation algorithm.In the study on split edge intelli‐gence,a split edge intelligence model named MobileNet-LSTM was designed by integrating the MobileNet v3 network with a fusion collaborative attention mechanism and the Bi-LSTM network.[Results and Discussions]Through compara‐tive experiments with MobileNet v3 and MobileNet-LSTM,the federated edge intelligence model based on CA-Mo‐bileNet v3 achieved an average Precision rate,Recall rate,F1-Score,Specificity,and Accuracy of 97.1%,97.9%,97.5%,98.3%,and 98.2%,respectively,yielding the best recognition performance.[Conclusions]It is provided a real-time and effective method for monitoring cow ruminant behavior,and the proposed federated edge intelligence model can be ap‐plied in practical settings.
文摘Risk management often plays an important role in decision making un-der uncertainty.In quantitative risk management,assessing and optimizing risk metrics requires eficient computing techniques and reliable theoretical guarantees.In this pa-per,we introduce several topics on quantitative risk management and review some of the recent studies and advancements on the topics.We consider several risk metrics and study decision models that involve the metrics,with a main focus on the related com-puting techniques and theoretical properties.We show that stochastic optimization,as a powerful tool,can be leveraged to effectively address these problems.
基金Shanxi Province Higher Education Science and Technology Innovation Fund Project(2022-676)Shanxi Soft Science Program Research Fund Project(2016041008-6)。
文摘In order to improve the efficiency of cloud-based web services,an improved plant growth simulation algorithm scheduling model.This model first used mathematical methods to describe the relationships between cloud-based web services and the constraints of system resources.Then,a light-induced plant growth simulation algorithm was established.The performance of the algorithm was compared through several plant types,and the best plant model was selected as the setting for the system.Experimental results show that when the number of test cloud-based web services reaches 2048,the model being 2.14 times faster than PSO,2.8 times faster than the ant colony algorithm,2.9 times faster than the bee colony algorithm,and a remarkable 8.38 times faster than the genetic algorithm.
文摘As a non-contact ultra-precision machining method,abrasive water jet polishing(AWJP)has signi-ficant application in optical elements processing due to its stable tool influence function(TIF),no subsurface damage and strong adaptability to workpiece shapes.In this study,the effects of jet pressure,nozzle diameter and impinging angle on the distribution of pressure,velocity and wall shear stress in the polishing flow field were systematically analyzed by computational fluid dynamics(CFD)simulation.Based on the Box-Behnken experimental design,a response surface regression model was constructed to investigate the influence mech-anism of process parameters on material removal rate(MRR)and surface roughness(Ra)of fused silica.And experimental results showed that increasing jet pressure and nozzle diameter significantly improved MRR,consistent with shear stress distribution revealed by CFD simulations.However,increasing jet pressure and impinging angle caused higher Ra values,which was unfavorable for surface quality improvement.Genetic algorithm(GA)was used for multi-objective optimization to establish Pareto solutions,achieving concurrent optimization of polishing efficiency and surface quality.A parameter combination of 2 MPa jet pressure,0.3 mm nozzle diameter,and 30°impinging angle achieved MRR of 169.05μm^(3)/s and Ra of 0.50 nm.Exper-imental verification showed prediction errors of 4.4%(MRR)and 3.8%(Ra),confirming the model’s reliabil-ity.This parameter optimization system provides theoretical basis and technical support for ultra-precision polishing of complex curved optical components.
基金National Natural Science Foundation of China(No.12574332)the Space Optoelectronic Measurement and Perception Lab.,Beijing Institute of Control Engineering(No.LabSOMP-2023-10)Major Science and Technology Innovation Program of Xianyang City(No.L2024-ZDKJ-ZDCGZH-0021)。
文摘Fourier Ptychographic Microscopy(FPM)is a high-throughput computational optical imaging technology reported in 2013.It effectively breaks through the trade-off between high-resolution imaging and wide-field imaging.In recent years,it has been found that FPM is not only a tool to break through the trade-off between field of view and spatial resolution,but also a paradigm to break through those trade-off problems,thus attracting extensive attention.Compared with previous reviews,this review does not introduce its concept,basic principles,optical system and series of applications once again,but focuses on elaborating the three major difficulties faced by FPM technology in the process from“looking good”in the laboratory to“working well”in practical applications:mismatch between numerical model and physical reality,long reconstruction time and high computing power demand,and lack of multi-modal expansion.It introduces how to achieve key technological innovations in FPM through the dual drive of Artificial Intelligence(AI)and physics,including intelligent reconstruction algorithms introducing machine learning concepts,optical-algorithm co-design,fusion of frequency domain extrapolation methods and generative adversarial networks,multi-modal imaging schemes and data fusion enhancement,etc.,gradually solving the difficulties of FPM technology.Conversely,this review deeply considers the unique value of FPM technology in potentially feeding back to the development of“AI+optics”,such as providing AI benchmark tests under physical constraints,inspirations for the balance of computing power and bandwidth in miniaturized intelligent microscopes,and photoelectric hybrid architectures.Finally,it introduces the industrialization path and frontier directions of FPM technology,pointing out that with the promotion of the dual drive of AI and physics,it will generate a large number of industrial application case,and looks forward to the possibilities of future application scenarios and expansions,for instance,body fluid biopsy and point-of-care testing at the grassroots level represent the expansion of the growth market.
基金supported by the National Natural Science Foundation of China(Grant Nos.52425211,52272360,and 52472394)Chongqing Natural Science Foundation(CSTB2023NSCQ-MSX0300)。
文摘Trans-medium flight vehicles can combine high aerial maneuverability and underwater concealment ability,which have attracted much attention recently.As the most crucial procedure,the trajectory design generally determines the trans-medium flight vehicle performance.To quantitatively analyze the flight vehicle performance,an entire aerial-aquatic trajectory model is developed in this paper.Different from modeling a trajectory purely for the water entry process,the constructed entire trajectory model has integrated aerial,water entry,and underwater trajectories together,which can consider the influence of the connected trajectories.As for the aerial and underwater trajectories,explicit dynamic models are established to obtain the trajectory parameters.Due to the complicated fluid force during high-velocity water entry,a computational fluid dynamics model is investigated to analyze this phase.The compu-tational domain size is adaptively refined according to the final aerial trajectory state,where the redundant computational domain is removed.An entire trajectory optimization problem is then formulated to maximize the total flight range via tuning the joint states of different trajectories.Simultaneously,several constraints,i.e.,the max impact load,trajectory height,etc.,are involved in the optimization problem.Rather than directly optimizing by a heuristic algorithm,a multi-surrogate cooperative sampling-based optimization method is proposed to alleviate the computational complexity of the entire trajectory optimization problem.In this method,various surrogates coopera-tively generate infill sample points,thereby preventing the poor approximation.After optimization,the total flight range can be improved by 20%,while all the constraints are satisfied.The result demonstrates the effectiveness and practicability of the developed model and optimization framework.
基金Projects(51925808,52078504,51822803) supported by the National Natural Science Foundation of ChinaProject(2022JJ10082) supported by the Natural Science Foundation of Hunan Province,China+1 种基金Project(N2022Z004) supported by the Research on Technology Development Trend and Key Common Problems in Railway,ChinaProject(Xplorer Prize 2021) supported by the Tencent Foundation,China。
文摘Installing the splitter plates is a passive aerodynamic solution for eliminating vortex-induced vibration (VIV). However, the influences of splitter plates on the VIV and aerostatic performances are more complicated due to aerodynamic interference between highway and railway decks. To study the effects of splitter plates, wind tunnel experiments for measuring VIV and aerostatic forces of twin decks under two opposite flow directions were conducted, while the surrounding flow and wind pressure of static twin decks with and without splitter plates are numerically simulated. The results showed that the incoming flow direction affects the VIV response and aerostatic coefficients. The highway deck has poor vertical and torsional VIV, and the VIV region and amplitude are different under different directions. While the railway deck only has vertical VIV when located upstream. The splitter plates can impede the process of vortex generation, shedding and impinging at the gap between twin deck, and significantly reducing the surface fluctuating pressure coefficient, thus effectively suppressing the VIV of twin decks. While, the splitter plates hurt the upstream deck regarding static wind stability and have little effect on the downstream deck. The splitter plates of appropriate width are recommended to improve VIV performances in twin parallel bridges.
文摘Equipment systems-of-systems (SOS) effectiveness evaluation can provide important reference for construction and optimization of the equipment SoS. After discussing the basic theory and methods of parallel experiments, we depict an SoS effectiveness analysis and evaluation method using parallel expe- riments theory in detail. A case study is carried out which takes the missile defense system as an example. An artificial system of the missile defense system is constructed with the multi-agent modeling method. Then, single factor, multiple factors and defense position deployment computational experiments are carried out and evaluated with the statistical analysis method. Experiment re- sults show that the altitude of the secondary interception missile is not the key factor which affects SoS effectiveness and putting the defense position ahead will increase defense effectiveness. The case study demonstrates the feasibility of the proposed method.