1,691 articles found
GPIC: A GPU-based parallel independent cascade algorithm in complex networks
1
Authors: Chang Su, Xu Na, Fang Zhou, Linyuan Lü. Chinese Physics B, 2025, No. 3, pp. 20-30 (11 pages)
Independent cascade (IC) models, by simulating how one node can activate another, are important tools for studying the dynamics of information spreading in complex networks. However, traditional algorithms for the IC model implementation face significant efficiency bottlenecks when dealing with large-scale networks and multi-round simulations. To settle this problem, this study introduces a GPU-based parallel independent cascade (GPIC) algorithm, featuring an optimized representation of the network data structure and parallel task scheduling strategies. Specifically, for this GPIC algorithm, we propose a network data structure tailored for GPU processing, thereby enhancing the computational efficiency and the scalability of the IC model. In addition, we design a parallel framework that utilizes the full potential of the GPU's parallel processing capabilities, thereby augmenting the computational efficiency. The results from our simulation experiments demonstrate that GPIC not only preserves accuracy but also significantly boosts efficiency, achieving a speedup factor of 129 when compared to the baseline IC method. Our experiments also reveal that when using GPIC for the independent cascade simulation, 100-200 simulation rounds are sufficient for higher-cost studies, while high precision studies benefit from 500 rounds to ensure reliable results, providing empirical guidance for applying this new algorithm to practical research.
Keywords: complex networks; information spreading; independent cascade model; parallel computing; GPU
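For orientation, the independent cascade model that the abstract above refers to can be sketched as a plain serial Monte Carlo simulation. This is not the GPIC algorithm or its GPU data structure; the function names, the uniform activation probability `p`, and the adjacency-dictionary layout are assumptions made for illustration only.

```python
import random

def simulate_ic(adj, seeds, p, rng):
    """Run one independent cascade and return the set of activated nodes."""
    active = set(seeds)
    frontier = list(seeds)
    while frontier:
        next_frontier = []
        for u in frontier:
            for v in adj.get(u, ()):
                # A newly active node gets a single chance to activate
                # each still-inactive neighbor, with probability p.
                if v not in active and rng.random() < p:
                    active.add(v)
                    next_frontier.append(v)
        frontier = next_frontier
    return active

def estimate_spread(adj, seeds, p, rounds, seed=0):
    """Average the cascade size over independent simulation rounds."""
    rng = random.Random(seed)
    total = sum(len(simulate_ic(adj, seeds, p, rng)) for _ in range(rounds))
    return total / rounds
```

The rounds are independent of one another, which is the property a parallel implementation can exploit; the `rounds` argument corresponds to the simulation-round counts discussed in the abstract.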
Efficient Task Completion for Parallel Offloading in Vehicular Fog Computing (cited by: 7)
2
Authors: Jindou Xie, Yunjian Jia, Zhengchuan Chen, Zhaojun Nan, Liang Liang. China Communications (SCIE, CSCD), 2019, No. 11, pp. 42-55 (14 pages)
In this paper, we investigate a vehicular fog computing system and develop an effective parallel offloading scheme. The service time, which accounts for task offloading delay, task decomposition, and handover cost, is adopted as the metric of offloading performance. We propose an available-resource-aware parallel offloading scheme in which the RSU selects target fog nodes for computation offloading, jointly considering the effect of vehicle mobility and time-varying computation capability. Based on hidden Markov model and Markov chain theories, the proposed scheme effectively handles imperfect system state information in fog node selection by jointly achieving mobility awareness and computation perception. Simulation results are presented to corroborate the theoretical analysis and validate the effectiveness of the proposed algorithm.
Keywords: parallel offloading; vehicular fog computing; task offloading; HMM
Parallel Computing of the Underwater Explosion Cavitation Effects on Full-scale Ship Structures (cited by: 7)
3
Authors: Zhi Zong, Yanjie Zhao, Fan Ye, Haitao Li, Gang Chen. Journal of Marine Science and Application, 2012, No. 4, pp. 469-477 (9 pages)
As well as shock wave and bubble pulse loading, cavitation also has a very significant influence on the dynamic response of surface ships and other near-surface marine structures to underwater explosive loading. In this paper, the acoustic-structure coupling method embedded in ABAQUS is adopted for numerical analysis of underwater explosion considering cavitation. The shapes of both the bulk cavitation region and the local cavitation region are obtained, and they are in good agreement with analytical results. The duration of reloading is several times longer than that of a shock wave. Finally, both the single computation and the parallel computation of the cavitation effect on the dynamic response of a full-scale ship are presented, which show that reloading caused by cavitation cannot be ignored. All these results are helpful in understanding underwater explosion cavitation effects.
Keywords: underwater explosion; cavitation; parallel computation; full-scale ship
High-throughput computational screening of Cu-MOFs with open metal sites for efficient C₂H₂/C₂H₄ separation (cited by: 6)
4
Authors: Lei Liu, Lei Wang, Dahuan Liu, Qingyuan Yang, Chongli Zhong. Green Energy & Environment (SCIE, CSCD), 2020, No. 3, pp. 333-340 (8 pages)
Cost-effective separation of acetylene (C₂H₂) and ethylene (C₂H₄) is of key importance to obtain essential chemical raw materials for the polymer industry. Due to the low compression limit of C₂H₂, there is an urgent demand to develop suitable materials for efficiently separating the two gases under ambient conditions. In this paper, we provide a high-throughput screening strategy to study porous metal-organic frameworks (MOFs) containing open metal sites (OMS) for C₂H₂/C₂H₄ separation, followed by a rational in-silico design of novel MOFs. A set of accurate force fields was established from ab initio calculations to describe the critical role of OMS towards guest molecules. From a large-scale computational screening of 916 experimental Cu-paddlewheel-based MOFs, three materials were identified with excellent separation performance. The structure-performance relationships revealed that the optimal materials should have a largest cavity diameter of around 5-10 Å and a pore volume between 0.3 and 1.0 cm³ g⁻¹. Based on the systematic screening results, three novel MOFs were further designed with the incorporation of a fluorine functional group. The results showed that Cu-OMS and the -F group on the aromatic rings close to Cu sites could generate a synergistic effect on the preferential adsorption of C₂H₂ over C₂H₄, leading to a remarkable improvement in the C₂H₂ separation performance of the materials. The findings could provide insight for future experimental design and synthesis of high-performance nanostructured materials for C₂H₂/C₂H₄ separation.
Keywords: acetylene and ethylene; metal-organic frameworks; open metal sites; large-scale computation; materials design
Parallel computing approach for efficient 3-D X-ray-simulated image reconstruction (cited by: 1)
5
Authors: Ou-Yi Li, Yang Wang, Qiong Zhang, Yong-Hui Li. Nuclear Science and Techniques (SCIE, EI, CAS, CSCD), 2023, No. 7, pp. 122-136 (15 pages)
Accurate 3-dimensional (3-D) reconstruction technology for nondestructive testing based on digital radiography (DR) is of great importance for alleviating the drawbacks of the existing computed tomography (CT)-based method. The commonly used Monte Carlo simulation method ensures well-performing imaging results for DR. However, for 3-D reconstruction, it is limited by its high time consumption. To solve this problem, this study proposes a parallel computing method to accelerate Monte Carlo simulation for projection images with a parallel interface and a specific DR application. The images are utilized for 3-D reconstruction of the test model. We verify the accuracy of parallel computing for DR and evaluate the performance of two parallel computing modes, multithreaded applications (G4-MT) and message-passing interfaces (G4-MPI), by assessing parallel speedup and efficiency. This study explores the scalability of the hybrid G4-MPI and G4-MT modes. The results show that the two parallel computing modes can significantly reduce the Monte Carlo simulation time because the parallel speedup increment of Monte Carlo simulations can be considered linear growth, and the parallel efficiency is maintained at a high level. The hybrid mode has strong scalability, as the overall run time of the 180 simulations using 320 threads is 15.35 h with 10 billion particles emitted, and the parallel speedup can be up to 151.36. The 3-D reconstruction of the model is achieved based on the filtered back projection (FBP) algorithm using 180 projection images obtained with the hybrid G4-MPI and G4-MT. The quality of the reconstructed sliced images is satisfactory because the images can reflect the internal structure of the test model. This method is applied to a complex model, and the quality of the reconstructed images is evaluated.
Keywords: parallel computing; Monte Carlo; digital radiography; 3-D reconstruction
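The abstract above evaluates its parallel modes by speedup and efficiency. A minimal sketch of the standard definitions follows (speedup as serial time over parallel time, efficiency as speedup per worker). The function names are illustrative, and pairing the reported speedup of 151.36 with the 320-thread run is an assumption of this sketch, since the abstract does not state it explicitly.

```python
def speedup(serial_time, parallel_time):
    """Parallel speedup S = T_serial / T_parallel."""
    return serial_time / parallel_time

def efficiency(speedup_value, workers):
    """Parallel efficiency E = S / p for p workers."""
    return speedup_value / workers

# Figures quoted in the abstract; treating them as one configuration
# is an assumption made here for illustration.
reported_speedup = 151.36
threads = 320
print(round(efficiency(reported_speedup, threads), 3))
```

Under that assumption the efficiency comes out a little under one half.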
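Study of a GPU-based parallel computing method for the Monte Carlo program (cited by: 2)
6
Authors: 罗志飞, 邱睿, 李明, 武祯, 曾志, 李君利. Nuclear Science and Techniques (SCIE, CAS, CSCD), 2014, No. A01, pp. 27-30 (4 pages)
Keywords: parallel computing method; Monte Carlo program; GPU; GEANT4; simulation program; Monte Carlo method; parallel processing capability; graphics processing unit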
Parallel-Computing Wavelet-Based FDTD Method for Modeling Nanoscale Optical Resonator
7
Authors: 蒋锡燕, 王瑾, 陆云清, 许吉. Transactions of Nanjing University of Aeronautics and Astronautics (EI), 2014, No. 3, pp. 260-268 (9 pages)
An efficient wavelet-based finite-difference time-domain (FDTD) method is implemented for analyzing nanoscale optical devices, especially optical resonators. Because of its highly linear numerical dispersion properties, the high-spatial-order FDTD achieves a significant reduction in the number of cells, i.e., in memory use, when analyzing a high-index dielectric ring resonator working as an add/drop multiplexer. The main novelty is that the wavelet-based FDTD model is extended to a parallel computation environment to solve physical problems with large dimensions. To demonstrate the efficiency of the parallelized FDTD model, a mirrored cavity is analyzed. The analysis shows that the proposed model reduces computation time and memory cost, and the parallel computation result matches the theoretical model.
Keywords: integrated optics; electromagnetic field analysis; finite difference time domain (FDTD); wavelet; parallel computation
Towards sparse matrix operations: graph database approach for power grid computation
8
Authors: Daoxing Li, Kai Xiao, Xiaohui Wang, Pengtian Guo, Yong Chen. Global Energy Interconnection (EI, CAS, CSCD), 2023, No. 1, pp. 50-63 (14 pages)
The construction of new power systems presents higher requirements for the Power Internet of Things (PIoT) technology. The "source-grid-load-storage" architecture of a new power system requires PIoT to have a stronger multi-source heterogeneous data fusion ability. Native graph databases have great advantages in dealing with multi-source heterogeneous data, which make them suitable for an increasing number of analytical computing tasks. However, only a few existing graph database products have native support for matrix operation-related interfaces or functions, resulting in low efficiency when handling matrix calculations that are commonly encountered in power grids. In this paper, the matrix computation process is expressed by a strategy called graph description, which relies on the natural connection between the matrix and the structure of the graph. Based on that, we implement matrix operations on a graph database, including matrix multiplication, matrix decomposition, etc. Specifically, only the nodes relevant to the computation and their neighbors are involved in the process, which prunes the influence of zero elements in the matrix and avoids useless iterations compared to conventional matrix computation. Based on the graph description, a series of power grid computations can be implemented on a graph database, which reduces redundant data import and export operations while leveraging the parallel computing capability of the graph database. It promotes the efficiency of PIoT when handling multi-source heterogeneous data. A comprehensive experimental study over two power system datasets of different scales compares the proposed method with Python and MATLAB baselines. The results reveal the superior performance of our proposed method in both power flow and N-1 contingency computations.
Keywords: graph database; graph description; matrix; parallel computing; power flow
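The idea in the abstract above, that a sparse matrix can be handled as a graph so that only nodes and their neighbors are visited, can be illustrated with a small matrix-vector product. This is a generic sketch and not the paper's graph description strategy or any graph database API; the `edges` layout (row node mapped to its nonzero neighbors) and the function name are assumptions.

```python
def matvec_graph(edges, x):
    """Compute y = A @ x with A stored as a graph.

    edges[i] maps each neighbor j of node i to the nonzero entry a_ij.
    Rows with no stored neighbors are all-zero rows.
    """
    y = [0.0] * len(x)
    for i, neighbors in edges.items():
        # Only stored (nonzero) entries are visited; zeros cost nothing.
        y[i] = sum(a_ij * x[j] for j, a_ij in neighbors.items())
    return y

# A = [[2, 0, 1],
#      [0, 3, 0],
#      [0, 0, 0]]
edges = {0: {0: 2.0, 2: 1.0}, 1: {1: 3.0}}
print(matvec_graph(edges, [1.0, 2.0, 3.0]))  # [5.0, 6.0, 0.0]
```

Each row's sum depends only on that node's neighbors, so rows can be processed independently, which is where a parallel engine would get its concurrency.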
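Design of an FPGA-based MobileNetV1 object detection accelerator
9
Authors: 严飞, 郑绪文, 孟川, 李楚, 刘银萍. 《现代电子技术》 (PKU Core), 2025, No. 1, pp. 151-156 (6 pages)
Convolutional neural networks are widely used for object detection, but their large parameter counts and computational cost make detection slow and power-hungry and make deployment on hardware platforms difficult. This paper therefore proposes a method that accelerates MobileNetV1 object detection on a combined CPU-FPGA architecture. First, the width and resolution hyperparameters are set and the network parameters are converted to fixed point to reduce the model's parameter count and computation. Second, the convolution and batch normalization layers are fused to reduce network complexity and speed up computation. Third, an eight-channel inter-kernel parallel convolution engine is designed, in which each channel performs convolution using line-buffer multiplication and an adder-tree structure. Finally, exploiting FPGA parallelism and pipelining, the eight-channel engine is reused to carry out three different types of convolution, which reduces hardware resource usage and power consumption. Experimental results show that the design accelerates MobileNetV1 object detection in hardware, reaching a frame rate of 56.7 f/s at a power consumption of only 0.603 W.
Keywords: convolutional neural network; object detection; FPGA; MobileNetV1; parallel computing; hardware acceleration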
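The fusion of convolution and batch normalization layers mentioned in the abstract above is commonly done by folding the normalization into the convolution weights and bias. The sketch below shows that standard folding in plain Python; it is not taken from the paper, and the function name, the flattened-kernel layout, and the default `eps` are assumptions.

```python
import math

def fold_batchnorm(weights, biases, gamma, beta, mean, var, eps=1e-5):
    """Fold per-channel batch normalization into convolution parameters.

    weights[c] is the flattened kernel of output channel c. For a
    convolution output z, batch norm computes
    gamma * (z - mean) / sqrt(var + eps) + beta, which equals a
    convolution with scaled weights and an adjusted bias.
    """
    folded_w, folded_b = [], []
    for c, kernel in enumerate(weights):
        scale = gamma[c] / math.sqrt(var[c] + eps)
        folded_w.append([w * scale for w in kernel])
        folded_b.append((biases[c] - mean[c]) * scale + beta[c])
    return folded_w, folded_b
```

After folding, inference needs one multiply-accumulate pass per layer instead of a convolution followed by a separate normalization step.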
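Parallel computation for the disturbance response instability theory of rockbursts
10
Authors: 潘一山, 王学滨, 郑一方, 陈双印. 《煤炭学报》 (PKU Core), 2025, No. 1, pp. 81-91 (11 pages)
Theoretical research on rockbursts has now moved from qualitative to quantitative analysis. Calculating the critical stress of the rock surrounding a roadway is an important basis for evaluating roadway safety. Because rockburst problems are extremely complex, further theoretical breakthroughs are very difficult. Critical stress calculations based on theoretical formulas cannot account for more complex real conditions such as non-circular roadways, non-hydrostatic pressure, and complex strata structures. Combining rockburst theory with numerical computation has broader application prospects and can bring the theory closer to practical use, which makes it a highly valuable direction. Progress here depends on the rapid development of numerical computation techniques. This study combines StrataKing, an advanced parallel computing system for strata movement (a self-developed GPU-parallel nonlinear fracture mechanics method based on coupling the Lagrangian element method with the discrete element method), with the disturbance response instability theory of rockbursts, and proposes for the first time a numerical simulation method for the disturbance response instability theory of circular roadways. The idea is to treat the mode II fracture energy in the nonlinear fracture mechanics analysis as an intermediate variable, thereby establishing the relationship between the critical stress of the rock surrounding a circular roadway under hydrostatic pressure and the bursting energy index. To obtain numerical values of the bursting energy index, uniaxial compression tests are simulated on idealized rock specimens that develop only a single shear plane, which excludes the influence of other factors on the slope of the nearly linear post-peak part of the stress-strain curve. For high-angle shear fracture, a conversion method is proposed that maps results for non-standard specimens to those for standard specimens. The converted bursting energy index ranges from 0.17 to 13.52, within the survey data from 131 rockburst-prone mines nationwide. The computed critical stress is 0.4 to 2.5 times the theoretical value, which agrees qualitatively with survey data from 20 rockburst-prone mines nationwide (correction factors for critical stress are generally greater than 1 and can approach 8); this is explained by the higher bearing capacity of surrounding rock that fails by localization compared with rock that fails uniformly. The relationship between rockbursts and localization has been discussed before, but the disturbance response instability theory had not previously been connected to localization. Through localization, the theory and rockbursts become closely linked in terms of failure mechanism. StrataKing can provide strong computational support for roadway safety evaluation in rockburst-prone mines.
Keywords: rockburst; quantitative analysis; disturbance response instability theory; bursting energy index; localization; parallel computing; critical stress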
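A survey of computational intelligence based on parallel computing
11
Authors: 吴菲, 陈嘉诚, 王万良. 《浙江大学学报(工学版)》 (PKU Core), 2025, No. 1, pp. 27-38 (12 pages)
Traditional computational intelligence techniques lack real-time performance and adaptability, whereas computational intelligence based on parallel computing can improve computational efficiency and address the compatible processing of multimodal information. The current state of research on combining computational intelligence with big-data parallel computing is reviewed for three branches of intelligent computing: neural networks, evolutionary algorithms, and swarm intelligence algorithms. The problems and challenges facing parallel computational intelligence are summarized, and directions for future research are considered.
Keywords: parallel computing; computational intelligence; neural networks; evolutionary algorithms; swarm intelligence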
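A parallel AVS3 intra prediction algorithm based on minimum-CU cost
12
Authors: 张全, 王顺, 刘洋毅, 段昶, 彭博. 《电信科学》 (PKU Core), 2025, No. 2, pp. 57-67 (11 pages)
To address the time cost of intra prediction in the third-generation audio video coding standard (AVS3), a parallel intra prediction algorithm based on the cost of the smallest coding unit (CU) is proposed. First, the image is partitioned into smallest CUs. Next, using the original pixels as reference, the intra mode costs of all smallest CUs are computed in parallel. Finally, the intra mode priorities of the other CUs are computed quickly by combining these costs, and the best 15 modes are passed to the rough mode decision (RMD) stage. In addition, three optimization strategies are proposed to reduce the error the method introduces: the original pixels are preprocessed before prediction so that they better match the reconstructed pixels; the intra prediction cost function is modified to estimate the priority of each mode more accurately; and large CUs use the top-level CU cost as reference to reduce the error accumulated by combining CUs. Experimental results show that, with the bit rate dropping by only 0.35%, the overall encoding time is reduced by 27%, effectively cutting the time spent on intra prediction while maintaining coding quality.
Keywords: AVS3; intra prediction; parallel computing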
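Design of a MobileNetV3 network accelerator based on a dynamically adaptive computing engine
13
Authors: 项浩斌, 杨瑞敏, 吴文涛, 李春雷, 董燕. 《电子技术应用》, 2025, No. 1, pp. 8-17 (10 pages)
Existing acceleration methods for the efficient, lightweight MobileNetV3 network usually rely on highly customized computing engines, which limits accelerator scalability and makes them suitable only for small networks or resource-rich hardware platforms. To address this, a MobileNetV3 accelerator based on a dynamically adaptive computing engine is proposed. First, a pipelined inference architecture with locally aware regional convolution is designed to achieve highly parallel processing and buffer scheduling of features and weights. Second, a globally adaptive pointwise convolution method is proposed to optimize pointwise convolution, combined with space exploration to find the parameter configuration that maximizes computational parallelism. In addition, the accelerator can be reconfigured dynamically as model parameters change, to suit different scenarios. Experimental results show an inference speed of 8 F/s, 2.7 times that of existing methods.
Keywords: convolutional neural network; parallel computing; dynamic adaptation; edge devices; hardware acceleration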
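Digital real-time simulation of radar seekers on multi-core CPUs
14
Authors: 苏灏杨, 夏伟杰, 吴雪, 王宇. 《遥测遥控》, 2025, No. 2, pp. 92-99 (8 pages)
Radar seeker simulation systems play an important role in enabling a seeker to strike its target precisely. As simulation systems grow more complex and data processing demands increase, traditional serial simulation can no longer meet the strict real-time requirements of digital radar seeker simulation. To address the long run times of existing radar seeker simulations, this paper proposes an end-to-end digital real-time simulation method. First, the core stages of the traditional end-to-end simulation architecture (receiving control system commands, receiving simulated echo data, SAR (synthetic aperture radar) imaging, and uploading imaging results with dynamic interface updates) are pipelined and run in parallel. Second, the OpenMP (Open Multi-Processing) multi-core parallel model is used to parallelize the main steps of the SAR imaging algorithm across CPU (central processing unit) cores. Third, the high-performance numerical library FFTW3 (Fastest Fourier Transform in the West, version 3) is used to compute the Fourier transforms in the imaging algorithm, speeding up SAR imaging. Simulation results show that, compared with traditional serial simulation, this end-to-end design achieves a speedup of about 100 times, while the similarity between SAR images before and after acceleration is close to 1. With processing accuracy and results unchanged, the method achieves end-to-end real-time simulation of the radar seeker system and has good prospects for engineering use.
Keywords: radar seeker; SAR simulation; multi-core CPU; parallel computing; real-time simulation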
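Strata movement simulation based on the StrataKing parallel computing system for strata movement
15
Authors: 王学滨, 余保健, 李小帅, 张钦杰, 郑一方. 《矿业科学学报》 (PKU Core), 2025, No. 2, pp. 214-225 (12 pages)
At present, strata movement simulation in China's coal industry relies mainly on foreign general-purpose commercial software, which suffers from low computational efficiency, small problem sizes, and a lack of mining-specific features. Built on the self-developed StrataKing parallel computing system for strata movement, GPU parallelism is used to raise computational efficiency and problem size, with the number of elements reaching 10 times that of foreign general-purpose commercial software. New capabilities in StrataKing include the bulking of caved rock, gradual unloading after coal seam and roadway excavation, and backfilling that follows mining under support; these are used to study strata movement in the 110 mining method, longwall mining, and backfill mining, respectively. The results show that, for the 110 method, the maximum subsidence of the retained left roadway is 0.103 m, which is broadly consistent with field results. For longwall mining, the coal within a certain range on both sides of the working face cracks and stress shifts farther away, so driving a roadway there favors roadway maintenance; the distance between the lower boundary of the bending subsidence zone and the lower surface of the coal seam is broadly consistent with physical similarity simulation results. For backfill mining, the maximum subsidence of the model's upper surface is 56.52% lower than in the unfilled model, its ratio to mining height falls by 55.55%, and several strata crack without breaking.
Keywords: coal mine; strata movement; parallel computing; cloud computing; mining method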
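A sparsity-based parallel optimization method for wideband beamforming
16
Authors: 常桂林, 刘霖, 赵麒瑞. 《电子设计工程》, 2025, No. 4, pp. 160-164, 169 (6 pages)
To address the high hardware cost and poor timeliness of wideband beamforming in miniaturized phased array designs, a sparsity-based parallel optimization method for wideband beamforming is proposed. The method introduces a quadratic term and a regularization parameter to quickly reach sparse solution coefficients that satisfy the residual requirement. It further exploits discretization to split the computation that combines the frequency-invariant optimal response with sparse array element optimization into independent tasks, enabling parallelization. The beamforming performance corresponding to the computed transversal filter coefficients has stable low sidelobes and constant beamwidth. Simulations verify the effectiveness of the proposed method, which has strong value for engineering applications.
Keywords: phased array; wideband beamforming; low complexity; parallel computing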
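Design and implementation of a parallel K-means algorithm on a heterogeneous big data platform
17
Authors: 张适显, 黄万兵, 熊文. 《无线互联科技》, 2025, No. 4, pp. 88-91, 119 (5 pages)
The K-means algorithm is a basic tool for cluster analysis in data mining and machine learning and is widely used in areas such as document clustering and outlier detection. With the arrival of the big data era, however, traditional methods struggle to meet the processing demands of large-scale cluster analysis. This paper therefore builds a heterogeneous big data platform on Spark and GPUs and designs and implements a parallel K-means algorithm on it to improve data processing efficiency and resource utilization. The method is validated on four public real-world datasets; compared with traditional parallel K-means methods, the experimental results show that it performs better.
Keywords: parallel computing; heterogeneous computing; big data technology; data mining; K-means algorithm; cluster analysis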
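One common way to parallelize K-means, which may or may not match the design in the paper above, is to compute per-partition partial sums and then merge them. The sketch below runs that map-and-reduce structure serially in plain Python; it does not use Spark or a GPU, and all function names and the partition layout are assumptions for illustration.

```python
def assign(point, centroids):
    """Return the index of the nearest centroid (squared Euclidean distance)."""
    return min(
        range(len(centroids)),
        key=lambda k: sum((p - c) ** 2 for p, c in zip(point, centroids[k])),
    )

def partial_sums(partition, centroids):
    """Map step: per-cluster coordinate sums and counts for one partition."""
    dim = len(centroids[0])
    sums = [[0.0] * dim for _ in centroids]
    counts = [0] * len(centroids)
    for point in partition:
        k = assign(point, centroids)
        counts[k] += 1
        for d, value in enumerate(point):
            sums[k][d] += value
    return sums, counts

def kmeans_step(partitions, centroids):
    """Reduce step: merge partial results and recompute the centroids."""
    dim = len(centroids[0])
    total = [[0.0] * dim for _ in centroids]
    counts = [0] * len(centroids)
    for sums, cnt in (partial_sums(part, centroids) for part in partitions):
        for k in range(len(centroids)):
            counts[k] += cnt[k]
            for d in range(dim):
                total[k][d] += sums[k][d]
    # A cluster that received no points keeps its previous centroid.
    return [
        [s / counts[k] for s in total[k]] if counts[k] else list(centroids[k])
        for k in range(len(centroids))
    ]
```

Because `partial_sums` touches only its own partition, each call could run on a separate worker, with only the small sums and counts exchanged in the reduce step.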
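CFD numerical simulation of inlet piping layout design for parallel LNG high-pressure pumps
18
Author: 欧阳文婷. 《石化技术》, 2025, No. 3, pp. 13-15 (3 pages)
To lay out the piping for parallel high-pressure pumps at a liquefied natural gas (LNG) receiving terminal sensibly, and to prevent large flow maldistribution and pressure fluctuations that cause pipe vibration, fluid impact, noise, and unstable operating conditions, a numerical simulation approach based on computational fluid dynamics (CFD) is used. A three-dimensional model of the piping network is built and meshed, and 3-D CFD simulations under typical operating conditions are used to assess the degree of maldistribution and the flow field. The study shows that the maldistribution among the parallel high-pressure pumps matches actual operation: the pump inlet nearest the inlet header receives the largest flow, but the difference stays within the allowable range. A properly designed π-bend does not noticeably cause pressure fluctuations or uneven flow distribution and is not the main cause of suction difficulties at the high-pressure pumps. CFD can be used for 3-D simulation of piping layout designs, providing a theoretical basis for later design optimization and troubleshooting.
Keywords: LNG receiving terminal; parallel high-pressure pumps; computational fluid dynamics; numerical simulation; piping layout design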
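Accelerating image processing with DirectCompute (cited by: 1)
19
Authors: 张益男, 袁杰. 《现代电子技术》, 2012, No. 22, pp. 55-58 (4 pages)
Modern image processing, especially data processing built on large volumes of intensive scientific computation, needs highly parallel processing to achieve acceptable computational efficiency. Processing speed, however, depends largely on how the specific algorithm interacts with the system implementation, which places demands on the parallel programming skill and experience of the people doing the work. DirectCompute is adopted to carry out the parallel processing and increase its benefit. For the latency of corner matching computation in an engineering project, the computation is parallelized using the available hardware resources, which improves program execution efficiency and shortens computation latency.
Keywords: 3-D image reconstruction; parallel computing; GPU; DirectCompute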
Fast parallel Grad–Shafranov solver for real-time equilibrium reconstruction in EAST tokamak using graphic processing unit (cited by: 1)
20
Authors: 黄耀, 肖炳甲, 罗正平. Chinese Physics B (SCIE, EI, CAS, CSCD), 2017, No. 8, pp. 276-283 (8 pages)
To achieve real-time control of tokamak plasmas, the equilibrium reconstruction has to be completed sufficiently quickly. For an EAST tokamak experiment, real-time equilibrium reconstruction is generally required to provide results within 1 ms. A graphics processing unit (GPU) parallel Grad–Shafranov (G-S) solver is developed in the P-EFIT code, which is built with the CUDA™ architecture to take advantage of massively parallel GPU cores and significantly accelerate the computation. Optimization and implementation of numerical algorithms for a block tri-diagonal linear system are presented. The solver can complete a calculation within 16 μs with a 65×65 grid and 27 μs with a 129×129 grid, which allows P-EFIT to meet the timing requirement for real-time plasma control at both grid sizes.
Keywords: tokamak; Grad-Shafranov equation; equilibrium reconstruction; GPU; parallel computation
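The abstract above mentions numerical algorithms for a block tri-diagonal linear system. As a much simpler point of reference, the serial Thomas algorithm for a scalar tridiagonal system is sketched below. It is not the P-EFIT GPU solver and does not handle blocks; the function name and argument layout are assumptions, and no pivoting is performed, so it suits well-conditioned (for example diagonally dominant) systems.

```python
def thomas_solve(lower, diag, upper, rhs):
    """Solve a scalar tridiagonal system by forward elimination and back substitution.

    lower[i], diag[i], upper[i] are the sub-, main-, and super-diagonal
    entries of row i; lower[0] and upper[-1] are ignored.
    """
    n = len(diag)
    c = [0.0] * n  # modified super-diagonal
    d = [0.0] * n  # modified right-hand side
    c[0] = upper[0] / diag[0]
    d[0] = rhs[0] / diag[0]
    for i in range(1, n):
        denom = diag[i] - lower[i] * c[i - 1]
        c[i] = upper[i] / denom if i < n - 1 else 0.0
        d[i] = (rhs[i] - lower[i] * d[i - 1]) / denom
    x = [0.0] * n
    x[-1] = d[-1]
    for i in range(n - 2, -1, -1):
        x[i] = d[i] - c[i] * x[i + 1]
    return x
```

The forward and backward sweeps are inherently sequential along one system, which is why GPU solvers typically look for parallelism across many independent systems or restructure the elimination.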