期刊文献+
共找到3,703篇文章
< 1 2 186 >
每页显示 20 50 100
New multi-DSP parallel computing architecture for real-time image processing 被引量:4
1
作者 Hu Junhong Zhang Tianxu Jiang Haoyang 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2006年第4期883-889,共7页
The flexibility of traditional image processing system is limited because those system are designed for specific applications. In this paper, a new TMS320C64x-based multi-DSP parallel computing architecture is present... The flexibility of traditional image processing system is limited because those system are designed for specific applications. In this paper, a new TMS320C64x-based multi-DSP parallel computing architecture is presented. It has many promising characteristics such as powerful computing capability, broad I/O bandwidth, topology flexibility, and expansibility. The parallel system performance is evaluated by practical experiment. 展开更多
关键词 parallel computing image processing REAL-TIME computer architecture
在线阅读 下载PDF
Programming for scientific computing on peta-scale heterogeneous parallel systems 被引量:1
2
作者 杨灿群 吴强 +2 位作者 唐滔 王锋 薛京灵 《Journal of Central South University》 SCIE EI CAS 2013年第5期1189-1203,共15页
Peta-scale high-perfomlance computing systems are increasingly built with heterogeneous CPU and GPU nodes to achieve higher power efficiency and computation throughput. While providing unprecedented capabilities to co... Peta-scale high-perfomlance computing systems are increasingly built with heterogeneous CPU and GPU nodes to achieve higher power efficiency and computation throughput. While providing unprecedented capabilities to conduct computational experiments of historic significance, these systems are presently difficult to program. The users, who are domain experts rather than computer experts, prefer to use programming models closer to their domains (e.g., physics and biology) rather than MPI and OpenME This has led the development of domain-specific programming that provides domain-specific programming interfaces but abstracts away some performance-critical architecture details. Based on experience in designing large-scale computing systems, a hybrid programming framework for scientific computing on heterogeneous architectures is proposed in this work. Its design philosophy is to provide a collaborative mechanism for domain experts and computer experts so that both domain-specific knowledge and performance-critical architecture details can be adequately exploited. Two real-world scientific applications have been evaluated on TH-IA, a peta-scale CPU-GPU heterogeneous system that is currently the 5th fastest supercomputer in the world. The experimental results show that the proposed framework is well suited for developing large-scale scientific computing applications on peta-scale heterogeneous CPU/GPU systems. 展开更多
关键词 heterogeneous parallel system programming framework scientific computing GPU computing molecular dynamic
在线阅读 下载PDF
Heuristic file sorted assignment algorithm of parallel I/O on cluster computing system
3
作者 陈志刚 曾碧卿 +3 位作者 熊策 邓晓衡 曾志文 刘安丰 《Journal of Central South University of Technology》 EI 2005年第5期572-577,共6页
A new file assignment strategy of parallel I/O, which is named heuristic file sorted assignment algorithm was proposed on cluster computing system. Based on the load balancing, it assigns the files to the same disk ac... A new file assignment strategy of parallel I/O, which is named heuristic file sorted assignment algorithm was proposed on cluster computing system. Based on the load balancing, it assigns the files to the same disk according to the similar service time. Firstly, the files were sorted and stored at the set I in descending order in terms of their service time, then one disk of cluster node was selected randomly when the files were to be assigned, and at last the continuous files were taken orderly from the set I to the disk until the disk reached its load maximum. The experimental results show that the new strategy improves the performance by 20.2% when the load of the system is light and by 31.6% when the load is heavy. And the higher the data access rate, the more evident the improvement of the performance obtained by the heuristic file sorted assignment algorithm. 展开更多
关键词 cluster computing parallel I/O file sorted assignment variance of service time
在线阅读 下载PDF
Effects of horizontal splitter plates on the vortex-induced vibration and aerostatic characteristics of twin separated parallel decks for a rail-cum-road bridge
4
作者 HE Xu-hui YANG Jia-feng +2 位作者 LIU Lu-lu ZOU Yun-feng HE Jing 《Journal of Central South University》 2025年第3期1024-1043,共20页
Installing the splitter plates is a passive aerodynamic solution for eliminating vortex-induced vibration (VIV). However, the influences of splitter plates on the VIV and aerostatic performances are more complicated d... Installing the splitter plates is a passive aerodynamic solution for eliminating vortex-induced vibration (VIV). However, the influences of splitter plates on the VIV and aerostatic performances are more complicated due to aerodynamic interference between highway and railway decks. To study the effects of splitter plates, wind tunnel experiments for measuring VIV and aerostatic forces of twin decks under two opposite flow directions were conducted, while the surrounding flow and wind pressure of static twin decks with and without splitter plates are numerically simulated. The results showed that the incoming flow direction affects the VIV response and aerostatic coefficients. The highway deck has poor vertical and torsional VIV, and the VIV region and amplitude are different under different directions. While the railway deck only has vertical VIV when located upstream. The splitter plates can impede the process of vortex generation, shedding and impinging at the gap between twin deck, and significantly reducing the surface fluctuating pressure coefficient, thus effectively suppressing the VIV of twin decks. While, the splitter plates hurt the upstream deck regarding static wind stability and have little effect on the downstream deck. The splitter plates of appropriate width are recommended to improve VIV performances in twin parallel bridges. 展开更多
关键词 splitter plates vortex-induced vibration(VIV) aerostatic characteristic wind tunnel test twin parallel decks the rail-cum-road bridges computational fluid dynamics
在线阅读 下载PDF
Preconditioned method in parallel computation
5
作者 Wu Ruichan Wei Jianing 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2006年第1期220-222,共3页
The grid equations in decomposed domain by parallel computation are soled, and a method of local orthogonalization to solve the large-scaled numerical computation is presented. It constructs preconditioned iteration m... The grid equations in decomposed domain by parallel computation are soled, and a method of local orthogonalization to solve the large-scaled numerical computation is presented. It constructs preconditioned iteration matrix by the combination of predigesting LU decomposition and local orthogonalization, and the convergence of solution is proved. Indicated from the example, this algorithm can increase the rate of computation efficiently and it is quite stable. 展开更多
关键词 grid equations parallel computation PRECONDITION LU decomposition local orthogonalization.
在线阅读 下载PDF
Study on High-Performance Computing for Simulation of End Milling Force
6
作者 ZHANG Zhi-hai, ZHENG Li, LI Zhi-zhong, LIU Da-cheng, ZHAN G Bo-peng (Department of Industry Engineering, Tsinghua University, Beijing 1000 84, China) 《厦门大学学报(自然科学版)》 CAS CSCD 北大核心 2002年第S1期183-184,共2页
Milling Process Simulation is one of the important re search areas in manufacturing science. For the purpose of improving the prec ision of simulation and extending its usability, numerical algorithm is more and more ... Milling Process Simulation is one of the important re search areas in manufacturing science. For the purpose of improving the prec ision of simulation and extending its usability, numerical algorithm is more and more used in the milling modeling areas. But simulative efficiency is decreasin g with increase of its complexity. As a result, application of the method is lim ited. Aimed at above question, high-efficient algorithm for milling process sim ulation is studied. It is important for milling process simulation’s applicatio n. Parallel computing is widely used to solve the large-scale computation question s. Its advantages include system flexibility, robust, high-efficient computing capability and high ratio of performance to price. With the development of compu ter network, utilizing the computing resource in the Internet, a virtual computi ng environment with powerful computing capability can be consisted by microc omputers, and the difficulty of building hardware environment which is used to s upport parallel computing is reduced. How to use network technology and parallel algorithm to improve simulative effic iency for milling forces simulation is investigated in the paper. In order to pr edict milling forces, a simplified local milling forces model is used in the pap er. End milling cutter is assumed to be divided by r number of differential elem ents along the axial direction of the cutter. For a given time, the total cuttin g forces can be obtained by summarizing the resultant cutting force produced by each differential cutter disc. Divide the whole simulative time into some segmen ts, send these program’s segments to microcomputers in the Internet and obtain the result of the program’s segments, all of the result of program’s segments a re composed the final result. For implementing the algorithm, a distributed Parallel computing framework is de signed in the paper. In the framework, web server plays a role of controller. Us ing Java RMI(remote method interface), the computing processes in computing serv er are called by web server. There are lots of control processes in web server a nd control the computing servers. The codes of simulative algorithm can be dynam ic sent to the computing servers, and milling forces at the different time are c omputed through utilizing the local computer’s resource. The results that are ca lculated by every computing servers are sent to the web server, and composed the final result. The framework can be used by different simulative algorithm. Comp ared with the algorithm running single machine, the efficiency of provided algor ithm is higher than that of single machine. 展开更多
关键词 end-milling force model SIMULATION high-perfo rmance computing parallel algorithm Java RMI
在线阅读 下载PDF
Efficient Partially Asynchronous Parallel Simulation on Multicomputer Systems: Research and Practice
7
作者 Chen, Delai Hong, Bo +1 位作者 Xie, Zhiwu Weng, Shilie 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 1998年第2期40-47,共8页
This paper presents partially asynchronous parallel simulation of continuous-system (PAPSoCS) and some approaches to the issues of its implementation on a multicomputer system. To guarantee the simulation results cor... This paper presents partially asynchronous parallel simulation of continuous-system (PAPSoCS) and some approaches to the issues of its implementation on a multicomputer system. To guarantee the simulation results correct and speedup the simulation, the scheme for efficient PAPSoCS is proposed and the virtual topology star is constructed to match the path of message passing for solving algorithm-architecture adequation problem. Under the circumstances that messages frequently passed inter-processor are much shorter, typically within several 4 bytes, asynchronous communication mode is employed to reduce the communication ratio. Experiment results show that asynchronous parallel simulation has much higher efficiency than its synchronous counterpart. 展开更多
关键词 parallel processing Asynchronous computation Virtual topology Multicomputer system SIMULATION
在线阅读 下载PDF
Combination Method for Parallel Computation in ODEs
8
作者 Song Xiaoqiu(Beijing Institute of Computer Application and Simulation Technology ) 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 1996年第4期19-26,共8页
In this paper, a 3rd order combination method with three processes and a 4th order combination method with five processes for solving ODEs are discussed. These methods are the Runge-Kutta method combined with a linear... In this paper, a 3rd order combination method with three processes and a 4th order combination method with five processes for solving ODEs are discussed. These methods are the Runge-Kutta method combined with a linear multistep method, which overcomes the defect of the 3rd order parallel Runge-Kutta method discussed in [1]. 展开更多
关键词 SOFTWARE RELIABILITY Numerical analysis Combination method parallel computation ODEs.
在线阅读 下载PDF
Multi-core optimization for conjugate gradient benchmark on heterogeneous processors
9
作者 邓林 窦勇 《Journal of Central South University》 SCIE EI CAS 2011年第2期490-498,共9页
Developing parallel applications on heterogeneous processors is facing the challenges of 'memory wall',due to limited capacity of local storage,limited bandwidth and long latency for memory access. Aiming at t... Developing parallel applications on heterogeneous processors is facing the challenges of 'memory wall',due to limited capacity of local storage,limited bandwidth and long latency for memory access. Aiming at this problem,a parallelization approach was proposed with six memory optimization schemes for CG,four schemes of them aiming at all kinds of sparse matrix-vector multiplication (SPMV) operation. Conducted on IBM QS20,the parallelization approach can reach up to 21 and 133 times speedups with size A and B,respectively,compared with single power processor element. Finally,the conclusion is drawn that the peak bandwidth of memory access on Cell BE can be obtained in SPMV,simple computation is more efficient on heterogeneous processors and loop-unrolling can hide local storage access latency while executing scalar operation on SIMD cores. 展开更多
关键词 multi-core processor NAS parallelization CG memory optimization
在线阅读 下载PDF
A Parallel Computational Scheme on Hybrid Methods
10
作者 Zhao ShuangsuoDepartment of Mathematics, Lanzhou University, Lanzhou 730000Wang ChangyinInst. of Mech. & Elec. Eng., Gansu University of Technology, Lanzhou 730050 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 1994年第4期8-18,共11页
Based on the efficient hybrid methods for solving initial value problems of stiff ODEs, this paper derives a parallel scheme that can be used to solve the problems on parallel computers with N processors, and discusse... Based on the efficient hybrid methods for solving initial value problems of stiff ODEs, this paper derives a parallel scheme that can be used to solve the problems on parallel computers with N processors, and discusses the iteratively B-convergence of the Newton iterative process, finally, the paper provides some numberical results which show that the parallel scheme is highly efficient as N is not too large. 展开更多
关键词 Hybrid methods parallel computation Iteratively B-convergence.
在线阅读 下载PDF
Implementing Higher-Order Gamma on a Massively Parallel computer-A Case study
11
作者 Linpeng Huang Kam Wing Ng, Yongqiang Sun(Department of Computer Science and EngineeringShanghai Jiao Tong University, Shanghai 200030, P. R. China)(Department of Computer Science, The Chinese University of Hong Kong, Hong Kong) 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 1995年第3期56-62,共7页
Gamma is a kernel programming language with an elegant chemical reaction metaphor in whichprograms are described in terms of multiset rewriting. Gamma formalism allows one to describe analgorithm without introducing a... Gamma is a kernel programming language with an elegant chemical reaction metaphor in whichprograms are described in terms of multiset rewriting. Gamma formalism allows one to describe analgorithm without introducing artificial sequentiality and leads to the derivation of a parallel solution to agiven problem naturally. However, the difficulty of incorporating control strategies makes Gamma not onlyhard for one to define any sophisticated approaches but also impossible to reach a decent level of efficiencyin any direct implementation. Recently, a higherorder multiset programming paradigm, named higher--order Gamma, is introduced by Metayer to alleviate these problems. In this paper, we investigate the possibility of implementing higherorder Gamma on Maspar, a massively data parallel computer. The results showthat a program written in higher--order Gamma can be transformed naturally toward an efficientimplementation on a real parallel machine. 展开更多
关键词 Massively parallel computation GAMMA programming paradigm
在线阅读 下载PDF
气象格点数算一体空间分析库的设计与实现 被引量:2
12
作者 王舒 徐拥军 +6 位作者 何文春 吴焕萍 高峰 刘媛媛 刘北 吕冠儒 倪学磊 《应用气象学报》 北大核心 2025年第1期121-128,共8页
气象格点数据通常以文件形式存储在分布式文件库中,业务系统在使用过程中需要将文件下载到本地,对文件解析后再进行分析计算。这种方式导致数据检索困难、响应时间长、无法满足业务在线计算及交互式应用需求。为此,2022年底国家气象信... 气象格点数据通常以文件形式存储在分布式文件库中,业务系统在使用过程中需要将文件下载到本地,对文件解析后再进行分析计算。这种方式导致数据检索困难、响应时间长、无法满足业务在线计算及交互式应用需求。为此,2022年底国家气象信息中心基于天擎空间分析库研发完成了分布式环境下气象格点数据与计算集成的数算一体数据库——Post Grid,该数据库包含数据层和算子层。数据层将气象格点数据在要素、起报、预报、空间、层次、样本等维度上的拆分后统一规范化存储,提高数据库的数据读取和分析效率。算子层通过数据库中的SQL函数实现,支持在数据库内部对格点数据进行各种操作,且算子支持分布式并行计算。性能测试和业务应用结果表明:Post Grid数据库能将传统的聚合计算服务时效由分钟级提升至毫秒级,极大提高了气象格点数据服务的性能、灵活性和数算一体能力,具有广泛应用价值。 展开更多
关键词 数算一体 气象格点数据 Post Grid 并行计算 分布式
在线阅读 下载PDF
基于FPGA的MobileNetV1目标检测加速器设计 被引量:2
13
作者 严飞 郑绪文 +2 位作者 孟川 李楚 刘银萍 《现代电子技术》 北大核心 2025年第1期151-156,共6页
卷积神经网络是目标检测中的常用算法,但由于卷积神经网络参数量和计算量巨大导致检测速度慢、功耗高,且难以部署到硬件平台,故文中提出一种采用CPU与FPGA融合结构实现MobileNetV1目标检测加速的应用方法。首先,通过设置宽度超参数和分... 卷积神经网络是目标检测中的常用算法,但由于卷积神经网络参数量和计算量巨大导致检测速度慢、功耗高,且难以部署到硬件平台,故文中提出一种采用CPU与FPGA融合结构实现MobileNetV1目标检测加速的应用方法。首先,通过设置宽度超参数和分辨率超参数以及网络参数定点化来减少网络模型的参数量和计算量;其次,对卷积层和批量归一化层进行融合,减少网络复杂性,提升网络计算速度;然后,设计一种八通道核间并行卷积计算引擎,每个通道利用行缓存乘法和加法树结构实现卷积运算;最后,利用FPGA并行计算和流水线结构,通过对此八通道卷积计算引擎合理的复用完成三种不同类型的卷积计算,减少硬件资源使用量、降低功耗。实验结果表明,该设计可以对MobileNetV1目标检测进行硬件加速,帧率可达56.7 f/s,功耗仅为0.603 W。 展开更多
关键词 卷积神经网络 目标检测 FPGA MobileNetV1 并行计算 硬件加速
在线阅读 下载PDF
基于GPU并行计算的拓扑优化全流程加速设计方法
14
作者 张长东 吴奕凡 +3 位作者 周铉华 李旭东 肖息 张自来 《航空制造技术》 北大核心 2025年第12期34-41,67,共9页
随着大尺寸航空航天装备的发展需求,高效高精度的大规模拓扑优化设计成为该领域关注的焦点。针对现有大规模拓扑优化设计存在的计算量巨大、计算效率低下等问题,基于GPU并行计算开展了拓扑优化全流程加速设计方法的研究。对网格划分、... 随着大尺寸航空航天装备的发展需求,高效高精度的大规模拓扑优化设计成为该领域关注的焦点。针对现有大规模拓扑优化设计存在的计算量巨大、计算效率低下等问题,基于GPU并行计算开展了拓扑优化全流程加速设计方法的研究。对网格划分、刚度矩阵计算与组装、有限元求解等过程进行了并行加速,实现了高效高精度的体素网格划分及有限元过程的高效求解。此外,该方法针对拓扑优化设计过程的加速需求,对灵敏度过滤过程进行了并行加速处理。以300万体素单元的姿态推力器模型为设计对象,发现相比于Abaqus 2022软件的拓扑优化并行加速计算,本文所提方法的加速比提高了1259%,且两种方法的相似度极高,验证了所提方法的有效性与实用性。 展开更多
关键词 拓扑优化 并行计算 GPU加速 符号距离场 稀疏矩阵 网格划分
在线阅读 下载PDF
基于计算着色器的并行Delaunay三角剖分算法
15
作者 陈国军 李震烁 陈昊祯 《图学学报》 北大核心 2025年第1期159-169,共11页
Delaunay三角剖分是一种经典的计算几何算法,在众多领域中有着广泛地使用,随着实际需求的不断提高,现有的Delaunay三角剖分算法已不能满足大规模数据的需求,为此,提出了一种基于计算着色器的并行Delaunay三角剖分方法,该方法通过纹理缓... Delaunay三角剖分是一种经典的计算几何算法,在众多领域中有着广泛地使用,随着实际需求的不断提高,现有的Delaunay三角剖分算法已不能满足大规模数据的需求,为此,提出了一种基于计算着色器的并行Delaunay三角剖分方法,该方法通过纹理缓存将点集数据输入到计算着色器中,并利用计算着色器加速Delaunay三角剖分,同时在现有方法的基础上提出动态插入法解决点集在离散空间中的重映射问题。此外,为了能够让显存有限的GPU构建出远超其显存限制的Delaunay三角网,提出基于计算着色器的分区双向扫描算法,并将点集划分为多个子区域,然后通过扫描各个子区域的方式进行构网。实验结果表明:在相同运行环境下,基于计算着色器的方法与现有的方法相比缩短了构网时间。同时分区双向扫描算法很好地解决了GPU的显存瓶颈问题,能让显存有限的GPU构建出远超其显存容量的Delaunay三角网。 展开更多
关键词 DELAUNAY三角剖分 计算着色器 GPU 并行计算 VORONOI图
在线阅读 下载PDF
冲击地压扰动响应失稳理论并行计算
16
作者 潘一山 王学滨 +1 位作者 郑一方 陈双印 《煤炭学报》 北大核心 2025年第1期81-91,共11页
目前,冲击地压理论研究已经完成了从定性分析到定量分析的转变。巷道围岩临界应力计算是巷道安全性评价的重要依据。鉴于冲击地压问题的极度复杂性,在理论上继续取得突破极为困难。基于理论公式的巷道围岩临界应力计算,无法考虑更复杂... 目前,冲击地压理论研究已经完成了从定性分析到定量分析的转变。巷道围岩临界应力计算是巷道安全性评价的重要依据。鉴于冲击地压问题的极度复杂性,在理论上继续取得突破极为困难。基于理论公式的巷道围岩临界应力计算,无法考虑更复杂的实际情况,例如非圆形巷道、非静水压力和复杂岩层结构。冲击地压理论和数值计算相结合具有更加广阔的应用前景,能使冲击地压理论进一步走向实际应用,这是极有价值的发展方向。这方面研究成果的成功取得依赖于数值计算技术的快速发展。研究将当今较先进的岩层运动并行计算系统StrataKing(一种自主开发的以拉格朗日元与离散元耦合方法为基础的非线性断裂力学GPU并行计算方法)与冲击地压扰动响应失稳理论相结合,首次提出了圆形巷道扰动响应失稳理论的数值模拟方法。该方法的思想是将非线性断裂力学数值分析方法中的Ⅱ型断裂能设定为中间变量,从而建立了静水压力条件下圆形巷道围岩临界应力与冲击能指数之间的关系。为了获取冲击能指数的数值解,采用了仅出现一个剪切面的理想岩样进行单轴压缩数值试验,以排除其他因素对应力-应变曲线峰后倾向于直线部分斜率的影响。对于高角度剪切破裂,提出了将非标准岩样的计算结果转换成标准岩样的结果的折算方法。折算后冲击能指数的范围为0.17~13.52,位于全国131个冲击地压矿井的调研数据之内。巷道围岩临界应力的计算结果是理论结果的0.4~2.5倍,这与针对全国20个冲击地压矿井的调研数据(临界应力的修正系数普遍大于1,甚至接近8)定性相符,从局部化破坏围岩比均匀破坏围岩的承载力高的角度进行了解释。冲击地压与局部化的关系过去有讨论,扰动响应失稳理论与局部化过去并无关系。通过局部化,扰动响应失稳理论与冲击地压之间在破坏机理上产生了密切的关联。StrataKing可为冲击地压矿井巷道安全性评价提供强大的算力支撑。 展开更多
关键词 冲击地压 定量分析 扰动响应失稳理论 冲击能指数 局部化 并行计算 临界应力
在线阅读 下载PDF
永磁同步电机并行可切换式模型预测控制研究 被引量:1
17
作者 刘涛 赵晴晴 +3 位作者 俞亚伟 习金玉 赵宝山 侯玮杰 《组合机床与自动化加工技术》 北大核心 2025年第4期92-97,共6页
传统模型预测控制策略基于固定的控制周期与预测寻优结构,难以实现电机系统的动态-稳态性能综合优化。针对这一问题,提出了一种基于多核并行计算架构的可切换式模型预测控制策略。该方法通过分析两种经典模型预测控制策略在预测寻优过... 传统模型预测控制策略基于固定的控制周期与预测寻优结构,难以实现电机系统的动态-稳态性能综合优化。针对这一问题,提出了一种基于多核并行计算架构的可切换式模型预测控制策略。该方法通过分析两种经典模型预测控制策略在预测寻优过程中的数据依赖关系,构建具有不同控制周期、不同控制策略的微单元,通过设计算法切换策略,实现变结构、变周期控制。在此基础上,为减少切换过程引起的被控量波动,设计了时序优化策略。相关实验结果表明,所提控制策略兼顾了电机系统的动、稳态控制性能,实现了模型预测控制的稳态控制精度、暂态超调量、响应时间的同步优化。 展开更多
关键词 永磁同步电机 模型预测控制策略 变周期控制 多核并行计算
在线阅读 下载PDF
基于并行计算的计算智能综述
18
作者 吴菲 陈嘉诚 王万良 《浙江大学学报(工学版)》 北大核心 2025年第1期27-38,共12页
传统计算智能技术缺乏实时性和适应性,基于并行计算的计算智能技术能够提高计算效率,解决多模态信息兼容处理的问题.分别从智能计算的3个分支(神经网络、进化算法和群智能算法)介绍计算智能与大数据并行计算融合的研究现状.总结并行计... 传统计算智能技术缺乏实时性和适应性,基于并行计算的计算智能技术能够提高计算效率,解决多模态信息兼容处理的问题.分别从智能计算的3个分支(神经网络、进化算法和群智能算法)介绍计算智能与大数据并行计算融合的研究现状.总结并行计算智能面临的问题与挑战,思考相关研究的发展方向. 展开更多
关键词 并行计算 计算智能 神经网络 进化算法 群智能
在线阅读 下载PDF
基于ROACH2-GPU的集群相关器研究——Hashpipe软件在X-engine模块中的应用
19
作者 张科 王钊 +6 位作者 李吉夏 吴锋泉 田海俊 牛晨辉 张巨勇 陈志平 陈学雷 《贵州师范大学学报(自然科学版)》 北大核心 2025年第2期114-121,共8页
随着国际上越来越多干涉阵列设备的建造与运行,为人类探测未知宇宙的奥秘提供了丰富的观测数据,然而随之带来高速和密集型数据实时处理的巨大困难,对传统的数据处理技术提出了严峻的挑战。基于我国已建造的天籁计划一期项目在数据实时... 随着国际上越来越多干涉阵列设备的建造与运行,为人类探测未知宇宙的奥秘提供了丰富的观测数据,然而随之带来高速和密集型数据实时处理的巨大困难,对传统的数据处理技术提出了严峻的挑战。基于我国已建造的天籁计划一期项目在数据实时关联计算的需求,利用GPU在高性能并行计算上的优势,为天籁柱形探路者阵列设计并实现一套基于ROACH2-GPU的集群相关器,深入探究Hashpipe(High availibility shared pipeline engine)软件在集群相关器X-engine模块中的应用。首先介绍ROACH2-GPU集群相关器的整体架构,然后研究Hashpipe的核心功能和数据处理方法,实现了完整的分布式异构处理功能,优化了Hashpipe控制和参数接口。根据实际观测需求,可修改程序参数,能实现不同通道数量的相关器配置,降低后端软硬件设计的难度和成本。最后,在完成软件正确性测试的基础上,进行了强射电天文源的观测和处理,能够获得准确的干涉条纹。 展开更多
关键词 ROACH2-GPU Hashpipe 集群相关器 X-engine模块 并行计算
在线阅读 下载PDF
基于光量子计算机的电网停电后分区模型及量子比特扩容方法 被引量:1
20
作者 刘成骏 娄骐 +3 位作者 徐一骏 顾伟 文凯 马寅 《电力系统自动化》 北大核心 2025年第11期80-90,共11页
电网发生大停电事故后,采用分区并行恢复策略能够保证电网快速恢复正常运行,而精准、高效的分区方法则是恢复策略能够实施的重要前提之一。面对新型电力系统日益严格的时效性需求,现存方法无法直接求解模型且求解结果需要人工选择,并受... 电网发生大停电事故后,采用分区并行恢复策略能够保证电网快速恢复正常运行,而精准、高效的分区方法则是恢复策略能够实施的重要前提之一。面对新型电力系统日益严格的时效性需求,现存方法无法直接求解模型且求解结果需要人工选择,并受到经典计算机算力限制。针对上述问题,文中利用近年来新兴的量子计算技术,提出了基于量子计算的电网停电后快速恢复分区方法。首先,以切除线路权重和最小为目标,考虑电网实际运行安全约束,构建了大停电后网络分区模型,并将其转化为光量子计算机能直接求解的二元二次无约束优化模型;然后,考虑实机量子比特数限制,初步探讨了基于子问题抽取的量子比特扩容方法在实际应用中的可能性;最后,依托专用量子计算机,采用两个不同规模的系统分别验证了所提分区模型和量子扩容方法的有效性。 展开更多
关键词 新型电力系统 量子计算 量子比特扩容 停电 二元二次无约束优化 谱聚类 分区并行恢复
在线阅读 下载PDF
上一页 1 2 186 下一页 到第
使用帮助 返回顶部