基于快速傅里叶变换的快速迭代收缩阈值算法(fast iterative shrinkage threshold algorithm based on fast Fourier transform, FFT-FISTA)具有较高的计算效率,但其忽略点扩散函数的空间变化及卷绕误差,造成声源识别性能的损失,为此提...基于快速傅里叶变换的快速迭代收缩阈值算法(fast iterative shrinkage threshold algorithm based on fast Fourier transform, FFT-FISTA)具有较高的计算效率,但其忽略点扩散函数的空间变化及卷绕误差,造成声源识别性能的损失,为此提出基于函数波束形成的改进FFT-FISTA算法。改进算法以函数波束形成输出作为FFT-FISTA算法的迭代输入,建立函数波束形成、声源分布及升幂空间转移不变点扩散函数的线性方程组,基于周期边界条件下的快速傅里叶变换进行迭代求解,使被运算的非周期函数变为一个周期函数,解决补零边界带来的波数泄漏问题,可提高运算准确性,进一步提升成像性能;通过指数运算锐化点扩散函数主瓣,拓展点扩散函数空间转移不变性假设的适用性。仿真和试验结果表明,相较于常规FFT-FISTA算法,改进算法能提升成像空间分辨率及动态范围,扩大FFT-FISTA算法的有效成像区域,压缩气体泄漏试验结果验证了改进算法的有效性。展开更多
DFT is widely applied in the field of signal process and others. Most present rapid ways of calculation are either based on paralleled computers connected by such particular systems like butterfly network, hypercube e...DFT is widely applied in the field of signal process and others. Most present rapid ways of calculation are either based on paralleled computers connected by such particular systems like butterfly network, hypercube etc; or based on the assumption of instant transportation, non-conflict communication, complete connection of paralleled processors and unlimited usable processors. However, the delay of communication in the system of information transmission cannot be ignored. This paper works on the following aspects: instant transmission, dispatching missions, and the path of information through the communication link in the computer cluster systems; layout of the dynamic FFT algorithm under the different structures of computer clusters.展开更多
快速傅里叶变换(fast Fourier transform,FFT)在数字信号处理中占据核心地位.随着高性能超长点数FFT需求的增长,数字信号处理器(digital signal processor,DSP)的计算能力越来越难以满足需求,集成FFT加速器成为重要的发展趋势.为了支持...快速傅里叶变换(fast Fourier transform,FFT)在数字信号处理中占据核心地位.随着高性能超长点数FFT需求的增长,数字信号处理器(digital signal processor,DSP)的计算能力越来越难以满足需求,集成FFT加速器成为重要的发展趋势.为了支持超长点数FFT,将2维分解算法推广到多维,提出一种可集成于DSP的高性能超长点数FFT加速器结构.该结构通过基于素数个存储体的无冲突体编址方法实现了3维转置运算;通过递推算法实现了高效铰链因子生成;使用单精度浮点二项融合点积运算和融合加-减运算,对FFT运算电路进行了精细化设计.实现了对4G点数单精度浮点FFT计算的支持.综合结果表明:FFT加速器运行频率能够达到1GHz以上,性能达到640Gflop/s.在支持的点数和性能方面都较已有研究成果取得大幅提升.展开更多
文摘基于快速傅里叶变换的快速迭代收缩阈值算法(fast iterative shrinkage threshold algorithm based on fast Fourier transform, FFT-FISTA)具有较高的计算效率,但其忽略点扩散函数的空间变化及卷绕误差,造成声源识别性能的损失,为此提出基于函数波束形成的改进FFT-FISTA算法。改进算法以函数波束形成输出作为FFT-FISTA算法的迭代输入,建立函数波束形成、声源分布及升幂空间转移不变点扩散函数的线性方程组,基于周期边界条件下的快速傅里叶变换进行迭代求解,使被运算的非周期函数变为一个周期函数,解决补零边界带来的波数泄漏问题,可提高运算准确性,进一步提升成像性能;通过指数运算锐化点扩散函数主瓣,拓展点扩散函数空间转移不变性假设的适用性。仿真和试验结果表明,相较于常规FFT-FISTA算法,改进算法能提升成像空间分辨率及动态范围,扩大FFT-FISTA算法的有效成像区域,压缩气体泄漏试验结果验证了改进算法的有效性。
文摘DFT is widely applied in the field of signal process and others. Most present rapid ways of calculation are either based on paralleled computers connected by such particular systems like butterfly network, hypercube etc; or based on the assumption of instant transportation, non-conflict communication, complete connection of paralleled processors and unlimited usable processors. However, the delay of communication in the system of information transmission cannot be ignored. This paper works on the following aspects: instant transmission, dispatching missions, and the path of information through the communication link in the computer cluster systems; layout of the dynamic FFT algorithm under the different structures of computer clusters.
文摘快速傅里叶变换(fast Fourier transform,FFT)在数字信号处理中占据核心地位.随着高性能超长点数FFT需求的增长,数字信号处理器(digital signal processor,DSP)的计算能力越来越难以满足需求,集成FFT加速器成为重要的发展趋势.为了支持超长点数FFT,将2维分解算法推广到多维,提出一种可集成于DSP的高性能超长点数FFT加速器结构.该结构通过基于素数个存储体的无冲突体编址方法实现了3维转置运算;通过递推算法实现了高效铰链因子生成;使用单精度浮点二项融合点积运算和融合加-减运算,对FFT运算电路进行了精细化设计.实现了对4G点数单精度浮点FFT计算的支持.综合结果表明:FFT加速器运行频率能够达到1GHz以上,性能达到640Gflop/s.在支持的点数和性能方面都较已有研究成果取得大幅提升.