期刊文献+

一种用于实时图像处理的众核结构设计 被引量:4

Novel many-core architecture design for real-time image processing
在线阅读 下载PDF
导出
摘要 基于数据流模型和硬件可重构技术,提出了一种面向图像处理应用的可重构的多模式众核处理器结构.处理器采用了可扩展的层次化阵列结构,分布式共享存储和带硬件握手的近邻互连,可以分区并发实现多种并行模式,并克服了传统处理器实现数据流计算的低效性;基于VC++开发了集成仿真平台,用于对结构性能和指令性能的仿真验证,并在现场可编程门阵列上实现了包含64个处理单元的所提结构.仿真结果表明,所提结构实现了超过图形处理单元的性能以及接近专用集成电路的数据吞吐量. Based on the data-flow model and hardware reconfigurable technology, a polymorphic reconfigurable many-core processor architecture is presented for image processing. It is a scalable hierarchically organized parallel architecture,which is capable of supporting a dynamic mixture of multiple parallel computing models,and overcomes the inefficiency of traditional data-flow implementation by using distributed shared memory and neighbor interconnect architecture with hardware handshaking.From the beginning of the architecture design,based on the VC++,the integrated simulation platform (ISE) is developed for verifying the architecture and the performance of the instruction set.In addition,we also implement the proposed architecture on the FPGA.Experimental results show that the architecture can be used in many image processing applications,and achieve the throughput close to that of the ASIC and the performance better than that of the GPU.
出处 《西安电子科技大学学报》 EI CAS CSCD 北大核心 2015年第2期95-101,共7页 Journal of Xidian University
基金 国家自然科学基金资助项目(61136002 61272120)
关键词 多模式可重构结构 数据流 众核 并行计算 polymorphic reconfigurable architecture data-flow many-core parallel computin
作者简介 刘镇弢(1971-),男,西安电子科技大学博士研究生,E—mail:liuzhentao@xupt.edu.cn.
  • 相关文献

参考文献12

  • 1Licciardo G D,Albanese L F.Design of a Context-adaptive Variable Length Encoder for Real-time Video Compression on Reconfigurable Platforms[J].IET Image Processing,2012,6(4):301-308.
  • 2Dixit H V,Jeyakumar A,Kasat P S,et al.VLSI Design of Fast DCTQ-IQIDCT Processor for Real Time Image Compression[C]//Proceedings of Tenth International Conference on Wireless and Optical Communications Networks.Piscataway:IEEE,2013:1-5.
  • 3Orchard G,Zhang J,Suo Y,et al.Real Time Compressive Sensing Video Reconstruction in Hardware[J].IEEE Journal on Emerging and Selected Topics in Circuits and Systems,2012,2(3):604-615.
  • 4Coates A,Baumstarck P,Le Q,et al.Scalable Learning for Object Detection with GPU Hardware[C]//Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems.Piscataway:IEEE,2009:4287-4293.
  • 5Vangal S,Howard J,Ruhl G,et al.An 80-tile 1.28 TFLOPS Network-on-chip in 65nm CMOS[C]//Proceedings of IEEE International Solid-State Circuits Conference.Piscataway:IEEE,2007:98-589.
  • 6Hutchings B,Nelson B,West S,et al.Comparing Fine-grained Performance on the Ambric MPPA Against an FPGA[C]//Proceedings of International Conference on Field Programmable Logic and Applications.Piscataway:IEEE,2009:174-179.
  • 7Zhang Y,Yan C,Dai F,et al.Efficient Parallel Framework for H.264/AVC Deblocking Filter on Many-core Platform[J].IEEE Transactions on Multimedia,2012,14(3):510-524.
  • 8Sankaralingam K,Nagarajan R,Liu H,et al.Exploiting ILP,TLP,and DLP with the Polymorphous TRIPS Architecture[C]//Proceedings of 30th Annual International Symposium on Computer Architecture.Piscataway:IEEE,2003:422-433.
  • 9Dennis B,Misunas D P.A Preliminary Architecture for a Basic Data-flow Processor[J].ACM SIGARCH Computer Architecture News,1974,3(4):126-132.
  • 10SHEN XuBang.Evolution of MPP SoC architecture techniques[J].Science in China(Series F),2008,51(6):756-764. 被引量:7

二级参考文献11

  • 1沈绪榜,张发存,冯国臣,车得亮,王光.计算机体系结构的分类模型[J].计算机学报,2005,28(11):1759-1766. 被引量:10
  • 2Manners D,Makimoto T.Living with the Chip[]..1995
  • 3Le H Q,Starke W J,Fields J S, et al.IBM POWER6 microarchitecture[].J Res Dev.2007
  • 4AMD Corp.AMD Opteron?[].Product Data Sheet.2004
  • 5Ratner M,Ratner D, et al.Nanotechnology——A Gentle Introduction to the Next Big Idea[]..2003
  • 6Macias N I.Adaptive method for growing electronic circuits on an imperfect synthetic matrix[].Biosystems Engineering.2004
  • 7Adleman L M.Computing with DNA[].Scientific American.1998
  • 8Flynn M J.Very high speed computing systems[].Proceedings of the IEEE.1966
  • 9NVIDIA Corp.NVIDIA GeForce 8800 Architecture Technical Brief[]..2006
  • 10Persson E.ATI Radeon? HD 2000 programming guide[].AMD Graphics Products Report.2007

共引文献6

同被引文献7

引证文献4

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部