A survey of performance improvement methods for multi-core cache sparse directory

Cited by: 2
Abstract: Constrained by power consumption, general-purpose microprocessors stopped pursuing higher clock frequencies more than a decade ago and turned instead to integrating more processor cores. Meanwhile, as transistor density has kept rising in line with Moore's law, the number of cores that fit on a single chip has multiplied, and on-chip multi-core and many-core processors have become the mainstream of high-performance microprocessor development. Support for the shared-memory programming model in future kilo-core general-purpose many-core processors is an inevitable trend, but the traditional cache coherence directory structure suffers from high lookup latency, frequent replacement of directory entries, and limited scalability of hardware cost and power consumption. The sparse directory strikes a tradeoff between the hardware overhead of the traditional directory structure and the efficiency of coherence maintenance, and is regarded as an energy-efficient, scalable structure for maintaining cache coherence in many-core processors. This paper surveys recent research and methods for improving sparse directory performance, analyzes them in terms of area, access latency, power consumption, and implementation complexity, and summarizes the merits and shortcomings of each method. The survey offers a reference for designing novel shared-memory architectures for future high-performance many-core processors.
Authors: WU Jian-guo (吴健虢), CHEN Hai-yan (陈海燕), LIU Sheng (刘胜), DENG Rang-yu (邓让钰), CHEN Jun-jie (陈俊杰). School of Computer, National University of Defense Technology, Changsha 410073, China.
Source: Computer Engineering & Science (《计算机工程与科学》), 2019, No. 3, pp. 385-392 (8 pages). Indexed in CSCD and the Peking University Core Journals list.
Funding: National Natural Science Foundation of China (61472432)
Keywords: chip multi-processor (CMP); cache coherence; sparse directory; associativity; scalable
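About the authors: WU Jian-guo (born 1993), male, from Baoshan, Yunnan; master's degree; research interest: microprocessor technology. E-mail: 2267076470@qq.com. CHEN Hai-yan (born 1967), female, from Nanchong, Sichuan; master's degree, research professor; research interest: microprocessor technology. E-mail: hychen608@163.com. LIU Sheng (born 1984), male, from Nanyang, Henan; PhD, assistant research professor; research interest: microprocessor technology. E-mail: liusheng83@nudt.edu.cn. DENG Rang-yu (born 1972), male, from Xintian, Hunan; PhD, research professor; research interest: high-performance microprocessor architecture. E-mail: rydeng@nudt.edu.cn. CHEN Jun-jie (born 1986), male, from Changde, Hunan; master's degree, engineer; research interest: microprocessor technology. E-mail: 393221249@qq.com.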