期刊文献+

多核处理器中基于Radix-Join的嵌套循环连接优化 被引量:1

Nested Loop Join Optimization Based on Radix-Join in Chip Multi-Processor
在线阅读 下载PDF
导出
摘要 针对目前主流的多核处理器,研究了基于共享Cache多核处理器的数据库Nested Loop Join(NINLJ)优化.针对无索引情况下的NLJ,提出了基于Radix-NL-Join算法的NLJ多线程执行框架.从减少Cache访问冲突和提高Cache命中率两个方面优化了NINLJ多线程执行框架中的聚集划分和聚集连接线程.主要贡献如下:1.针对多线程访问共享Cache容易出现共享Cache访问冲突的问题,优化了聚集划分阶段的多线程聚集划分线程的启动时机;2.针对聚集连接阶段,聚集连接线程Cache访问性能不佳,利用聚集连接线程顺序访问聚集的优势,采用预取线程提高聚集连接线程的性能;3.在实验中,基于开源数据库EaseDB实现了上述多线程执行框架,测试了多线程NLJ的性能.实验结果表明,提出的NLJ多线程执行框架,可以充分利用多核处理器的计算资源,并有效地解决共享Cache在多线程条件下的Cache访问冲突问题,大大提高了NLJ的性能,相对于未采用Cache优化的多线程Radix-NL-Join算法,其性能提升了26%左右. Aiming at current chip multi-processor(CMP),presented in this paper is a non-indexed nested loop join (NINLJ) optimization based on shared cache CMP. The authors firstly present multithreaded NINLJ execution framework based on radix-NL-join algorithm,and then,through reducing cache conflict and improving cache hit ratio,optimize cache performance of cluster partition thread and cluster join thread in the framework. The main contributions are as follows: 1.Aiming at the shared cache confliction when multiple threads access shared cache simultaneously,the start time of cluster partition thread is optimized to reduce shared cache confliction in cluster partition phase; 2. In cluster join phase,cluster join threads have poor cache behaviors. To solve this performance bottleneck,the advantage of sequent cluster access is utilized when cluster join threads executing,and preload thread is adopted to preload cluster from main memory to L2-cache before cluster join threads need it; 3.In the experiments,the framework is realized in EaseDB,and the performance of multithreaded NINLJ is tested. The experiment results show that the multithreaded NINLJ execution framework could fully utilize computing resource of CMP and effectively solve shared cache conflict in multithreaded environment,and the performance of NINLJ is improved. The algorithm proposed outperforms traditional multithreaded Radix-NL-Join by 26% on average.
出处 《计算机研究与发展》 EI CSCD 北大核心 2010年第6期1079-1087,共9页 Journal of Computer Research and Development
基金 国家"八六三"高技术研究发展计划基金项目(2007AA120400 2007AA12Z208) 国家自然科学基金项目(40801160)
关键词 嵌套循环连接 Radix-Join 共享Cache多核处理器 多线程优化 Cache访问优化 nested loop join radix-join shared cache chip multi-processor multithreaded optimization cache access optimization
作者简介 邓亚丹,1981年生,博士,主要研究方向为数据库内核优化和空间数据库.dengyadan2008@yahoo. com. cn 景宁,1963年生,教授,博士生导师,主要研究方向为地理信息系统与空间数据库. 熊伟,1976年生,博士,讲师,主要研究方向为主动数据库和空间数据库.
  • 相关文献

参考文献12

  • 1Zhou J, Cieslewiez J, Ross K A, et al. Improving database performance on simultaneous multithreaded processors [C]// Proc of the 31st Int Conf on Very Large Data Bases. New York: ACM, 2005:49-60.
  • 2Hardavellas N, Pandis I, Johnson R. Database servers on chip multiprocessors limitations and opportunities [C]//Proc of the 3rd Biennial Conf on Innovative Data Systems Research. New York: ACM, 2007:79-87.
  • 3Cieslewicz J, Ross K A, Giannakakis I. Parallel buffer for chip multiprocessors [C] //Proe of the 3rd Int Workshop on Data Management on New Hardware. New York: ACM, 2007 : 2.
  • 4Cieslewiez J, Ross K A. Adaptive aggregation on chip multiprocezsors [C] //Proc of the 33rd Int Conf on Very Large Data Bases. New York: ACM, 2007:339-350.
  • 5Chen Shimin, Gibbons P B, Kozuch M. Scheduling threads for constructive cache sharing on CMPs [C] //Proe of the 19th Annual ACM Symp on Parallel Algorithms and Architectures. New York: ACM, 2007:105-115.
  • 6Hankins R A, Patel J M. Effect of node size on the performance of cache-conscious B^+-Tree[C] /lint Conf on Measurements and Modeling of Computer Systems. New York: ACM, 2003:283-294.
  • 7Chen S, Ailamaki A, Gibbons P B. Improving hash join performance through prefetching [C] HProc of Int Conf on Data Engineering. New York: ACM, 2004: 116-127.
  • 8Boncz P, Manegold S, Kersten M L. Database architecture optimized for the new bottleneck: Memory access [C]//Proc of the 25th VLDB Conf. New York: ACM, 1999:231-246.
  • 9He Bingsheng, Luo Qiong. Cache-oblivious nested loop joins [C] //Proc of Int Conf on Information and Knowledge Management. New York: ACM, 2006 : 718-727.
  • 10叶常春,杨利,杨树强.基于工作站机群结构的并行数据库连接算法的实现[J].计算机工程与科学,2000,22(5):77-80. 被引量:1

二级参考文献4

  • 1李建中.并行数据库技术.数据库技术新进展[M].北京:清华大学出版社,1997.104-130.
  • 2杨利.复杂查询并行执行及并行数据库系统技术研究[博士学位论文].长沙:国防科技大学,1995.93-116.
  • 3李建中,数据库技术新进展,1997年,104页
  • 4杨利,博士学位论文,1995年,93页

同被引文献3

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部