期刊文献+

基于矩阵分解的Flink实时推荐策略

Flink Real-Time Recommendation Strategy Based on Matrix Decomposition
在线阅读 下载PDF
导出
摘要 虽然互联网快速进步发展,但也带来了大量的网络数据流,随之而来的是数据的综合存储,数据的综合计算和数据分析等诸多问题,各种业务系统的复杂多样化,数据分析的实效性要求也变得越来越高,先前常用的离线分析很多已经不适用于当今的生产需要,如今对数据的推荐系统在实时性方面有了更高的需求。基于矩阵分解的推荐算法作为目前较为流行的推荐算法,不论从预测的准确度还是预测的精确度都要明显地优于其它的算法。但传统的矩阵分解方法在处理大规模数据时存在计算速度慢和计算资源不足的问题。Flink大数据框架作为当前热门的流数据处理框架,在迭代计算与流数据处理上有明显的优势。本文将矩阵分解方法与Flink处理相结合,在原有的矩阵分解推荐算法的基础上,提出一种基于Flink的矩阵分解算法的优化模型,解决了矩阵分解在大数据环境下的瓶颈。 Although progress and rapid development of the Internet also brought a lot of network data flow, the following is the comprehensive storage of data, data comprehensive calculation and data analysis and many other problems. With the complexity and diversification of various business systems, the requirements for the effectiveness of data analysis have become increasingly high. In the past, most offline analysis commonly used is no longer applicable to today’s production needs. Now the data recommendation system is requested to have a higher demand in real time. As a popular recommendation algorithm at present, the recommendation algorithm based on matrix decomposition is obviously superior to other algorithms in terms of accuracy and accuracy of prediction. However, the traditional matrix decomposition method has the problems of slow computation speed and insufficient computation resources when dealing with large-scale data. As a popular streaming data processing framework, Flink big data framework has obvious advantages in iterative computation and streaming data processing. In this paper, matrix decomposition method is combined with Flink processing. On the basis of the original matrix decomposition recommendation algorithm, an optimization model of matrix decomposition algorithm based on Flink is proposed to solve the bottleneck of matrix decomposition in the big data environment.
出处 《计算机科学与应用》 2021年第6期1783-1790,共8页 Computer Science and Application
  • 相关文献

参考文献11

二级参考文献92

  • 1宋枫溪,程科,杨静宇,刘树海.最大散度差和大间距线性投影与支持向量机[J].自动化学报,2004,30(6):890-896. 被引量:58
  • 2张光卫,李德毅,李鹏,康建初,陈桂生.基于云模型的协同过滤推荐算法[J].软件学报,2007,18(10):2403-2411. 被引量:199
  • 3Koren Y, Bell R, Volinsky C. Matrix Factorization Techniques for Recommender Systems[J]. Computer, 2009,42 (8) : 30-37.
  • 4Bell R M,Koren Y. Scalable Collaborative Filtering with Jointly Derived Neighborhood Interpolation Weights[C]//Proc of the 7th IEEE International Conference on Data Mining. Omaha NE, USA: IEEE, 2007: 43-52.
  • 5Takacs G, Pilaszy I, Nemeth B, et al. Matrix Factorization and Neighbor Based Algorithms the Netflix Prize Problem [C]// Proceedings of the 2008 ACM conference on Recommender sys- tems. Lausanne, Switzerland: ACM, 2008 : 267 274.
  • 6Zhou Y, Wilkinson D, Schreiber R, et al. Large-Scale ParallelCollaborative Filtering for the Netflix Prize[C]//Proc of the 4th international conference on Algorithmic Aspects in Information and Management. 2008.
  • 7Dean J,Ghemawat S. MapReduee: Simplified Data Processing on Large Clusters[J]. Communication of the ACM 50: anniversary issue, 2008,51 (1) : 107d 13.
  • 8Hadoop. Open-source software for reliable, scalable, distributed computing[-EB/OL], http://hadoop, apache, org/, 2011.
  • 9Mahout. Scalable machine learning and data mining[EB/OL]. http://mahout, apache, org, 2011.
  • 10Takacs G, Pliaszy I, Nemeth B, et al. Investigation of Various Matrix Factorization Methods for Large Recommender Systems [C]// Proc of the IEEE International Conference on Data Mi- ning Workshops. IEEE, 2008: 553-562.

共引文献151

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部