Flink Real-Time Recommendation Strategy Based on Matrix Decomposition (基于矩阵分解的Flink实时推荐策略)

Abstract: The rapid growth of the Internet has produced massive network data streams, and with them problems of data storage, computation, and analysis. As business systems grow more complex and varied, data analysis must deliver results ever more promptly. The offline analysis that used to be common no longer meets many of today's production needs, and recommendation systems now face stricter real-time requirements. Recommendation algorithms based on matrix decomposition are among the most popular at present and clearly outperform other algorithms in both prediction accuracy and prediction precision. However, traditional matrix decomposition methods compute slowly and run short of computing resources on large-scale data. Flink, a popular framework for processing streaming data, has clear advantages in iterative computation and stream processing. This paper combines matrix decomposition with Flink: building on the original matrix decomposition recommendation algorithm, it proposes an optimized Flink-based matrix decomposition model that addresses the bottleneck of matrix decomposition in big data environments.
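For orientation, the kind of matrix decomposition the abstract refers to can be sketched on a single machine as follows. This is a minimal illustration, not the authors' Flink model: the toy ratings, the rank, the learning rate, the regularization weight, and all function names are assumptions, and plain stochastic gradient descent stands in for whatever optimizer the paper adopts.

```python
import random


def dot(a, b):
    """Inner product of two equal-length factor vectors."""
    return sum(x * y for x, y in zip(a, b))


def rmse(ratings, P, Q):
    """Root-mean-square error of the predictions P[u] . Q[i] on known ratings."""
    squared_error = sum((r - dot(P[u], Q[i])) ** 2 for u, i, r in ratings)
    return (squared_error / len(ratings)) ** 0.5


def init_factors(n_rows, rank, rng):
    """Random factor matrix with entries in [0, 1)."""
    return [[rng.random() for _ in range(rank)] for _ in range(n_rows)]


def sgd_epoch(ratings, P, Q, lr=0.01, reg=0.02):
    """One pass of stochastic gradient descent over all known ratings."""
    for u, i, r in ratings:
        err = r - dot(P[u], Q[i])
        for k in range(len(P[u])):
            pu, qi = P[u][k], Q[i][k]  # keep old values so both updates use them
            P[u][k] += lr * (err * qi - reg * pu)
            Q[i][k] += lr * (err * pu - reg * qi)


# Hypothetical toy data: (user, item, rating) triples for 4 users and 4 items.
ratings = [
    (0, 0, 5.0), (0, 1, 3.0), (1, 0, 4.0), (1, 2, 1.0),
    (2, 1, 1.0), (2, 3, 5.0), (3, 2, 5.0), (3, 3, 4.0),
]

rng = random.Random(0)          # fixed seed so runs are repeatable
P = init_factors(4, 2, rng)     # user factors, assumed rank 2
Q = init_factors(4, 2, rng)     # item factors, assumed rank 2

before = rmse(ratings, P, Q)
for _ in range(2000):
    sgd_epoch(ratings, P, Q)
after = rmse(ratings, P, Q)

print(f"RMSE before: {before:.3f}, after: {after:.3f}")
```

Every epoch revisits every known rating, which is the iterative workload the abstract describes as slow and resource-hungry on large-scale data.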

Authors: Xie Rongzhen (谢荣臻), Chen Yuandong (陈源东), Bai Qiaoluan (白巧娈), Luo Jinyan (罗金炎)

Affiliation: School of Mathematics and Data Science, Minjiang University (闽江学院数学与数据科学学院)

Source: Computer Science and Application (《计算机科学与应用》), 2021, No. 6, pp. 1783-1790 (8 pages)

Keywords: Flink; big data; real-time computing; stream processing

Classification: TP311.13 [Automation and Computer Technology: Computer Software and Theory]
