计算机科学 ›› 2013, Vol. 40 ›› Issue (1): 19-21.

• 综述 • 上一篇    下一篇

基于MapReduce的矩阵分解推荐算法研究

张 宇,程久军   

  1. (同济大学电子与信息工程学院 上海201804);(同济大学嵌入式系统与服务计算教育部重点实验室 上海201804)
  • 出版日期:2018-11-16 发布日期:2018-11-16

Study on Recommendation Algorithm with Matrix Factorization Method Based on MapReduce

  • Online:2018-11-16 Published:2018-11-16

摘要: 矩阵分解是近几年提出的一种协同过滤推荐技术,但其每项预测评分的计算都要综合大量评分数据,同时在计算时还需要存储庞大的特征矩阵,用单一结点来进行推荐将会遇到计算时间和计算资源瓶颈。结合MapReduce分布式计算框架和矩阵分解推荐算法,设计了一种基于MapRcducc的矩阵分解推荐算法来解决该问题,利用Hadoop的分布式缓存技术和MapFilc文件结构解决了大特征矩阵在多结点间的高效共享问题并实现了多正则因子的并行处理。通过在Netflix数据集上的实验表明,该MapReduce算法及数据存储方案能带来较高的加速比,从而提高了推荐算法的计算效率。

关键词: 协同过滤推荐,矩阵分解推荐,MapReduce, Hadoop

Abstract: Matrix factorization is a collaborative filtering recommendation technique proposed in recent years. In the process of recommendation, each prediction depends on the collaboration of the whole known rating set and the feature matrices need huge storage. So the recommendation with only one node will meet the bottleneck of time and resource. A MapReduce-based matrix factorization recommendation algorithm was proposed to solve this problem. The big feature matrices were shared by Hadoop distributed cache and MapFile techniques.The MapReduce algorithm could also handle multi-? situation. The experiment on Netflix data set shows that the MapReduce-based algorithm has high speedup and improves the efficiency of collaborative filtering.

Key words: Collaborative filtering recommendation, Matrix factorization, MapReduce, Hadoop

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!