Computer Science ›› 2021, Vol. 48 ›› Issue (1): 226-232.doi: 10.11896/jsjkx.191200098

• Artificial Intelligence • Previous Articles     Next Articles

Deep Interest Factorization Machine Network Based on DeepFM

WANG Rui-ping, JIA Zhen, LIU Chang, CHEN Ze-wei, LI Tian-rui   

  1. School of Information Science and Technology,Southwest Jiaotong University,Chengdu 611756,China
  • Received:2019-12-16 Revised:2020-05-17 Online:2021-01-15 Published:2021-01-15
  • About author:WANG Rui-ping,born in 1995,postgraduate.Her main research interests include recommendation algorithm and natural language processing.
    LI Tian-rui,born in 1969,Ph.D,professor,Ph.D supervisor,is a distinguished member of China Computer Federation.His main research interests include big data intelligence,rough sets and granular computing.
  • Supported by:
    National Key R&D Program of China(2017YFB1401400).

Abstract: The recommendation system can sort out and display the information that may be of interest from the mass of information according to users' preferences.As deep learning has achieved good results in multiple research fields,it has also begun to be applied to recommendation systems.However,the current recommendation ranking algorithms based on deep learning often use Embedding & MLP mode and can only obtain high-level feature interactions.In order to solve the problem that only high-order feature interaction can be obtained,DeepFM adds FM to the above mode,which can learn the low-order and high-order feature interaction end-to-end.But the DeepFM cannot express the diversity of user interests.In view of this,this paper proposes a Deep Interest Factorization Machine Network(DIFMN) by introducing the multi-head attention mechanism into DeepFM.DIFMN can adaptively learn the user representation according to the different items to be recommended,showing the diversity of user intere-sts.In addition,the model adds preference representations according to the type of user's historical behaviors,so that it can be applied not only to tasks that record only historical behaviors that the user likes,but also to tasks that record both historical beha-viors that the user likes and dislikes.This paper uses tensorflow-gpu to implement the algorithm,and performs comparative tests on two public datasets of Amazon(Electronics) and movieLen-20 m.Experiment results show that RelaImprimproves by 17.70% and 35.24% respectively compared to DeepFM,which validates the feasibility and effectiveness of the proposed method.

Key words: CTR prediction, Deep learning, DeepFM, Multi-head attention mechanism, Recommendation algorithm, User interest modeling

CLC Number: 

  • TP391
[1] MARZ N,WARREN J.Big Data:Principles and best practices of scalable realtime data systems[M].Manning Publications,2015.
[2] RICCI F,ROKACH L,SHAPIRA B.Introduction to Recom-mender Systems Handbook[M]//Recommender Systems Handbook.Boston:Springer,2011:1-35.
[3] YU C J,ZHUANG Y,WEI S C,et al.Field-aware factorization machines for CTR prediction[C]//Proceedings of the 10th ACM Conference on Recommender Systems.ACM,2016:43-50.
[4] COVINGTON P,ADAMS J,SARGIN E.Deep neural networks for youtube recommendations[C]//Proceedings of the 10th ACM Conference on Recommender Systems.ACM,2016:191-198.
[5] ZHOU G R,ZHU X Q,SONG C R,et al.Deep interest network for click-through rate prediction[C]//Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining.ACM,2018:1059-1068.
[6] CHEN Q,ZHAO H,LI W,et al.Behavior Sequence Transformer for E-commerce Recommendation in Alibaba[J].arXiv:1905.06874,2019.
[7] ZHOU G R,MOU N,FAN Y,et al.Deep interest evolution network for click-through rate prediction[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2019:5941-5948.
[8] CHENG H T,KOC L,HARMSEN J,et al.Wide & deep learning for recommender systems[C]//Proceedings of the 1st Workshop on Deep Learning for Recommender Systems.ACM,2016:7-10.
[9] GUO H F,TANG R M,YE Y M,et al.DeepFM:a factorization-machine based neural network for CTR prediction[C]//Proceedings of the 26th International Joint Conference on Artificial Intelligence.AAAI Press,2017:1725-1731.
[10] RENDLE S.Factorization machines[C]//Proceedings of 2010 IEEE International Conference on Data Mining.IEEE,2010:995-1000.
[11] VASWANI A,SHAZEER N,PARMAR N,et al.Attention isall you need[C]//Advances in Neural Information Processing Systems.2017:5998-6008.
[12] MCAULEY J,TARGETT C,SHI Q,et al.Image-based recommendations on styles and substitutes[C]//Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval.ACM,2015:43-52.
[13] HE R,MCAULEY J.Ups and downs:Modeling the visual evolution of fashion trends with one-class collaborative filtering[C]//Proceedings of the 25th International Conference on World Wide Web.2016:507-517.
[14] HARPER F M,KONSTAN J A.The movielens datasets:His-tory and context[J].ACM Transactions on Interactive Intelligent Systems,2015,5(4):19.
[15] QU Y,CAI H,REN K,et al.Product-based neural networks for user response prediction[C]//Proceedings of 2016 IEEE 16th International Conference on Data Mining.2016:1149-1154.
[16] ZHU H,JIN J,TAN C,et al.Optimized cost per click in taobao display advertising[C]//Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.ACM:2191-2200.
[17] YAN L,LI W J,XUE G R,et al.Coupled group lasso for web-scale ctr prediction in display advertising[C]//Proceedings of International Conference on Machine Learning.2014:802-810.
[18] RICHARDSON M,DOMINOWSKA E,RAGNO R.Predicting clicks:estimating the click-through rate for new ads[C]//Proceedings of the 16th International Conference on World Wide Web.ACM,2007:521-530.
[19] FENG YF,LV F Y,SHEN W C,et al.Deep session interest network for click-through rate prediction[C]//Proceedings of 28th International Joint Conference on Artificial Intelligence.2019.
[20] ZHU H,LI X,ZhANG P,et al.Learning tree-based deep model for recommender systems[C]//Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining.2018:1079-1088.
[1] XU Yong-xin, ZHAO Jun-feng, WANG Ya-sha, XIE Bing, YANG Kai. Temporal Knowledge Graph Representation Learning [J]. Computer Science, 2022, 49(9): 162-171.
[2] RAO Zhi-shuang, JIA Zhen, ZHANG Fan, LI Tian-rui. Key-Value Relational Memory Networks for Question Answering over Knowledge Graph [J]. Computer Science, 2022, 49(9): 202-207.
[3] TANG Ling-tao, WANG Di, ZHANG Lu-fei, LIU Sheng-yun. Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy [J]. Computer Science, 2022, 49(9): 297-305.
[4] SUN Qi, JI Gen-lin, ZHANG Jie. Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection [J]. Computer Science, 2022, 49(8): 172-177.
[5] WANG Jian, PENG Yu-qi, ZHAO Yu-fei, YANG Jian. Survey of Social Network Public Opinion Information Extraction Based on Deep Learning [J]. Computer Science, 2022, 49(8): 279-293.
[6] HAO Zhi-rong, CHEN Long, HUANG Jia-cheng. Class Discriminative Universal Adversarial Attack for Text Classification [J]. Computer Science, 2022, 49(8): 323-329.
[7] JIANG Meng-han, LI Shao-mei, ZHENG Hong-hao, ZHANG Jian-peng. Rumor Detection Model Based on Improved Position Embedding [J]. Computer Science, 2022, 49(8): 330-335.
[8] HU Yan-yu, ZHAO Long, DONG Xiang-jun. Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification [J]. Computer Science, 2022, 49(7): 73-78.
[9] CHENG Cheng, JIANG Ai-lian. Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction [J]. Computer Science, 2022, 49(7): 120-126.
[10] HOU Yu-tao, ABULIZI Abudukelimu, ABUDUKELIMU Halidanmu. Advances in Chinese Pre-training Models [J]. Computer Science, 2022, 49(7): 148-163.
[11] ZHOU Hui, SHI Hao-chen, TU Yao-feng, HUANG Sheng-jun. Robust Deep Neural Network Learning Based on Active Sampling [J]. Computer Science, 2022, 49(7): 164-169.
[12] SU Dan-ning, CAO Gui-tao, WANG Yan-nan, WANG Hong, REN He. Survey of Deep Learning for Radar Emitter Identification Based on Small Sample [J]. Computer Science, 2022, 49(7): 226-235.
[13] LIU Wei-ye, LU Hui-min, LI Yu-peng, MA Ning. Survey on Finger Vein Recognition Research [J]. Computer Science, 2022, 49(6A): 1-11.
[14] SUN Fu-quan, CUI Zhi-qing, ZOU Peng, ZHANG Kun. Brain Tumor Segmentation Algorithm Based on Multi-scale Features [J]. Computer Science, 2022, 49(6A): 12-16.
[15] KANG Yan, XU Yu-long, KOU Yong-qi, XIE Si-yu, YANG Xue-kun, LI Hao. Drug-Drug Interaction Prediction Based on Transformer and LSTM [J]. Computer Science, 2022, 49(6A): 17-21.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!