基于注意力LSTM的音乐主题推荐模型

Abstract

Abstract: Aiming at the problems of low classification accuracy,long period,and difficulty in meeting the demand for theme music in people’s life,an attention mechanism and LSTM (Long Short-Term Memory) were designed.Based on the neural network model,it consists of a music theme model and a music recommendation model.On the basis of using the attention mechanism and the LSTM network to realize music emotion classification,the music theme model effectively combines the audio codebook and the topic model to achieve Discrimination of a subcategory of music topics under an emotion.In the music recommendation model,a low-level descriptor and a spectrogram are used to construct a joint representation of manual features and Convolutional Recurrent Neural Network (CRNN) features.The emotions expressed by the user’s voice are obtained,and the user is given a precise music theme recommendation by using this mo-del.In the experiment,two models were designed separately,and two different traditional models were used as the baseline.The experimental results show thatthis model not only can improve the classification accuracy of the subject,but also can accurately judge the emotion of the user’s voice data,so as to achieve the recommendation of the theme music compared with the traditional single model.

Key words: Attention mechanism, Convolutional recurrent neural network, Long short-term memory network, Low-Level descriptor, Music theme recommendation, Topic model

CLC Number:

TP183

JIA Ning, ZHENG Chun-jun. Model of Music Theme Recommendation Based on Attention LSTM[J].Computer Science, 2019, 46(11A): 230-235.

References

[1]VELARDE G,CHACÓN C C,MEREDITH D,et al.Convolution-based classification of audio and symbolic representations of music[J].Journal of New Music Research,2018:1-15.
[2]LAKOMKIN E,ZAMANI M A,Weber C,et al.EmoRL:Continuous Acoustic Emotion Classification using Deep Reinforcement Learning[C]∥ICRA’18.2018.
[3]HE H,XIA R.Joint Binary Neural Network for Multi-labelLearning with Applications to Emotion Classification∥Natural Language Processing and Chinese Computing.2018.
[4]RAJANNA A R,ARYAFAR K,SHOKOUFANDEH A,et al.Deep Neural Networks:A Case Study for Music Genre Classification[C]∥IEEE International Conference on Machine Lear-ning & Applications.IEEE,2015.
[5]TRABELSII,AYED D B.On the Use of Different Feature Extraction Methods for Linear and Non Linear kernels[C]∥2012 6th International Conference on Sciences of Electronics,Technologies of Information and Telecommunications (SETIT). IEEE,2012.
[6]LI T,OGIHARA M,LI Q.A Comparative Study on Content-Based Music Genre Classification[C]∥International AcmSigir Conference on Research & Development in Informaion Retrie-val.ACM,2003.
[7]LEE K K,PARK K S.Robust Feature Extraction for Automatic Classification of Korean Traditional Music in Digital Library[C]∥International Conference on Asian Digital Libraries:Implementing Strategies & Sharing Experiences.Springer-Verlag,2005.
[8]DU W,LIN H,SUN J,et al.A new hierarchical method for music genre classification[C]∥International Congress on Image & Signal Processing.IEEE,2017.
[9]DEB S,DANDAPAT S.Multiscale Amplitude Feature and Significance of Enhanced Vocal Tract Information for Emotion Classification[J].IEEE Transactions on Cybernetics,2018,PP(99):1-14.
[10]HUANG Y S,CHOU S Y,YANG Y H.Pop Music Highligh-ter:Marking the Emotion Keypoints∥Audio and Speech Processing.2018.
[11]MIRSAMADI S,BARSOUM E,ZHANG C.Automatic Speech Emotion Recognition Using Recurrent Neural Networks with Local Attention[C]∥ICASSP.IEEE,2017.
[12]BERTIN-MAHIEUX T,ELLIS D P.Large-scale cover songrecognition using hashed chroma landmarks[C]∥2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).IEEE,2011:117-120.
[13]VAN DEN OORD A,DIELEMAN S,ZEN H,et al.Wavenet:A generative model for raw audio[C]∥SSW.2016:125.
[14]EZZAT S,EL GAYAR N,GHANEM M M.Sentiment analysis of call centre audio conversations using text classification[J].International Journal of Computer Information Systems and Industrial Management Applications,2012,4(1): 619-627.
[15]PALKAR V V,JOEG P.Proposing scalable method for musicgenre classification[C]∥International Conference on Inventive Computation Technologies.2017.
[16]韩文静,李海峰,阮华斌.语音情感识别研究进展综述[J].软件学报,2014,25(1):37-50.
[17]PALO H K,MOHANTY M N,CHANDRA M.Computational Vision and Robotics[J].Advances in Intelligent Systems and Computing,2015,332:63-70.
[18]RODDY C.Emotion recognition in human-computer interaction[J].Signal Processing Magazine,IEEE,2001,18(1):32-80.
[19]DAVIES M E P,DEGARA N,PLUMBLEY M D.Measuringthe Performance of Beat Tracking Algorithms Using a Beat Error Histogram[J].IEEE Signal Processing Letters,2011,18(3):157-160.
[20]YUAN C,GLASS J.Speech2Vec:A Sequence-to-SequenceFramework for Learning Word Embedding from Speech[C]∥Interspeech.2018.

Related Articles 15

[1]	ZHOU Fang-quan, CHENG Wei-qing. Sequence Recommendation Based on Global Enhanced Graph Neural Network [J]. Computer Science, 2022, 49(9): 55-63.
[2]	DAI Yu, XU Lin-feng. Cross-image Text Reading Method Based on Text Line Matching [J]. Computer Science, 2022, 49(9): 139-145.
[3]	ZHOU Le-yuan, ZHANG Jian-hua, YUAN Tian-tian, CHEN Sheng-yong. Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion [J]. Computer Science, 2022, 49(9): 155-161.
[4]	XIONG Li-qin, CAO Lei, LAI Jun, CHEN Xi-liang. Overview of Multi-agent Deep Reinforcement Learning Based on Value Factorization [J]. Computer Science, 2022, 49(9): 172-182.
[5]	RAO Zhi-shuang, JIA Zhen, ZHANG Fan, LI Tian-rui. Key-Value Relational Memory Networks for Question Answering over Knowledge Graph [J]. Computer Science, 2022, 49(9): 202-207.
[6]	ZHU Cheng-zhang, HUANG Jia-er, XIAO Ya-long, WANG Han, ZOU Bei-ji. Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism [J]. Computer Science, 2022, 49(8): 113-119.
[7]	SUN Qi, JI Gen-lin, ZHANG Jie. Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection [J]. Computer Science, 2022, 49(8): 172-177.
[8]	YAN Jia-dan, JIA Cai-yan. Text Classification Method Based on Information Fusion of Dual-graph Neural Network [J]. Computer Science, 2022, 49(8): 230-236.
[9]	WANG Ming, PENG Jian, HUANG Fei-hu. Multi-time Scale Spatial-Temporal Graph Neural Network for Traffic Flow Prediction [J]. Computer Science, 2022, 49(8): 40-48.
[10]	WANG Xin-tong, WANG Xuan, SUN Zhi-xin. Network Traffic Anomaly Detection Method Based on Multi-scale Memory Residual Network [J]. Computer Science, 2022, 49(8): 314-322.
[11]	JIANG Meng-han, LI Shao-mei, ZHENG Hong-hao, ZHANG Jian-peng. Rumor Detection Model Based on Improved Position Embedding [J]. Computer Science, 2022, 49(8): 330-335.
[12]	JIN Fang-yan, WANG Xiu-li. Implicit Causality Extraction of Financial Events Integrating RACNN and BiLSTM [J]. Computer Science, 2022, 49(7): 179-186.
[13]	XIONG Luo-geng, ZHENG Shang, ZOU Hai-tao, YU Hua-long, GAO Shang. Software Self-admitted Technical Debt Identification with Bidirectional Gate Recurrent Unit and Attention Mechanism [J]. Computer Science, 2022, 49(7): 212-219.
[14]	PENG Shuang, WU Jiang-jiang, CHEN Hao, DU Chun, LI Jun. Satellite Onboard Observation Task Planning Based on Attention Neural Network [J]. Computer Science, 2022, 49(7): 242-247.
[15]	ZHANG Ying-tao, ZHANG Jie, ZHANG Rui, ZHANG Wen-qiang. Photorealistic Style Transfer Guided by Global Information [J]. Computer Science, 2022, 49(7): 100-105.

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed

Comments

Recommended 0

No Suggested Reading articles found!

Model of Music Theme Recommendation Based on Attention LSTM

PDF (PC)

Abstract

Cite this article

share this article

References

Related Articles 15

Metrics

Comments

Recommended 0