基于回复生成的对话意图预测

doi:10.11896/jsjkx.200700137

Abstract

Abstract: With the continuous development of the human-machine dialogue system,it is of great significance for the computer to accurately understand the speaker's dialogue act and predict the act of response according to the history information of the dialogue.Previous research work focus on act prediction of responses based on dialogue text and existing labels.But in many scena-rios,the reply has not been generated.Therefore,this paper proposes a dialogue act prediction model based on reply generation.In the generation part,the Seq2Seq structure is used to generate text based on the conversation history information as text information for future replies in the conversation;in the classification part,the LSTM model is used to express the generated reply text and the existing conversation information as clause level representations.Combined with the attention mechanism,it highlights the connection between the dialogue sentence of the same round and the generated response.The experimental results show that the proposed model a chieves a 2.54% F1-score improvement compared to the simple baseline model,and the joint training method contributes to the improvement of model performance.

Key words: Attention mechanism, Dialogue act, Prediction model, Text generation

CLC Number:

TP391

WANG Bo-yu, WANG Zhong-qing, ZHOU Guo-dong. Dialogue Act Prediction Based on Response Generation[J].Computer Science, 2021, 48(2): 212-216.

References

[1] TUR G,CELIKYILMAZ A,HAKKANI-TÜR D.Latent se-mantic modeling for slot filling in conversational understanding[C]//2013 IEEE International Conference on Acoustics,Speech and Signal Processing.IEEE,2013:8307-8311.
[2] EYBEN F,WÖLLMER M,GRAVES A,et al.On-line emotion recognition in a 3-D activation-valence-time continuum using acoustic and linguistic cues[J].Journal on Multimodal User Interfaces,2010,3(1／2):7-19.
[3] LIU J,LI Y L,LIN M.Review of Intent Detection Methods in Human-Machine Dialogue System[J].Computer Engineering and Applications,2019,55(12):1-7,43.
[4] LI W B,ZHANG L,SHU X.Application and Research on Generative Automatic Question Answering System Based on Seq2Seq [J].Modern Computer,2017(36):59-62.
[5] HOCHREITER S,SCHMIDHUBER J.Long short-term memory[J].Neural Computation,1997,9(8):1735-1780.
[6] BAHDANAU D,CHO K,BENGIO Y.Neural machine translation by jointly learning to align and translate[J/OL].https://arxiv.org/abs/1409.0473.
[7] KALCHBRENNER N,BLUNSOM P.Recurrent Convolutional Neural Networks for Discourse Compositionality[C]//Procee-dings of the Workshop on Continuous Vector Space Models and their Compositionality.2013:119-126.
[8] MIKOLOV T,KARAFIT M,BURGET L,et al.Recurrent neural network based language model [C]//Proceedings of 11th Annual Conference of the International Speech Communication Association.2010:1045-1048.
[9] KHANPOUR H,GUNTAKANDLA N,NIELSEN R.Dialogue act classification in domain-independent conversations using a deep recurrent neural network[C]//Proceedings of COLING 2016,the 26th International Conference on Computational Linguistics.2016:2012-2021.
[10] KUMAR H,AGARWAL A,DASGUPTA R,et al.Dialogue act sequence labeling using hierarchical encoder with crf[C]//Thirty-Second AAAI Conference on Artificial Intelligence.2018:3440-3447.
[11] LAFFERTY J.Conditional random fields:Probabilistic modelsfor segmenting and labeling sequence data[C]//Proceedings of the 18th International Conference on Machine Learning.2001.
[12] RAHEJA V,TETREAULT J.Dialogue Act Classification with Context-Aware Self-Attention[C]//Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies.2019:3727-3733.
[13] LI R,LIN C,COLLINSON M,et al.A Dual-Attention Hierarchical Recurrent Neural Networkfor Dialogue Act Classification[C]//Proceedings of the 23rd Conference on Computational Natural Language Learning.ACL,2019:383-392.
[14] TANAKA K,TAKAYAMA J,ARASE Y.Dialogue-Act Prediction of Future Responses Based on Conversation History[C]//Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics:Student Research Workshop.2019:197-202.
[15] CHO K,VAN MERRI$\widetilde{E}$NBOER B,GULCEHRE C,et al.Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation[C]//Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP).2014:1724-1734.
[16] FREITAG M,AL-ONAIZAN Y.Beam Search Strategies forNeural Machine Translation[J].Association for Computational Linguistics,2017,2017:56.
[17] VASWANI A,SHAZEER N,PARMAR N,et al.Attention isall you need[C]//Advances in Neural Information Processing Systems.2017:5998-6008.
[18] LI Y,SU H,SHEN X,et al.DailyDialog:A Manually Labelled Multi-turn Dialogue Dataset[C]//Procee-dings of the Eighth International Joint Conference on Natural Language Processing.2017:986-995.
[19] PAPINENI K,ROUKOS S,WARD T,et al.BLEU:a method for automatic evaluation of machine translation[C]//Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics.2002:311-318.

Related Articles 15

[1]	RAO Zhi-shuang, JIA Zhen, ZHANG Fan, LI Tian-rui. Key-Value Relational Memory Networks for Question Answering over Knowledge Graph [J]. Computer Science, 2022, 49(9): 202-207.
[2]	ZHOU Fang-quan, CHENG Wei-qing. Sequence Recommendation Based on Global Enhanced Graph Neural Network [J]. Computer Science, 2022, 49(9): 55-63.
[3]	DAI Yu, XU Lin-feng. Cross-image Text Reading Method Based on Text Line Matching [J]. Computer Science, 2022, 49(9): 139-145.
[4]	ZHOU Le-yuan, ZHANG Jian-hua, YUAN Tian-tian, CHEN Sheng-yong. Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion [J]. Computer Science, 2022, 49(9): 155-161.
[5]	XIONG Li-qin, CAO Lei, LAI Jun, CHEN Xi-liang. Overview of Multi-agent Deep Reinforcement Learning Based on Value Factorization [J]. Computer Science, 2022, 49(9): 172-182.
[6]	JIANG Meng-han, LI Shao-mei, ZHENG Hong-hao, ZHANG Jian-peng. Rumor Detection Model Based on Improved Position Embedding [J]. Computer Science, 2022, 49(8): 330-335.
[7]	ZHU Cheng-zhang, HUANG Jia-er, XIAO Ya-long, WANG Han, ZOU Bei-ji. Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism [J]. Computer Science, 2022, 49(8): 113-119.
[8]	SUN Qi, JI Gen-lin, ZHANG Jie. Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection [J]. Computer Science, 2022, 49(8): 172-177.
[9]	YAN Jia-dan, JIA Cai-yan. Text Classification Method Based on Information Fusion of Dual-graph Neural Network [J]. Computer Science, 2022, 49(8): 230-236.
[10]	WANG Ming, PENG Jian, HUANG Fei-hu. Multi-time Scale Spatial-Temporal Graph Neural Network for Traffic Flow Prediction [J]. Computer Science, 2022, 49(8): 40-48.
[11]	JIN Fang-yan, WANG Xiu-li. Implicit Causality Extraction of Financial Events Integrating RACNN and BiLSTM [J]. Computer Science, 2022, 49(7): 179-186.
[12]	XIONG Luo-geng, ZHENG Shang, ZOU Hai-tao, YU Hua-long, GAO Shang. Software Self-admitted Technical Debt Identification with Bidirectional Gate Recurrent Unit and Attention Mechanism [J]. Computer Science, 2022, 49(7): 212-219.
[13]	PENG Shuang, WU Jiang-jiang, CHEN Hao, DU Chun, LI Jun. Satellite Onboard Observation Task Planning Based on Attention Neural Network [J]. Computer Science, 2022, 49(7): 242-247.
[14]	ZHANG Ying-tao, ZHANG Jie, ZHANG Rui, ZHANG Wen-qiang. Photorealistic Style Transfer Guided by Global Information [J]. Computer Science, 2022, 49(7): 100-105.
[15]	ZENG Zhi-xian, CAO Jian-jun, WENG Nian-feng, JIANG Guo-quan, XU Bin. Fine-grained Semantic Association Video-Text Cross-modal Entity Resolution Based on Attention Mechanism [J]. Computer Science, 2022, 49(7): 106-112.

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed

Comments

Recommended 0

No Suggested Reading articles found!

Dialogue Act Prediction Based on Response Generation

PDF (PC)

Abstract

Cite this article

share this article

References

Related Articles 15

Metrics

Comments

Recommended 0