Computer Science ›› 2022, Vol. 49 ›› Issue (11A): 210900246-7.doi: 10.11896/jsjkx.210900246

• Artificial Intelligence • Previous Articles     Next Articles

EGOS-DST:Efficient Schema-guided Approach to One-step Dialogue State Tracking for Diverse Expressions

ZHU Ruo-chen1, YANG Chang-chun1, ZHANG Deng-hui2   

  1. 1 School of Computer Science and Artificial Intelligence,Changzhou University,Changzhou,Jiangsu 213159,China
    2 College of Information Science and Technology,Zhejiang Shuren University,Hangzhou 310015,China
  • Online:2022-11-10 Published:2022-11-21
  • About author:ZHU Ruo-chen,born in 1997,postgra-duate,is a member of China Computer Federation.His main research interests include natural language processing and dialogue system.
    ZHANG Deng-hui,born in 1970,master,professor,is a member of China Computer Federation.His main research interests include distributed service collaboration and natural language processing.
  • Supported by:
    Zhejiang Provincial Natural Science Foundation of China(LGF21F020024) and Youth Academic Team Project of Zhejiang Shuren University.

Abstract: Recent dialogue state tracking works have focused on the hybrid approach to balance the two extreme methods(i.e.,over-reliance on ontology and complete abandoning ontology).However,some special phenomena are ignored in these works.For instant,value sharing and recommendation acceptance.In addition,the widely used slot gate mechanism makes it difficult for the model to process slots in parallel and the mechanism also propagates errors to the slot value generation steps.This paper proposes a new hybrid approach that deals with four different phenomena,namely diverse value,unseen value,value sharing and recommendation acceptance.By modifying the candidate value set and the model input,our model can parallelly process slots in one step and no longer depend on the slot gate.Experimental results indicate that the model achieves 57.7% and 59.5% joint goal accuracy on the English dataset MultiWOZ 2.2 and 2.3,respectively,and reaches 68.1% on the Chinese dataset RiSAWOZ,with only 10ms infer time.Finally,the robustness of the model is analyzed.The results on MultiWOZ 2.2 show that the joint target accuracy rate is 55.4% when the recommendation error rate reaches 15%.

Key words: Task-oriented dialogue system, Dialogue state tracking, BERT, Parallel computing, Schema-guided dialogue

CLC Number: 

  • TP391.1
[1]RASTOGI A,ZANG X,SUNKARA S,et al.Schema-guided dialogue state tracking task at DSTC8[J].arXiv:2002.01359,2020.
[2]ZHONG V,XIONG C,SOCHER R.Global-Locally Self-Attentive Encoder for Dialogue State Tracking[C]//Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics(Volume 1:Long Papers).Melbourne:Association for Computational Linguistics,2018:1458-1467.
[3]SHAN Y,LI Z,ZHANG J,et al.A Contextual Hierarchical Attention Network with Adaptive Objective for Dialogue State Tracking[C]//Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.Online:Association for Computational Linguistics,2020:6322-6333.
[4]YE F,MANOTUMRUKSA J,ZHANG Q,et al.Slot Self-At-tentive Dialogue State Tracking[J].arXiv:2101.09374,2021.
[5]XU P,HU Q.An End-to-end Approach for Handling Unknown Slot Values in Dialogue State Tracking[C]//Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics(Volume 1:Long Papers).Melbourne:Association for Computational Linguistics,2018:1448-1457.
[6]CHAO G L,LANE I.Bert-dst:Scalable end-to-end dialoguestate tracking with bidirectional encoder representations from transformer[J].arXiv:1907.03040,2019.
[7]VINYALS O,FORTUNATO M,JAITLY N.Pointer Networks[C]//Advances in Neural Information Processing Systems.Montréal:Curran Associates,Inc.,2015:2692-2700.
[8]WANG Y,GUO Y,ZHU S.Slot Attention with Value Normalization for Multi-Domain Dialogue State Tracking[C]//Procee-dings of the 2020 Conference on Empirical Methods in Natural Language Processing(EMNLP).Online:Association for Computational Linguistics,2020:3019-3028.
[9]ZHU S,LI J,CHEN L,et al.Efficient Context and Schema Fusion Networks for Multi-Domain Dialogue State Tracking[C]//Findings of the Association for Computational Linguistics:EMNLP 2020.Online:Association for Computational Linguistics,2020:766-781.
[10]HAN T,LIU X,TAKANOBU R,et al.MultiWOZ 2.3:Amulti-domain task-oriented dataset enhanced with annotation corrections and co-reference annotation[J].arXiv:2010.05594,2020.
[11]ZANG X,RASTOGI A,SUNKARA S,et al.MultiWOZ 2.2:A Dialogue Dataset with Additional Annotation Corrections and State Tracking Baselines[C]//Proceedings of the 2nd Workshop on Natural Language Processing for Conversational AI.Online:Association for Computational Linguistics,2020:109-117.
[12]OUYANG Y,CHEN M,DAI X,et al.Dialogue State Tracking with Explicit Slot Connection Modeling[C]//Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.Online:Association for Computational Linguistics,2020:34-40.
[13]HECK M,NIEKERK C,LUBIS N,et al.TripPy:A Triple Copy Strategy for Value Independent Neural Dialog State Tracking[C]//Proceedings of the 21th Annual Meeting of the Special Interest Group on Discourse and Dialogue.1st virtual meeting:Association for Computational Linguistics,2020:35-44.
[14]HENDERSON M,THOMSON B,YOUNG S.Word-Based Dialog State Tracking with Recurrent Neural Networks[C]//Proceedings of the 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue(SIGDIAL).Philadelphia,PA:Association for Computational Linguistics,2014:292-299.
[15]ZILKA L,JURCICEK F.Incremental LSTM-based dialog state tracker[C]//2015 IEEE Workshop on Automatic Speech Recognition and Understanding(ASRU).Scottsdale,AZ:IEEE Press,2015:757-762.
[16]MRKŠIĆ N,Ó SÉAGHDHA D,WEN T H,et al.Neural Belief Tracker:Data-Driven Dialogue State Tracking[C]//Proceedings of the 55th Annual Meeting of the Association for Computa-tional Linguistics(Volume 1:Long Papers).Vancouver:Association for Computational Linguistics,2017:1777-1788.
[17]WU C-S,MADOTTO A,HOSSEINI-ASL E,et al.Transferable Multi-Domain State Generator for Task-Oriented Dialogue Systems[C]//Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics.Florence:Association for Computational Linguistics,2019:808-819.
[18]GOEL R,PAUL S,HAKKANI-TÜR D.Hyst:A hybrid ap-proach for flexible and accurate dialogue state tracking[J].ar-Xiv:1907.00883,2019.
[19]RASTOGI A,ZANG X,SUNKARA S,et al.Towards scalable multi-domain conversational agents:The schema-guided dialogue dataset[C]//Proceedings of the AAAI Conference on Artificial Intelligence.New York:AAAI Press,2020:8689-8696.
[20]ZHANG J,HASHIMOTO K,WU C S,et al.Find or Classify? Dual Strategy for Slot-Value Predictions on Multi-Domain Dialog State Tracking[C]//Proceedings of the Ninth Joint Conference on Lexical and Computational Semantics.Barcelona,Spain(Online):Association for Computational Linguistics,2020:154-167.
[21]KIM S,YANG S,KIM G,et al.Efficient Dialogue State Tracking by Selectively Overwriting Memory[C]//Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.Online:Association for Computational Linguistics,2020:567-582.
[22]ZENG Y,NIE J Y.Multi-domain dialogue state tracking based on state graph[J].arXiv:2010.11137,2020.
[23]HU J,YANG Y,CHEN C,et al.SAS:Dialogue State Tracking via Slot Attention and Slot Information Sharing[C]//Procee-dings of the 58th Annual Meeting of the Association for Computational Linguistics.Online:Association for Computational Linguistics,2020:6366-6375.
[24]VASWANI A,SHAZEER N,PARMAR N,et al.Attention isAll you Need[C]//Advances in Neural Information Processing Systems.Long Beach,CA:Curran Associates,Inc.,2017:5998-6008.
[25]DEVLIN J,CHANG M W,LEE K,et al.BERT:Pre-training of Deep Bidirectional Transformers for Language Understanding[C]//Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies,Volume 1(Long and Short Papers).Minneapolis,Minnesota:Association for Computational Linguistics,2019:4171-4186.
[26]WU C S,HOI S C H,SOCHER R,et al.TOD-BERT:Pre-trained Natural Language Understanding for Task-Oriented Dialogue[C]//Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing(EMNLP).Online:Association for Computational Linguistics,2020:917-929.
[27]QUAN J,ZHANG S,CAO Q,et al.RiSAWOZ:A Large-Scale Multi-Domain Wizard-of-Oz Dataset with Rich Semantic Annotations for Task-Oriented Dialogue Modeling[C]//Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing(EMNLP).Online:Association for Computational Linguistics,2020:930-940.
[28]BOWMAN S R,VILNIS L,VINYALS O,et al.Generating Sentences from a Continuous Space[C]//Proceedings of The 20th SIGNLL Conference on Computational Natural Language Learning.Berlin:Association for Computational Linguistics,2016:10-21.
[29]KINGMA D P,BA J.Adam:A method for stochastic optimization[J].arXiv:1412.6980,2014.
[30]HOWARD J,RUDER S.Universal Language Model Fine-tuning for Text Classification[C]//Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics(Volume 1:Long Papers).Melbourne:Association for Computational Linguistics,2018:328-339.
[31]REN L,NI J,MCAULEY J.Scalable and Accurate DialogueState Tracking via Hierarchical Sequence Generation[C]//Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing(EMNLP-IJCNLP).Hong Kong:Association for Computational Linguistics,2019:1876-1885.
[32]ZHOU L,SMALL K.Multi-domain dialogue state tracking as dynamic knowledge graph enhanced question answering[J].arXiv:1911.06192,2019.
[33]LEE H,LEE J,KIM T Y.SUMBT:Slot-Utterance Matching for Universal and Scalable Belief Tracking[C]//Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics.Florence:Association for Computational Linguistics,2019:5478-5483.
[34]FENG Y,WANG Y,LI H.A Sequence-to-Sequence Approach to Dialogue State Tracking[J].arXiv:2011.09553,2020.
[35]QUAN J,XIONG D.Modeling Long Context for Task-Oriented Dialogue State Generation[C]//Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.Online:Association for Computational Linguistics,2020:7119-7124.
[36]SANH V,DEBUT L,CHAUMOND J,et al.DistilBERT,a distilled version of BERT:smaller,faster,cheaper and lighter[J].arXiv:1910.01108,2019.
[1] KANG Yan, WU Zhi-wei, KOU Yong-qi, ZHANG Lan, XIE Si-yu, LI Hao. Deep Integrated Learning Software Requirement Classification Fusing Bert and Graph Convolution [J]. Computer Science, 2022, 49(6A): 150-158.
[2] YU Ben-gong, ZHANG Zi-wei, WANG Hui-ling. TS-AC-EWM Online Product Ranking Method Based on Multi-level Emotion and Topic Information [J]. Computer Science, 2022, 49(6A): 165-171.
[3] YU Jia-qi, KANG Xiao-dong, BAI Cheng-cheng, LIU Han-qing. New Text Retrieval Model of Chinese Electronic Medical Records [J]. Computer Science, 2022, 49(6A): 32-38.
[4] GUO Yu-xin, CHEN Xiu-hong. Automatic Summarization Model Combining BERT Word Embedding Representation and Topic Information Enhancement [J]. Computer Science, 2022, 49(6): 313-318.
[5] CHEN Xin, LI Fang, DING Hai-xin, SUN Wei-ze, LIU Xin, CHEN De-xun, YE Yue-jin, HE Xiang. Parallel Optimization Method of Unstructured-grid Computing in CFD for DomesticHeterogeneous Many-core Architecture [J]. Computer Science, 2022, 49(6): 99-107.
[6] WEI Ru-ming, CHEN Ruo-yu, LI Han, LIU Xu-hong. Analysis of Technology Trends Based on Deep Learning and Text Measurement [J]. Computer Science, 2022, 49(11A): 211100119-6.
[7] CHEN Zi-zhuo, LIN Xi, WANG Zhong-qing. Stance Detection Based on Argument Boundary Recognition [J]. Computer Science, 2022, 49(11A): 210800180-5.
[8] HUANG Jia-wei, LI Xiao-peng, LING Cheng. Scalable Parallel Computing Method for Conditional Likelihood Probability of Nucleotide Molecular Phylogenetic Tree Based on GPU [J]. Computer Science, 2022, 49(11A): 210800189-7.
[9] CHENG Si-wei, GE Wei-yi, WANG Yu, XU Jian. BGCN:Trigger Detection Based on BERT and Graph Convolution Network [J]. Computer Science, 2021, 48(7): 292-298.
[10] FU Tian-hao, TIAN Hong-yun, JIN Yu-yang, YANG Zhang, ZHAI Ji-dong, WU Lin-ping, XU Xiao-wen. Performance Skeleton Analysis Method Towards Component-based Parallel Applications [J]. Computer Science, 2021, 48(6): 1-9.
[11] HE Ya-ru, PANG Jian-min, XU Jin-long, ZHU Yu, TAO Xiao-han. Implementation and Optimization of Floyd Parallel Algorithm Based on Sunway Platform [J]. Computer Science, 2021, 48(6): 34-40.
[12] YAO Dong, LI Zhou-jun, CHEN Shu-wei, JI Zhen, ZHANG Rui, SONG Lei, LAN Hai-bo. Task-oriented Dialogue System and Technology Based on Deep Learning [J]. Computer Science, 2021, 48(5): 232-238.
[13] DONG Zhe, SHAO Ruo-qi, CHEN Yu-liang, ZHAI Wei-feng. Named Entity Recognition in Food Field Based on BERT and Adversarial Training [J]. Computer Science, 2021, 48(5): 247-253.
[14] LI Fan, YAN Xing, ZHANG Xiao-yu. Optimization of GPU-based Eigenface Algorithm [J]. Computer Science, 2021, 48(4): 197-204.
[15] SHANG Xi-xue, HAN Hai-ting, ZHU Zheng-zhou. Mechanism Design of Right to Earnings of Data Utilization Based on Evolutionary Game Model [J]. Computer Science, 2021, 48(3): 144-150.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!