计算机科学 ›› 2024, Vol. 51 ›› Issue (9): 310-318.doi: 10.11896/jsjkx.230600217

• 人工智能 • 上一篇    下一篇

CCSD:面向话题的讽刺识别方法

刘其龙, 李弼程, 黄志勇   

  1. 华侨大学计算机科学与技术学院 福建 厦门 361000
  • 收稿日期:2023-06-29 修回日期:2023-10-22 出版日期:2024-09-15 发布日期:2024-09-10
  • 通讯作者: 李弼程(lbclm@163.com)
  • 作者简介:(21014083073@stu.hqu.edu.cn)
  • 基金资助:
    装备预研教育部联合基目(8091B022150)

CCSD:Topic-oriented Sarcasm Detection

LIU Qilong, LI Bicheng, HUANG Zhiyong   

  1. School of Computer Science and Technology,Huaqiao University,Xiamen,Fujian 361000,China
  • Received:2023-06-29 Revised:2023-10-22 Online:2024-09-15 Published:2024-09-10
  • About author:LIU Qilong,Born in 1999,postgra-duate.His main research interests include natural language processing,network public opinion knowledge graph and graph neural network.
    LI Bicheng,born in 1970,Ph.D,professor,Ph.D supervisor.His main research interests include intelligent information processing,network ideological security,network public opinion monitoring and guidance,and big data analysis and mining.
  • Supported by:
    Joint Fund of Ministry of Education for Equipment Pre-research(8091B022150).

摘要: 随着社交媒体的发展,越来越多的人在社交平台上发表对热点话题的看法,其中讽刺手法的运用严重影响了社交媒体中情感分析的精度。目前面向话题的讽刺识别研究未同时考虑上下文和常识知识的作用,也忽略了在同一个话题下进行讽刺识别的场景。为此,提出了基于上下文和常识的讽刺识别模型(Sarcasm Detection with Context and Common Sense,CCSD)。首先,模型使用C3KG常识库生成常识文本,并将目标句、话题上下文和常识文本作为预训练BERT模型的输入。其次,使用注意力机制来关注目标句和常识中重要的信息。最后,通过门控机制和特征融合,实现讽刺识别。文中构建了一个面向话题的讽刺识别数据集,以验证模型在特定话题中的有效性。实验结果表明,相比基线模型,新模型的性能更优。

关键词: 讽刺识别, 面向话题的讽刺识别, 上下文, 常识知识, 注意力机制

Abstract: With the development of social media,an increasing number of people express their opinions about hot topics on social platforms,and the utilization of sarcastic expression has severely affected the accuracy of sentiment analysis in social media.Currently,topic-oriented sarcasm detection research does not consider the role of context and common sense knowledge simultaneously,and also ignores the scene of sarcasm recognition under the same topic.This paper proposes a sarcasm detection with context and commonsense(CCSD)approach.Firstly,the model uses the C3KG commonsense knowledge base to generate commonsense text.Then,the target sentence,topic context,and commonsense text are concatenated as the input to the pre-training BERT model.In addition,an attention mechanism is used to focus on important information in the target sentence and commonsense text.Finally,sarcasm detections are realized through gating mechanism and feature fusion.A topic-oriented sarcasm detection dataset is constructed to verify the effectiveness of the proposed model in specific topics.Experimental results show that the proposed model achieves better performance compared to baseline models.

Key words: Sarcasm detection, Topic-oriented sarcasm detection, Context, Common sense knowledge, Attention mechanism

中图分类号: 

  • TP391
[1]PANG B,LEE L.Opinion mining and sentiment analysis[J].Foundations and Trends© in information retrieval,2008,2(1/2):1-135.
[2]YI T,LUU A T,SIU C H,et al.Reasoning with Sarcasm by Reading In-between[C]//Proceedings of the 56th Annual Mee-ting of the Association for Computational Linguistics.Australia:Association for Computational Linguistics,2018:1010-1020.
[3]KUMAR A,NARAPAREDDY V T,SRIKANTH V A,et al.Sarcasm detection using multi-head attention based bidirectional LSTM[J].IEEE Access,2020,8:6388-6397.
[4]BAMMAN D,SMITH N.Contextualized sarcasm detection ontwitter[C]//Proceedings of the International AAAI Conference on Web and Social Media.California:AAAI Press,2015,9(1):574-577.
[5]RAJADESINGAN A,ZAFARANII R,LIU H.Sarcasm detection on twitter:A behavioral modeling approach[C]//Procee-dings of the Eighth ACM International Conference on Web Search and Data Mining.New York:Association for Computing Machinery,2015:97-106.
[6]JOSHI A,SHARMA V,BHATTACHARYYA P.Harnessingcontext incongruity for sarcasm detection[C]//Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing.Beijing:Association for Computational Linguistics,2015:757-762.
[7]SANTIAGO C,DEVAMANYU H,VERÓNICA P,et al.To-wards Multimodal Sarcasm Detection(An _Obviously_ Perfect Paper)[C]//Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics.Florence:Association for Computational Linguistics,2019:4619-4629.
[8]PAN H,LIN Z,FU P,et al.Modeling intra and inter-modality incongruity for multi-modal sarcasm detection[C]//Findings of the Association for Computational Linguistics:EMNLP 2020.Online:Association for Computational Linguistics,2020:1383-1392.
[9]LIANG B,LIN Z,QIN B,et al.Topic-Oriented Sarcasm Detection:New Task,New Dataset and New Method[C]//Procee-dings of the 21st Chinese National Conference on Computational Linguistics.Nanchang:Chinese Information Processing Society of China,2022:557-568.
[10]LI D,LI Y,ZHANG J,et al.C3KG:A Chinese CommonsenseConversation Knowledge Graph[C]//Findings of the Association for Computational Linguistics:ACL 2022.2022:1369-1383.
[11]GHOSH A,VEALE T.Fracking sarcasm using neural network[C]//Proceedings of the 7th Workshop on Computational Approaches to Subjectivity,Sentiment and Social Media Analysis.San Diego:Association for Computational Linguistics,2016:161-169.
[12]VAN HEE C,LEFEVER E,HOSTE V.Semeval-2018 task 3:Irony detection in english tweets[C]//Proceedings of The 12th International Workshop on Semantic Evaluation.New Orleans:Association for Computational Linguistics,2018:39-50.
[13]JOSHI A,TRIPATHI V,PATEL K,et al.Are Word Embedding-based Features Useful for Sarcasm Detection?[C]//Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing.Austin:Association for Computational Linguistics,2016:1006-1011.
[14]TAY Y,TUAN L A,HUI S C,et al.Reasoning with sarcasm by reading in-between[C]//Proceedings of the 56th Annual Mee-ting of the Association for Computational Linguistics.Melbourne:Association for Computational Linguistics,2018:1010-1020.
[15]HAZARIKA D,PORIA S,GORANTLA S.Cascade:Contextual sarcasm detection in online discussion forums[C]//Proceedings of the 27th International Conference on Computational Linguistics.Santa Fe:Association for Computational Linguistics,2018:1837-1848.
[16]KOLCHINSKI Y A,POTTS C.Representing social media users for sarcasm detection[C]//Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing.Brussels:Association for Computational Linguistics,2018:1115-1121.
[17]BABANEJAD N,DAVOUDI H,AN A,et al.Affective and contextual embedding for sarcasm detection[C]//Proceedings of the 28th International Conference on Computational Linguistics.Barcelona(Online):International Committee on Computational Linguistics,2020:225-243.
[18]BI B,WU C,YAN M,et al.Incorporating External Knowledge into Machine Reading for Generative Question Answering[C]//Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing.Hong Kong:Association for Computational Linguistics,2019:2521-2530.
[19]LIU Z,XIONG C,SUN M,et al.Entity-duet neural ranking:Understanding the Role of Knowledge Graph Semantics in Neural Information Retrieval[C]//Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics.Melbourne:Association for Computational Linguistics,2018:2395-1405.
[20]WANG Z,LI L,ZENG D.Knowledge-enhanced natural lan-guage inference based on knowledge graphs[C]//Proceedings of the 28th International Conference on Computational Linguistics.Barcelona:International Committee on Computational Linguistics,2020:6498-6508.
[21]REIMERS N,GUREVYCHI.Sentence-bert:Sentence embed-dings using siamese bert-networks[C]//Proceedings of the 2019 Conference on Empirical Methods in Natural Language Proces-sing and the 9th International Joint Conference on Natural Language Processing.Hong Kong:Association for Computational Linguistics,2019:3982-3992.
[22]GONG X,ZHAO Q,ZHANG J,et al.The design and construc-tion of a Chinese sarcasm dataset[C]//Proceedings of the Twelfth Language Resources and Evaluation Conference.2020:5034-5039.
[23]DEVLIN J,CHANG M W,LEE K,et al.Bert:Pre-training of deep bidirectional transformers for language understanding[C]//Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies.Minneapolis:Association for Computational Linguistics,2019:4171-4186.
[24]XU M,WANG D,FENG S,et al.KC-ISA:An Implicit Sentiment Analysis Model Combining Knowledge Enhancement and Context Features[C]//Proceedings of the 29th International Conference on Computational Linguistics.Gyeongju:International Committee on Computational Linguistics,2022:6906-6915.
[25]LI J,PAN H,LIN Z,et al.Sarcasm detection with commonsense knowledge[J].IEEE/ACM Transactions on Audio,Speech,and Language Processing,2021,29:3192-3201
[26]BORDES A,USUNIER N,GARCIA-DURAN A,et al.Translating embeddings for modeling multi-relational data[C]//Proceedings of the 26th International Conference on Neural Information Processing Systems.2013:2787-2795.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!