结合预训练模型和数据增强的跨领域属性级情感分析研究

doi:10.11896/jsjkx.240900114

Abstract

Abstract: Aspect-based Sentiment Analysis(ABSA) is a fine-grained sentiment analysis task,which aimes at identifying specific aspects in text and exploring their sentiment orientation.To solve the problem of poor performance of ABSA model due to its in-ability to adapt to different domain language styles and lack of labeled data in target domain,this paper proposes a cross-domain aspect-based sentiment analysis method combined with pre-trained model.The pretraining model is used to generate labels for the target domain text,and the large language model is used to regenerate natural sentences with more target domain style.Finally,the generated samples and source domain samples are combined for training to predict the target domain.This experimental results on the restaurant and laptop datasets from the SemEval corpus,as well as a publicly available Web service review dataset show that,compared to existing cross-domain sentiment analysis methods,the proposed method achieves at least a 5.33% improvement in F₁ score,fully demonstrating its effectiveness.

Key words: Cross-domain ABSA, Pre-training model, T5, GPT

CLC Number:

TP391

CHEN Ge, WANG Zhongqing. Cross-domain Aspect-based Sentiment Analysis Based on Pre-training Model with Data Augmentation[J].Computer Science, 2025, 52(8): 300-307.

References

[1]HU M,LIU B.Mining and summarizing customer reviews[C]//Proceedings of the Tenth ACM SIGKDD International Confe-rence on Knowledge Discovery and Data Mining.2004:168-177.
[2]WANG W,PAN S J,DAHLMEIER D,et al.Coupled multi-layer attentions for co-extraction of aspect and opinion terms[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2017:3316-3322.
[3]HU M,ZHAO S,ZHANG L,et al.CAN:Constrained attentionnetworks for multi-aspect sentiment analysis[C]//Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing.2019:4601-4610.
[4]LI X,BING L,LI P,et al.A unified model for opinion target extraction and target sentiment prediction[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2019:6714-6721.
[5]PENG H,XU L,BING L,et al.Knowing what,how and why:A near complete solution for aspect-based sentiment analysis[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2020:8600-8607.
[6]XU L,LI H,LU W,et al.Position-aware tagging for aspect sentiment triplet extraction[C]//Proceedings of the 2020 Confe-rence on Empirical Methods in Natural Language Processing.2020:2339-2349.
[7]WU Z,YING C,ZHAO F,et al.Grid tagging scheme for aspect-oriented fine-grained opinion extraction[C]//Findings of the Association for Computational Linguistics:EMNLP 2020.2020:2576-2585.
[8]ZHANG W,LI X,DENG Y,et al.Towards generative aspect-based sentiment analysis[C]//Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing. 2021:504-510.
[9]ZHANG W,DENG Y,LI X,et al.Aspect sentiment quad prediction as paraphrase generation[C]//Proceedings of the 2021 Conference on Empirical Methods in Natural Language Proces-sing.2021:9209-9219.
[10]BAI Y,XIE Y,LIU X,et al.BvSP:Broad-view Soft Prompting for Few-Shot Aspect Sentiment Quad Prediction[C]//Procee-dings of the 62nd Annual Meeting of the Association for Computational Linguistics.2024:8465-8482.
[11]CHEN B,OUYANG Q,LUO Y,et al.S²GSL:Incorporating Segment to Syntactic Enhanced Graph Structure Learning for Aspect-based Sentiment Analysis[C]//Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics.2024:13366-13379.
[12]JAKOB N,GUREVYCH I.Extracting opinion targets in a single and cross-domain setting with conditional random fields[C]//Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing.2010:1035-1045.
[13]CHERNYSHEVICH M,BELARUS I.Cross-domain extraction of product features using conditional random fields[C]//Proceedings of the 8th International Workshop on Semantic Evaluation(SemEval 2014).2014:309-313.
[14]DING Y,YU J,JIANG J.Recurrent neural networks with auxiliary labels for cross-domain opinion target extraction[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2017:3436-3442.
[15]YU J,GONG C,XIA R.Cross-domain review generation for aspect-based sentiment analysis[C]//Findings of the Association for Computational Linguistics:ACL-IJCNLP 2021.2021:4767-4777.
[16]SU Y,ZHOU X B.Cross-domain Sentiment Analysis Based on Gradient Data Selection[J].Software guid,2023,22(5):50-56.
[17]DENG Y,ZHANG W,PAN S J,et al.Bidirectional generativeframework for cross-domain aspect-based sentiment analysis[C]//Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics.2023:12272-12285.
[18]RAFFEL C,SHAZEER N,ROBERTS A,et al.Exploring the limits of transfer learning with a unified text-to-text transformer[J].Journal of Machine Learning Research,2020,21(140):1-67.
[19]LI Z,LI X,WEI Y,et al.Transferable end-to-end aspect-based sentiment analysis with selective adversarial learning[C]//Proceedings of the 2019 Confe-rence on Empirical Methods in Natural Language Processing and the 9th International Joint Confe-renceon Natural Language Processing.2019:4590-4600.
[20]ZHOU Y,ZHU F,SONG P,et al.An adaptive hybrid framework for cross-domain aspect-based sentiment analysis[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2021:14630-14637.
[21]GONG C,YU J,XIA R.Unified feature andinstance based domain adaptation for aspect-based sentiment analysis[C]//Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing.2020:7035-7045.
[22]LI J,YU J,XIA R.Generative cross-domain data augmentation for aspect and opinion co-extraction[C]//Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies.2022:4219-4229.
[23]YU J,ZHAO Q,XIA R.Cross-domain data augmentation with domain-adaptive language modeling for aspect-based sentiment analysis[C]//Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics.2023:1456-1470.

Related Articles 15

[1]	KONG Yinling, WANG Zhongqing, WANG Hongling. Study on Opinion Summarization Incorporating Evaluation Object Information [J]. Computer Science, 2025, 52(7): 233-240.
[2]	HU Caishun. Study on Named Entity Recognition Algorithms in Audit Domain Based on Large LanguageModels [J]. Computer Science, 2025, 52(6A): 240700190-4.
[3]	ZHAO Zheyu, WANG Zhongqing, WANG Hongling. Commodity Attribute Classification Method Based on Dual Pre-training [J]. Computer Science, 2025, 52(6A): 240500127-8.
[4]	LI Daicheng, LI Han, LIU Zheyu, GONG Shiheng. MacBERT Based Chinese Named Entity Recognition Fusion with Dependent Syntactic Information and Multi-view Lexical Information [J]. Computer Science, 2025, 52(6A): 240600121-8.
[5]	JIAO Jian, CHEN Ruixiang, HE Qiang, QU Kaiyang, ZHANG Ziyi. Study on Smart Contract Vulnerability Repair Based on T5 Model [J]. Computer Science, 2025, 52(4): 362-368.
[6]	HAN Wei, JIANG Shujuan, ZHOU Wei. Patch Correctness Verification Method Based on CodeBERT and Stacking Ensemble Learning [J]. Computer Science, 2025, 52(1): 250-258.
[7]	ZHANG Jian, LI Hui, ZHANG Shengming, WU Jie, PENG Ying. Review of Pre-training Methods for Visually-rich Document Understanding [J]. Computer Science, 2025, 52(1): 259-276.
[8]	YANG Binxia, LUO Xudong, SUN Kaili. Recent Progress on Machine Translation Based on Pre-trained Language Models [J]. Computer Science, 2024, 51(6A): 230700112-8.
[9]	SHI Jiyun, ZHANG Chi, WANG Yuqiao, LUO Zhaojing, ZHANG Meihui. Generation of Structured Medical Reports Based on Knowledge Assistance [J]. Computer Science, 2024, 51(6): 317-324.
[10]	ZHANG Zhiyuan, ZHANG Weiyan, SONG Yuqiu, RUAN Tong. Multilingual Event Detection Based on Cross-level and Multi-view Features Fusion [J]. Computer Science, 2024, 51(5): 208-215.
[11]	YI Liu, GENG Xinyu, BAI Jing. Hierarchical Multi-label Text Classification Algorithm Based on Parallel Convolutional Network Information Fusion [J]. Computer Science, 2023, 50(9): 278-286.
[12]	LI Yuqiang, LI Linfeng, ZHU Hao, HOU Mengshu. Deep Learning-based Algorithm for Active IPv6 Address Prediction [J]. Computer Science, 2023, 50(7): 261-269.
[13]	LIU Zhe, YIN Chengfeng, LI Tianrui. Chinese Spelling Check Based on BERT and Multi-feature Fusion Embedding [J]. Computer Science, 2023, 50(3): 282-290.
[14]	HE Wenhao, WU Chunjiang, ZHOU Shijie, HE Chaoxin. Study on Short Text Clustering with Unsupervised SimCSE [J]. Computer Science, 2023, 50(11): 71-76.
[15]	HOU Yu-tao, ABULIZI Abudukelimu, ABUDUKELIMU Halidanmu. Advances in Chinese Pre-training Models [J]. Computer Science, 2022, 49(7): 148-163.

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed

Comments

Recommended 0

No Suggested Reading articles found!

Cross-domain Aspect-based Sentiment Analysis Based on Pre-training Model with Data Augmentation

PDF (PC)