Computer Science ›› 2025, Vol. 52 ›› Issue (11A): 241000154-6.doi: 10.11896/jsjkx.241000154

• Artificial Intelligence • Previous Articles     Next Articles

Bidding Document Named Entity Recognition Algorithm Based on Multi-head Attention Mechanism and Dictionary Feature Fusion

YANG Hua, WANG Baohui   

  1. School of Software,Beihang University,Beijing 100191,China
  • Online:2025-11-15 Published:2025-11-10

Abstract: The preparation and review of bidding documents play a crucial role in ensuring the smooth operation of the bidding process.Entity recognition technology can notably enhance the accuracy and efficiency of information extraction,thereby improving the readability and retrievability of information during the review of bidding documents.However,due to the complexity of the content and the presence of numerous specialized terms,recognizing long entities poses a significant challenge.Traditional methods for named entity recognition(NER) perform poorly in addressing these issues.This paper proposes an NER approach named Roberta-DFF-BiLSTM-MHA-CRF,which integrates a multi-head attention mechanism,dictionary feature fusion,and the Roberta-BiLSTM-CRF model.Utilizing Roberta as the input layer,this method enhances the capability to capture long-range dependencies.The introduction of the multi-head self-attention mechanism improves the recognition of long entities.Meanwhile,incorporating domain-specific dictionary features addresses the issue of unclear term boundaries.Experimental results demonstrate that the proposed model significantly boosts the accuracy and efficiency of information extraction in the context of NER for bidding documents.When compared to the Bert-BiLSTM-CRF model,it achieves a 2.49 percentage point improvement in precision,a 4.28 percentage point increase in recall,and a 3.37 percentage point enhancement in F1 score.These improvements effectively reduce time and labor costs,offering an efficient new solution for information extraction from bidding documents.

Key words: Entity recognition in tender documents, Multi-head attention, Dictionary feature fusion, Roberta

CLC Number: 

  • TP301
[1]SHI B. Problems in the preparation of bidding documents and rationalization suggestions [J].China Tendering,2023(8):137-138.
[2]MCCALLUM A,FREITAG D,PEREIRA F.Maximum entropyMarkov models for information extraction and segmentation[C]//Proceedings of 17th International Conference on Machine Learning.2000:591-598.
[3]LAFFERTY J,MCCALLUM A,PEREIRA F.Conditionalran-dom fields:probabilistic models for segmenting andlabeling sequence data[C]//Proceedings of 17th International Conference on Machine Learning.San Francisco:MorganKaufmann Publishers,2001:282-289.
[4]YU J D,FAN X Z,YIN J H.Application of hidden Markov model in natural language processing[J].Computer Engineering and Design,2007,28(22):5514-5516.
[5]KALCHBRENNER N,GREFENSTETTE E,BLUNSOM P.A convolutional neural network for modelling sentences[C]//Proceedings of the Association for Computational Linguistics(ACL).2014:655-665.
[6]LAMPLE G,BALLESTEROS M,SUBRAMANIAN S,et al.Neural architectures for named entity recognition[C]//Procee-dings of NAACL-HLT.2016:260-270.
[7]ZHU X,SOBIHANI P,GUO H.Long short-term memory over recursive structures[C]//Proceedings of the 32nd International Conference on Machine Learning(LCML-15).2015:1604-1612.
[8]HUANG Z,WEI X,KAI Y.Bidirectional LSTM-CRF modelsfor sequence tagging[J].arXiv:1508.01991,2015.
[9]ZHANG S F,WEN L Y,BIAN X,et al.Oc-clusion-aware r-cnn:Detecting pedestrians in a crowd[J].The European Conference on Computer Vision(ECCV),2018,11207:657-674.
[10]LIU W,LIAO S C,HU W D,et al.Learningefficient single-stage pedestrian detectors by asymptoticlocalization fitting[C]//Computer Vision-ECCV 2018.2018:643-659.
[11]ZHANG S S,BENENSON R,SCHIELE B.Citypersons:A diverse dataset for pedestrian detection[C]//IEEE Conference on Computer Vision and Pattern Recognition.2017:4457-4465.
[12]LUO B,ZHANG X F,DUAN L,et al.Military Named Entity Recognition Based on RoBERTa-Span-Attack Label Pointer Network[J].Journal of Naval University of Engineering,2024,36(1):76-82,93.
[13]LI J H,XIONG W,GONG K,et al.Research on Entity Recognition of Power Equipment Defects Integrating BERT-WWM and Attention Mechanism[J].Journal of Electric Power,2024,39(2):126-135.
[14]ZHANG Y C,YANG Y,JIANG R,et al.A Business Entity Recognition Model Based on BiLSTM-CRF[J].Computer Engineering,2019,45(5):308-314.
[15]MI J X,XIE H W.Research and Application of Named Entity Recognition for Bidding Materials[J].Computer Engineering and Applications,2023,59(2):314-320.
[16]AEJAS B,BELHI A,ZHANG H,et al.Deep learning-based automatic analysis of legal contracts:a named entity recognition benchmark [J].Neural Computing and Applications,2024,36(23):14465-14481.
[17]AHMET T,METIN T.Enhanced Named Entity Recognition algorithm for financial document verification [J].The Journal of Supercomputing,2023,79(17):19431-19451.
[18]MA J,YU Y.Automatic Extraction Method for Key Information in Logistics Bidding Documents[J].Computer and Digital Engineering,2024,52(5):1400-1405.
[19]HEIM G.Named entity recognition indigitalen sammlungenein werkstattbericht aus der badischen landesbibliothek[J].Bibliotheksdienst,2023,57(6):364-375.
[20]PEI D,JING M,LIU H,et al.A fast RetinaNet fusionframework for multi-spectral pedestrian detection[EB/OL].https://doi.org/10.1016/j.infrared.2019.103178.
[21]MAO H L,AIZIERGUL I,CHEN D G.Named Entity Recognition in Power Grid Dispatching Domain Based on Multi-Head Attention[J].Computer Technology and Development,2023,33(2):181-186,194.
[22]LUO X,XIA X Y,AN Y,et al.Chinese Clinical Entity Recognition Combining Multi-Head Self-Attention Mechanism and BiLSTM-CRF[J].Journal of Hunan University (Natural Sciences Edition),2021,48(4):45-55.
[23]LI B,WANG H C.Implementation and Application of a Chinese Grammar Error Diagnosis System Based on CRF [J].Computer Science,2024,51(S1):1141-1146.
[1] LIU Le, XIAO Rong, YANG Xiao. Application of Decoupled Knowledge Distillation Method in Document-level RelationExtraction [J]. Computer Science, 2025, 52(8): 277-287.
[2] ZHENG Chuangrui, DENG Xiuqin, CHEN Lei. Traffic Prediction Model Based on Decoupled Adaptive Dynamic Graph Convolution [J]. Computer Science, 2025, 52(6A): 240400149-8.
[3] SHEN Xinyang, WANG Shanmin, SUN Yubao. Depression Recognition Based on Speech Corpus Alignment and Adaptive Fusion [J]. Computer Science, 2025, 52(6): 219-227.
[4] WU Fengyuan, LIU Ming, YIN Xiaokang, CAI Ruijie, LIU Shengli. Remote Access Trojan Traffic Detection Based on Fusion Sequences [J]. Computer Science, 2024, 51(6): 434-442.
[5] LUO Yuanyuan, YANG Chunming, LI Bo, ZHANG Hui, ZHAO Xujian. Chinese Medical Named Entity Recognition Method Incorporating Machine ReadingComprehension [J]. Computer Science, 2023, 50(9): 287-294.
[6] CUI Lin, CUI Chenlu, LIU Zhengwei, XUE Kai. Speech Emotion Recognition Based on Improved MFCC and Parallel Hybrid Model [J]. Computer Science, 2023, 50(6A): 220800211-7.
[7] ZHANG Qi, YU Shuangyuan, YIN Hongfeng, XU Baomin. Neural Collaborative Filtering for Social Recommendation Algorithm Based on Graph Attention [J]. Computer Science, 2023, 50(2): 115-122.
[8] ZANG Jie, ZHOU Wanlin, WANG Yan. Semantic Matching Method Integrating Multi-head Attention Mechanism and Siamese Network [J]. Computer Science, 2023, 50(12): 294-301.
[9] WANG Shuai, ZHANG Shu-jun, YE Kang, GUO Qi. Continuous Sign Language Recognition Method Based on Improved Transformer [J]. Computer Science, 2022, 49(11A): 211200198-6.
[10] XIAO Ding, ZHANG Yu-fan, JI Hou-ye. Electricity Theft Detection Based on Multi-head Attention Mechanism [J]. Computer Science, 2022, 49(1): 140-145.
[11] WANG Rui-ping, JIA Zhen, LIU Chang, CHEN Ze-wei, LI Tian-rui. Deep Interest Factorization Machine Network Based on DeepFM [J]. Computer Science, 2021, 48(1): 226-232.
[12] ZHANG Zhi-yang, ZHANG Feng-li, CHEN Xue-qin, WANG Rui-jin. Information Cascade Prediction Model Based on Hierarchical Attention [J]. Computer Science, 2020, 47(6): 201-209.
[13] TANG Guo-qiang,GAO Da-qi,RUAN Tong,YE Qi,WANG Qi. Clinical Electronic Medical Record Named Entity Recognition Incorporating Language Model and Attention Mechanism [J]. Computer Science, 2020, 47(3): 211-216.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!