Computer Science ›› 2021, Vol. 48 ›› Issue (6A): 299-305.doi: 10.11896/jsjkx.200500157

• Intelligent Computing • Previous Articles     Next Articles

Recognition and Transformation for Complex Noun Phrases Based on Boundary Perception

LIU Xiao-die   

  1. Beijing Union University,Beijing 100101,China
  • Online:2021-06-10 Published:2021-06-17
  • About author:LIU Xiao-die,born in 1984,Ph.D,lecturer.Her main research interests include Chinese information processing and corpus linguistics.
  • Supported by:
    National Natural Science Foundation of China(71974095).

Abstract: This paper proposes a rules-based method for recognizing and transforming the complex Noun Phrases to improve the translation quality of them in patent machine translation.By analyzing the semantic chunks and the structural units of Chinese and English complex Noun Phrases,under the guide of the boundary perception,this paper extracts the feature words,builds 57 re-cognition rules,designs combination strategies and realizes the formalization of Chinese complex Noun Phrases.By comparing Chinese and English complex Noun Phrases,this paper summarizes the differences between them,and determines the transformation strategies based on that.At last,it applies the method to an existing machine translation system to test our work.Experimental results show that our rules and strategy are very efficient,and improve the translation quality in patent machine translation.

Key words: Boundary perception, Machine translation, Noun phrase, Patent, Recognize, Rules, Transform

CLC Number: 

  • TP391
[1] 张冬梅,晋耀红.面向专利机器翻译的要素句蜕识别和转换研究[J].计算机科学,2014,41(S1):67-71.
[2] 池毓焕.多元逻辑组合的汉英对比初探.第二届NNC与语言学研讨会论文集[M].北京:海洋出版社,2004:308-312.
[3] 熊亮.非句蜕广义对象语义块的分析与处理[D].北京:中国科学院声学所,2006.
[4] 李千驹.基于HNC理论的汉英逻辑组合变换研究[D].北京:北京师范大学,2008.
[5] 李颖,王侃,池毓焕.面向汉英机器翻译的语义块构成变换[M].北京:科学出版社,2009:91-124.
[6] 詹卫东.面向中文信息处理的现代汉语短语结构规则研究[M].北京:清华大学出版社出版,2000:61-75.
[7] 李素建,刘群.汉语组块的定义和获取.语言计算与基于内容的文本处理[M].北京:清华大学出版社,2003:110-115.
[8] 胡乃全,朱巧明,周国栋.混合的汉语基本名词短语识别方法[J].计算机工程,2009,35(20):199-201.
[9] 田雪,黄德根.一种混合的汉语简单名词短语识别方法[J].小型微型计算机系统,2017,38(4):749-754.
[10] 姜亚辉,姬东鸿.结合半监督与主动学习的复杂名词短语识别[J].计算机工程与设计,2015,36(2):498-501,506.
[11] ZHU Y,JIN Y H.A Chinese-English patent machine translation system based on the theory of hierarchical network of concepts[J].The Journal of China Universities and Telecommunications,2012,19:140-146.
[12] 刘小蝶,朱筠,晋耀红.中文专利中有标记并列结构的自动识别研究[J].计算机工程,2018,44(6):162-168,175.
[1] WANG Ming, PENG Jian, HUANG Fei-hu. Multi-time Scale Spatial-Temporal Graph Neural Network for Traffic Flow Prediction [J]. Computer Science, 2022, 49(8): 40-48.
[2] ZHANG Lu-ping, XU Fei. Survey on Spiking Neural P Systems with Rules on Synapses [J]. Computer Science, 2022, 49(8): 217-224.
[3] CHEN Jun, HE Qing, LI Shou-yu. Archimedes Optimization Algorithm Based on Adaptive Feedback Adjustment Factor [J]. Computer Science, 2022, 49(8): 237-246.
[4] LI Tang, QIN Xiao-lin, CHI He-yu, FEI Ke. Secure Coordination Model for Multiple Unmanned Systems [J]. Computer Science, 2022, 49(7): 332-339.
[5] KANG Yan, XU Yu-long, KOU Yong-qi, XIE Si-yu, YANG Xue-kun, LI Hao. Drug-Drug Interaction Prediction Based on Transformer and LSTM [J]. Computer Science, 2022, 49(6A): 17-21.
[6] LI Jian-zhi, WANG Hong-ling, WANG Zhong-qing. Automatic Generation of Patent Summarization Based on Graph Convolution Network [J]. Computer Science, 2022, 49(6A): 172-177.
[7] LAI Teng-fei, ZHOU Hai-yang, YU Fei-hong. Real-time Extend Depth of Field Algorithm for Video Processing [J]. Computer Science, 2022, 49(6A): 314-318.
[8] ZHANG Jia-hao, LIU Feng, QI Jia-yin. Lightweight Micro-expression Recognition Architecture Based on Bottleneck Transformer [J]. Computer Science, 2022, 49(6A): 370-377.
[9] SUN Jie-qi, LI Ya-feng, ZHANG Wen-bo, LIU Peng-hui. Dual-field Feature Fusion Deep Convolutional Neural Network Based on Discrete Wavelet Transformation [J]. Computer Science, 2022, 49(6A): 434-440.
[10] LIU Yun, DONG Shou-jie. Acceleration Algorithm of Multi-channel Video Image Stitching Based on CUDA Kernel Function [J]. Computer Science, 2022, 49(6A): 441-446.
[11] CAO Yang-chen, ZHU Guo-sheng, SUN Wen-he, WU Shan-chao. Study on Key Technologies of Unknown Network Attack Identification [J]. Computer Science, 2022, 49(6A): 581-587.
[12] ZHAO Xiao-hu, YE Sheng, LI Xiao. Multi-algorithm Fusion Behavior Classification Method for Body Bone Information Reconstruction [J]. Computer Science, 2022, 49(6): 269-275.
[13] DONG Zhen-heng, REN Wei-ping, YOU Xin-dong, LYU Xue-qiang. Machine Translation Method Integrating New Energy Terminology Knowledge [J]. Computer Science, 2022, 49(6): 305-312.
[14] FENG Yan, WANG Rui-cong. Quantum Voting Protocol Based on Quantum Fourier Transform Summation [J]. Computer Science, 2022, 49(5): 311-317.
[15] ZHU Zhe-qing, GENG Hai-jun, QIAN Yu-hua. Line-Segment Clustering Algorithm for Chemical Structure [J]. Computer Science, 2022, 49(5): 113-119.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!