计算机科学 ›› 2015, Vol. 42 ›› Issue (Z11): 374-377.

• 信息安全 • 上一篇    下一篇

一种基于语句主谓语编码的文本水印技术

李桂森,陈建平,马海英,杨方兴   

  1. 南通大学计算机科学与技术学院 南通226019,南通大学计算机科学与技术学院 南通226019,南通大学计算机科学与技术学院 南通226019,南通大学计算机科学与技术学院 南通226019
  • 出版日期:2018-11-14 发布日期:2018-11-14
  • 基金资助:
    本文受国家自然科学基金项目(61402244),南通市应用研究计划项目(BK2011026)资助

Method for Text Watermarking Based on Subject-verb Encoding

LI Gui-sen, CHEN Jian-ping, MA Hai-ying and YANG Fang-xing   

  • Online:2018-11-14 Published:2018-11-14

摘要: 文本水印通过在文本中嵌入版权标识信息(水印)来保护文本作品的知识产权。提出一种对文本中语句的主谓语进行编码来嵌入水印的方法。将水印信息转换成十六进制的Unicode码串,借助哈尔滨工业大学的语言技术平台(LTP),对文本中的语句进行一系列处理获取其中的主谓语,用上述Unicode码串中的一段对每一个主谓语进行编码表示,以此实现水印的嵌入。提取水印时,从被检测的文本中获取语句的主谓语,对照嵌入水印时形成的码本,对每个主谓语进行比较和译码,取出各主谓语所对应的Unicode码段,将它们按正确顺序拼接起来,转换成对应的字符,得到嵌入的水印信息。所提算法具有很好的隐蔽性,能有效抵抗各种常见的攻击。

关键词: 数字水印,文本水印,主谓语编码,LTP

Abstract: Text watermarking protects the copyrights of text works by embedding copyright information(watermark) into a text.This paper proposed a text watermarking technique,in which the watermark is embedded by encoding the subject-verbs of the sentences in a text.A watermark message is converted into a string of the hexadecimal Unicode code.With the help of the language technology platform(LTP) of Harbin Institute of Technology,a series of processes are applied to the text to obtain the subject-verbs in the text.Each of the subject-verbs is encoded with one piece of the Unicode string,which achieves the embedding of the watermark.When extracting the watermark,the subject-verbs are obtained from the detected text and decoded according to the codebook generated in the watermark embedding.The corresponding pieces of the Unicode string are taken out from the codebook and put together in correct order.They are then converted back into the original characters to obtain the embedded watermark message.The proposed algorithm has a good nature of concealment and can resist various watermark attacks.

Key words: Digital watermarking,Text watermarking,Subject-verb encoding,LTP

[1] Atallah M,McDonough C,Nirenburg S,et al.Natural Language Processing for Information Assurance and Security:An Overview and Implementations[C]∥Proceedings of the 9th ACM/SIGSAC New Security Paradigms Workshop.Ireland,2000:51-65
[2] 刘旻昊,孙堡垒,郭云彪,等.文本数字水印技术研究综述[J].东南大学学报(自然科学版),2007,37(z1):225-230
[3] 黄华,齐春,李俊,等.文本数字水印[J].中文信息学报,2001,15(5):52-57
[4] 梁旭,远志永,黄明.基于行间距编码的文本数字水印算法[J].信息技术,2008,32(3):38-41
[5] 于晨斐.基于二次余数的Word文档数字水印[J].计算机仿真,2007,24(11):324-326
[6] 蔡菲菲,刘洋,尹香兰.一种基于word文档的文本水印技术研究[J].计算机科学,2012,39(z3):39-40
[7] 陈翔.一种基于中文字符编码的文本水印算法研究[J].计算机技术与发展,2013(2):237-240
[8] Maxemchuk N F.Electronic Document Distribution[J].AT&T Bell Laboratories Technical Journal,1994,73(5):73-80
[9] Atallah M J,Raskin V,Meta C.Natural language watermarking:design,analysis,and a proof-of-concept implementation[C]∥Proc.of Information Hiding-Fourth International Workshop.Berlin:Springer,2001:185-200
[10] 张宇,刘挺,陈毅恒,等.自然语言文本水印[J].中文信息学报,2005,19(1):56-62
[11] 杨超,李仁发,蒋斌,等.基于语义的自然语言文本数字水印研究[J].计算机工程与设计,2005,26(6):1428-1430
[12] 甘灿,孙星明,刘玉玲,等.一种改进的基于同义词替换的中文文本信息隐藏方法[J].东南大学学报,2007,37(z1):137-140
[13] Jalil Z,Mirza A M.A Review of Digital Watermarking Techniques for Text Documents[C]∥International Conference on Information and Multimedia Technology.Jeju Island,2009:230-234
[14] 温泉,孙锬锋,王树勋.零水印的概念与应用[J].电子学报,2003,1(2):214-216
[15] 斯琴,张力,廉德亮.基于文本特征的文本水印算法[J].计算机应用,2009,29(9):2348-2350
[16] 舒娟娟,刘玉玲.基于词性频率的中文文本零水印算法[J].计算机应用,2011,31(z2):103-105
[17] Che Wan-xiang,Li Zheng-hua, Liu Ting.LTP:A Chinese Language Technology Platform[C]∥Proc.of the Coling 2010:De-monstrations.Beijing,2010:13-16

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!