计算机科学 ›› 2012, Vol. 39 ›› Issue (11): 157-159.

• 数据库与数据挖掘 • 上一篇    下一篇

基于QPSO-LSSVM的数据库相似重复记录检测算法

梁雪 任剑锋 景丽   

  1. (河南财经政法大学计算机与信息工程学院 郑州 450002)
  • 出版日期:2018-11-16 发布日期:2018-11-16

Approximate Duplicate Record Detection Algorithm Based on PSO and LSSVM

  • Online:2018-11-16 Published:2018-11-16

摘要: 针对大规模数据库的相似重复记录的检测问题,提出了一种量子群优化算法(QPSO)与最小二乘支持向量 机(LSSVM)相结合的相似重复记录检测方法(QPSC}LSSVM)。首先计算记录字段的相似度值;然后利用QPSO对 LSSVM参数进行优化,构建相似重复记录检测模型;最后通过具体数据集进行仿真测试实验。仿真结果表明,QPSCL LSSVM不仅提高了重复记录检测准确率,而且提高了检测效率,是一种有效的相似重复记录检测算法。

关键词: 量子粒子群优化算法,最小二乘支持向量机,相似重复记录,检测

Abstract: Approximately duplicate record detection algorithm was proposed based on quantum swarm algorithm (QP- SO) and least squares support vector machine (LSSVM) to solve the large-scale database approximation duplicate re- cord detection problem. Firstly, the record field similarity values are calculated, and then the LSSVM parameters are op- timized,by QPSO to construction the approximately duplicate records detection model, finally simulation experiments arc carried out on the data set. hhe simulation results show that QPSC}I_SSVM not only improves the accuracy of the duplicate record detection but also improves the detection efficiency,and it is an effective approximate duplicate record, Detection algorithm.

Key words: Quantum particle swarm optimization, Least square support vector machines, Approximately duplicate record, Detection

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!