计算机科学 ›› 2017, Vol. 44 ›› Issue (Z6): 109-111.doi: 10.11896/j.issn.1002-137X.2017.6A.023

• 智能计算 • 上一篇    下一篇

基于标准欧氏距离的燃油流量缺失数据填补算法

陈静杰,车洁   

  1. 中国民航大学电子信息与自动化学院 天津300300,中国民航大学电子信息与自动化学院 天津300300
  • 出版日期:2017-12-01 发布日期:2018-12-01
  • 基金资助:
    本文受科技支撑项目(2012BAC20B03),民航局节能减排专项计划项目(DPDSR0010)资助

Fuel Flow Missing-value Imputation Method Based on Standardized Euclidean Distance

CHEN Jing-jie and CHE Jie   

  • Online:2017-12-01 Published:2018-12-01

摘要: 为减小数据缺失对飞机油耗统计推断精度带来的负面影响,针对基于传统欧氏距离、马氏距离以及精简关联度的最近邻填补算法的不足,提出了一种基于标准欧氏距离的填补算法来估计QAR(Quick Access Recorder)数据中部分燃油流量数值的缺失。该算法通过QAR数据样本之间的标准欧氏距离选择最近邻样本,并利用熵值赋权法计算最近邻的加权系数,基于最近邻样本中燃油流量的加权平均即可得到缺失燃油流量的估计值。实验结果表明,标准欧氏距离能够有效度量样本相似性,所提出的算法优于常规填补算法,是处理飞机油耗数据缺失的一种有效方法。

关键词: 标准欧氏距离,燃油流量缺失数据估计,K近邻填补算法,熵值赋权法,RKNN算法

Abstract: To reduce the negative impact of aircraft fuel consumption statistical inference accuracy caused by the data missing,an estimated method based on standardized Euclidean distance was proposed to solve the fuel flow data missing problems.The nearest neighbors were chosen by the standardized Euclidean distance between QAR data samples,and then entropy was utilized to obtain the weight of the nearest neighbors.The missing value was estimated by the weighted average fuel flow of the nearest neighbors.Experiments prove that this method is valid to process fuel consumption data missing problems,and its performance is higher than the other imputation methods based on normal Euclidean distance,Mahalanobis distance or reduced relational grade.

Key words: Standardized euclidean distance,Fuel flow missing value estimation,KNN imputation method,Entropy,RKNN

[1] 庞新生.缺失数据处理方法的比较[J].统计与决策,2010(24):152-155.
[2] SHIGEYUKI O B A,MASA-AKI S,ICHIRO T,et al.A Baye-sian Missing Value Estimation Method for Gene Expression Profile Data[J].Bioinformatics,2003,19(16):510-514.
[3] TROYANSKAYA O,CANTOR M,SHERLOCK G,et al.Mis-sing Value Estimation Methods for DNA Microarrays[J].Bioinformatics,2001,17(4):520-525.
[4] 何云,皮德常.基于精简关联度的基因表达数据迭代填补算法[J].计算机科学,2015,2(11):251-255,3.
[5] STOICA P,LI J,LING J.Missing Data Recovery Via a Nonparametric Iterative Adaptive Approach[J].IEEE Signal Processing Letters,2009,16(4):241-244.
[6] 杨涛,骆嘉伟,王艳,等.基于马氏距离的缺失值填充算法[J].计算机应用,2002,25(12):2868-2871.
[7] 王雪飘,张宏亭,李学仁.基于马氏距离的飞行缺失数据估计方法[J].火力与指挥控制,2009,34(8):113-115.
[8] 刘婧.基于飞行数据分析的飞机燃油估计模型[D].南京:南京航空航天大学,2010.
[9] 陈静杰,肖冠平.飞机油耗分析工具设计[J].计算机工程与设计,2014,35(11):4012-4016.
[10] 陈静杰,邹迎欢.油耗预测中显著影响因素参数提取方法仿真[J].计算机仿真,2013,30(6):55-58.
[11] 童先群,周忠眉.基于属性值信息熵的KNN改进算法[J].计算机工程与应用,2010,46(3):115-117.
[12] 肖辉辉,段艳明.基于属性值相关距离的KNN算法的改进研究[J].计算机科学,2013,40(11A):157-159,7.
[13] 王凤梅,胡丽霞.一种基于近邻规则的缺失数据填补方法[J].计算机工程,2012,38(21):53-55,2.
[14] HUANG C C,LEE H M.A grey-based nearest neighbor ap-proach for missing attribute value prediction[J].Applied Intelligence,2004,20(3):239-252.
[15] MEESAD P,HENGPRAPROHM K.Combination of KNN-Based Feature Selection and KNN-Based Missing-Value Imputation of Microarry Data[C]∥The 3rd International Conference on Innovative Computing Information and Control.Washington,USA:IEEE Computer Society,2008:18-20.
[16] MOORTHY K,MOHAMAD M S,DERIS S.A Review onMissing Value Imputation Algorithms for Microarray Gene Expression Data[J].Current Bioinformatics,2014,9(1):18-22.

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!