计算机科学 ›› 2019, Vol. 46 ›› Issue (5): 235-240.doi: 10.11896/j.issn.1002-137X.2019.05.036

• 图形图像与模式识别 • 上一篇    下一篇

基于异构机器学习算法融合的遥感影像分类

田振坤1,2, 傅莺莺3, 刘素红4,5   

  1. (中国劳动关系学院数学与计算机教学部 北京100048)1
    (中国劳动关系学院大数据与安全研究所 北京100048)2
    (北京工商大学理学院 北京100048)3
    (北京师范大学地理学院 北京100875)4
    (遥感科学国家重点实验室 北京100875)5
  • 发布日期:2019-05-15
  • 作者简介:田振坤(1979-),男,博士,副教授,主要研究方向为数字图像处理、模式识别、机器学习等,E-mail:tzhenkun@163.com;傅莺莺(1981-),女,博士,副教授,主要研究方向为统计与模式识别;刘素红(1965-),女,博士,教授,主要研究方向为遥感图像处理与地表参数反演,E-mail:liush@bnu.edn.cn(通信作者)。
  • 基金资助:
    国家自然科学基金项目(41171262),遥感科学国家重点实验室开放基金项目(OFSLRSS201628),北京市优秀人才培养资助青年骨干个人项目(2015000020124G032),中国劳动关系学院科研项目(18YYJS016)资助。

Remote Sensing Image Classification Based on Heterogeneous Machine Learning Algorithm Fusion

TIAN Zhen-kun1,2, FU Ying-ying3, LIU Su-hong4,5   

  1. (Mathematics and Computer Department,China University of Labor Relations,Beijing 100048,China)1
    (Institute of Big Data and Security,China University of Labor Relations,Beijing 100048,China)2
    (School of Science,Beijing Technology and Business University,Beijing 100048,China)3
    (School of Geography,Beijing Normal University,Beijing 100875,China)4
    (State Key Laboratory of Remote Sensing Science,Beijing 100875,China)5
  • Published:2019-05-15

摘要: 针对信息获取与处理过程中的不确定性导致的遥感数据分类精度难以满足土地覆盖变化、环境监测、专题信息提取等应用方面的需求,提出了一种基于机器学习的分类融合算法。采用6种异构分类器,以查准率及查全率矩阵为先验知识,依据分类器差异性指数AD对单分类器进行优化组合,结合三维概率矩阵分别得到抽象级、排序级和度量级的分类融合结果输出,并以北京地区Landsat 8遥感影像的典型区域为研究对象进行分类预测。结果表明,从6个单分类器中选取3个进行组合时的效果较好,其中AD值最大的(NB,KNN,SVM)分类器组合是综合效果最好的分类器组合;所提算法的抽象级输出比单分类的平均精度高12.28%,比分类效果最好的单分类器SVM高2.24%;所提算法对多个“强成员分类器”进行融合仍然能有效提高分类精度,比常用融合算法RF,Bagging和Boosting分别高出11.23%,7.56%和11.36%,对各种地物的分类精度有显著的提高。

关键词: 差异性指数, 分类器融合, 机器学习, 先验知识

Abstract: In the application of multi-spectral remote sensing data,such as land cover change,environmental monitoring and thematic information extraction,the classification accuracy is not high enough due to the uncertainty of remote sen-sing information acquisition and processing.In order to further improve the classification accuracy,this paper proposed a fusion algorithm based on 6 heterogeneous machine learning classifiers.This algorithm provides classification results in abstract level,ranked level and measurement level by using prior knowledge set which is composed of precision and recall matrix,Accuracy and Difference (AD) index of the combination of classifiers,and the 3-dimensional probability matrix.Based on the Landsat 8 image data,the classification results in the study area of Beijing are forecasted by the proposed fusion algorithm and other different algorithms respectively.Experimental results shows that the 3-classifier combination composed of NB,KNN and SVM obtaines maximum AD value and the best classification effect.The abstract level output of the algorithm is 12.28% higher than the average accuracy of 6 single classifiers and even 2.24% higher than the best single classifier of SVM.Compared with the commonly used algorithms such as Random Forest (RF),Bagging and Boosting failed in the case of “strong member classifier”,the proposed fusion algorithm performs still well with accuracy 11.23%,7.56% and 11.36% higher than RF,Bagging and Boosting respectively.The proposed fusion algorithm can effectively improve the classification accuracy of remote sensing data by making full use of the diversity of classifiers and prior knowledge such as precision and recall matrix in the process of classification.

Key words: Accuracy and difference Index, Classifiers fusion, Machine learning, Prior knowledge

中图分类号: 

  • TP181
[1]HUABING H,YANLEI C,NICHOLAS C,et al.Mapping majorland cover dynamics in Beijing using all Landsat imagesin Google Earth Engine[J].Remote Sensing of Environment,2017,202:166-176.
[2]ZHU L,MA L.Class centroid alignment based domain adaptation for classification of remote sensing images[J].Pattern Re-cognition Letters,2016,83:124-132.
[3]JASPER V V,ARNOLD K B,DANIEL G B,et al.A review of current calibration and validation practices in land-change mode-ling[J].Environmental Modelling & Software,2016,82:174-182.
[4]ZHAI J H,ZHAO W X.Soft Combination of Probabilistic Neural Network Classifiers for Face Recognition[J].Computer Scien-ce,2015,42(7):305-308.(in Chinese)翟俊海,赵文秀.软组合概率神经网络分类器人脸识别方法[J].计算机科学,2015,42(7):305-308.
[5]DANIEL S P,CÉSAR F,MARÍA JR,et al.Probabilistic class hierarchies for multiclass classification[J].Journal of Computational Science,2018,26:254-263.
[6]NIKOLAS M,FUILLERMO A N,ASAL B.A machine learning approach to energy pile design[J].Computers and Geotechnics,2018,97(2):189-203.
[7]WU X D,XIAO Q,WEN J G,et al.Advances in uncertaintyanalysis for the validation of remote sensing products:Take leaf area index for example[J].Journal of Remote Sensing,2014,18(5):1011-1023.(in Chinese)吴小丹,肖青,闻建光,等.遥感数据产品真实性检验不确定性分析研究进展[J].遥感学报,2014,18(5):1011-1023.
[8]HAO T,MATTHEW B,FENG Z,et al.Deriving and validating Leaf Area Index (LAI) at multiple spatial scales through lidar remote sensing:A case study in Sierra National Forest,CA[J].Remote Sensing of Environment,2014,143:131-141.
[9]MITCHELL B L,DAVID A K,STUART R P,et al.A comparison of resampling methods for remote sensing classification andaccuracy assessment [J].Remote Sensing of Environment,2018,208:145-153.
[10]MFAUVEL,J CHANUSSOT,J ABENEDIKTSSON.A spatial-spectral kernel-based approach for the classification of remote-sensing images[J].Pattern Recognition,2012,45:381-392.
[11]MELLOR A,BOUKIR S.Exploring diversity in ensemble classification:Applications in large area land cover mapping[J].ISPRS Journal of Photogrammetry and Remote Sensing,2017,129:151-161.
[12]TANG Y,LI X R.Set-based similarity learning in subspace for agricultural remote sensing classification[J].Neurocomputing,2016,173:332-328.
[13]ALIM S,CLAUDIO P,PAOLO G,et al.Supervised and Semi-Supervised Multi-View Canonical Correlation Analysis Ensemble for Heterogeneous Domain Adaptation in Remote Sensing Image Classification[J].Remote Sensing,2017,9(4):337.
[14]YANG Y J,ZHAN Y L,TIAN Q J,et al.Crop classificationbased on GF-1/WFV NDVI time series[J].Transactions of the Chinese Society of Agricultural Engineering,2015,31(24):155-161.(in Chinese)杨闫君,占玉林,田庆久,等.基于GF-1/WFV NDVI时间序列数据的作物分类[J].农业工程学报,2015,31(24):155-161.
[15]XU H Q.A Study on Information Extraction of Water Bodywith the Modified Normalized Difference Water Index(MNDWI)[J].Journal of Remote Sensing,2005,9(5):589-595.(in Chinese)徐涵秋.利用改进的归一化差异水体指数(MNDWI)提取水体信息的研究[J].遥感学报,2005,9(5):589-595.
[16]周志华.机器学习[M].北京:清华大学出版社,2016:28-31.
[17]ZHANG Y.Study of Remote Sensing Image Classification Based on Machine Learning[D].Beijing:Beijing Forestry University,2014.(in Chinese)张雁.基于机器学习的遥感图像分类研究[D].北京:北京林业大学,2014.
[18]CHIRICI G,SCOTTI R,MONTAGHI A.Stochastic gradientboosting classification trees for forest fuel types mapping through airborne laser scanning and IRS LISS-III imagery[J].International Journal of Applied Earth Observation and Geoinformation,2013,25:87-97.
[19]DU P J,XIA J S,ZHANG W,et al.Multiple Classifier System For Remote Sensing Image Classification:A Review[J].Sensors,2012,12(4):4764-4792[20]NITZE I,B BARRETT,CAWKWELL F.Temporal optimisation of image acquisition for land cover classification with Random Forest and MODIS time-series[J].International Journal of Applied Earth Observation and Geoinformation,2015,34:136-146.
[21]GU Y F,LIU H.Sample-screening MKL method via boosting strategy for hyperspectral image classification[J].Neurocomputing,2016,173:1630-1639.
[1] 冷典典, 杜鹏, 陈建廷, 向阳.
面向自动化集装箱码头的AGV行驶时间估计
Automated Container Terminal Oriented Travel Time Estimation of AGV
计算机科学, 2022, 49(9): 208-214. https://doi.org/10.11896/jsjkx.210700028
[2] 宁晗阳, 马苗, 杨波, 刘士昌.
密码学智能化研究进展与分析
Research Progress and Analysis on Intelligent Cryptology
计算机科学, 2022, 49(9): 288-296. https://doi.org/10.11896/jsjkx.220300053
[3] 李瑶, 李涛, 李埼钒, 梁家瑞, Ibegbu Nnamdi JULIAN, 陈俊杰, 郭浩.
基于多尺度的稀疏脑功能超网络构建及多特征融合分类研究
Construction and Multi-feature Fusion Classification Research Based on Multi-scale Sparse Brain Functional Hyper-network
计算机科学, 2022, 49(8): 257-266. https://doi.org/10.11896/jsjkx.210600094
[4] 张光华, 高天娇, 陈振国, 于乃文.
基于N-Gram静态分析技术的恶意软件分类研究
Study on Malware Classification Based on N-Gram Static Analysis Technology
计算机科学, 2022, 49(8): 336-343. https://doi.org/10.11896/jsjkx.210900203
[5] 何强, 尹震宇, 黄敏, 王兴伟, 王源田, 崔硕, 赵勇.
基于大数据的进化网络影响力分析研究综述
Survey of Influence Analysis of Evolutionary Network Based on Big Data
计算机科学, 2022, 49(8): 1-11. https://doi.org/10.11896/jsjkx.210700240
[6] 陈明鑫, 张钧波, 李天瑞.
联邦学习攻防研究综述
Survey on Attacks and Defenses in Federated Learning
计算机科学, 2022, 49(7): 310-323. https://doi.org/10.11896/jsjkx.211000079
[7] 李亚茹, 张宇来, 王佳晨.
面向超参数估计的贝叶斯优化方法综述
Survey on Bayesian Optimization Methods for Hyper-parameter Tuning
计算机科学, 2022, 49(6A): 86-92. https://doi.org/10.11896/jsjkx.210300208
[8] 赵璐, 袁立明, 郝琨.
多示例学习算法综述
Review of Multi-instance Learning Algorithms
计算机科学, 2022, 49(6A): 93-99. https://doi.org/10.11896/jsjkx.210500047
[9] 王飞, 黄涛, 杨晔.
基于Stacking多模型融合的IGBT器件寿命的机器学习预测算法研究
Study on Machine Learning Algorithms for Life Prediction of IGBT Devices Based on Stacking Multi-model Fusion
计算机科学, 2022, 49(6A): 784-789. https://doi.org/10.11896/jsjkx.210400030
[10] 肖治鸿, 韩晔彤, 邹永攀.
基于多源数据和逻辑推理的行为识别技术研究
Study on Activity Recognition Based on Multi-source Data and Logical Reasoning
计算机科学, 2022, 49(6A): 397-406. https://doi.org/10.11896/jsjkx.210300270
[11] 姚烨, 朱怡安, 钱亮, 贾耀, 张黎翔, 刘瑞亮.
一种基于异质模型融合的 Android 终端恶意软件检测方法
Android Malware Detection Method Based on Heterogeneous Model Fusion
计算机科学, 2022, 49(6A): 508-515. https://doi.org/10.11896/jsjkx.210700103
[12] 许杰, 祝玉坤, 邢春晓.
机器学习在金融资产定价中的应用研究综述
Application of Machine Learning in Financial Asset Pricing:A Review
计算机科学, 2022, 49(6): 276-286. https://doi.org/10.11896/jsjkx.210900127
[13] 董振恒, 任维平, 游新冬, 吕学强.
融入新能源领域术语知识的机器翻译方法
Machine Translation Method Integrating New Energy Terminology Knowledge
计算机科学, 2022, 49(6): 305-312. https://doi.org/10.11896/jsjkx.210500117
[14] 么晓明, 丁世昌, 赵涛, 黄宏, 罗家德, 傅晓明.
大数据驱动的社会经济地位分析研究综述
Big Data-driven Based Socioeconomic Status Analysis:A Survey
计算机科学, 2022, 49(4): 80-87. https://doi.org/10.11896/jsjkx.211100014
[15] 李野, 陈松灿.
基于物理信息的神经网络:最新进展与展望
Physics-informed Neural Networks:Recent Advances and Prospects
计算机科学, 2022, 49(4): 254-262. https://doi.org/10.11896/jsjkx.210500158
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!