Computer Science ›› 2024, Vol. 51 ›› Issue (11A): 240300095-8.doi: 10.11896/jsjkx.240300095

• Image Processing & Multimedia Technology • Previous Articles     Next Articles

Road Extraction from Complex Urban Remote Sensing Images Based on Multi-task Learning

WANG Kunyang1, LIU Yang1, YE Ning1, ZHANG Kai2   

  1. 1 School of Information Science and Technology,,Nanjing Forestry University,Nanjing 210037,China
    2 School of Instrument Science and Engineering,Southeast University,Nanjing 210018,China
  • Online:2024-11-16 Published:2024-11-13
  • About author:WANG Kunyang,born in 1993,M.S.His main research interests include embedded technology and so on.
    YE Ning,born in 1967,doctor,professor.His main research interests include bioinformatics,data mining and machine learning.
  • Supported by:
    Future Network Research Fund Program (FNSRFP-2021-YB-17).

Abstract: In this paper,we propose a new framework for road extraction from remote sensing images that aims to utilize the knowledge gained from road edge detection to improve the accuracy of road extraction.A multi-scale visual attention module that fuses multi-scale information and visual attention mechanisms is introduced in the study,and a cascading feature fusion module is constructed to integrate the network's prediction results at different scales.Based on this,we construct a multiscale visual attention network(MSVANet) containing encoders and decoders.A multi-task learning framework that incorporates the MSVANet is also proposed,and a particle swarm optimization algorithm(PSO) is used to optimize the automatic selection of the two learning rate hyperparameters of the multi-task learning framework.The training and testing results on the RNBD dataset show that the proposed method outperforms other road extraction methods in terms of various segmentation accuracy metrics and generalization ability.

Key words: Deep learning, Remote sensing image road extraction, Multi-task learning, Multi-scale visual attention network

CLC Number: 

  • TP391
[1]FANG H W.Application of Telematics in Intelligent Urban Transportation Networks[J].Electronic Technology,2022,51(2):228-229.
[2]MASSA F,BONAMINI L,SETTIMI A,et al.LiDAR-based GNSS Denied Localization for Autonomous Racing Cars[J].Sensors,2020,20(14):3992.
[3]ZHU Z X.Analysis of urban planning and sustainable development in the system of territorial spatial planning[J].Future City Design and Operation,2024(3):20-22.
[4]XU L Y,MAO K B,GUO Z H,et al.A review on the application of convolutional neural network in semantic segmentation of agricultural remote sensing images[J].Agricultural Outlook,2024,20(2):70-75.
[5]ZHU J Z.Research and application of remote sensing image semantic segmentation technology based on deep learning[J].Value Engineering,2023,42(34):109-111.
[6]WANG Z,XIE C M.Application of drone remote sensing technology in agriculture[J].Agricultural Engineering Technology,2023,43(29):44-45.
[7]LIAN R B,ZHANG Z M,LIAO Y P,et al.Fast road centerline extraction from remote sensing images by combining geodetic distance field and curve smoothing[J].Journal of Surveying and Mapping,2023,52(8):1317-1329.
[8]HUANG W K,TENG F,WANG Z D,et al.A review of image segmentation based on deep learning[J].Computer Science,2024,51(2):107-116.
[9]GUO D Q,FU Y,ZHU Y,et al.Self-attentive multi-scale feature fusion algorithm for semantic segmentation of remote sensing images[J].Journal of Computer Aided Design and Graphics,2023,35(8):1259-1268.
[10]MALAMBO L,POPESCU S C,ROONEY W,et al.A DeepLearning Semantic Segmentation-Based Approach for Field-LevelSorghum Panicle Counting [J].Remote Sensing,2019,11(24).
[11]LI X,JIAO H,WANG Y.Edge detection algorithm of cancer image based on deep learning [J].Bioengineered,2020,11(1):693-707.
[12]HUI H,ZHANG X,LI F,et al.A Partitioning-Stacking Prediction Fusion Network Based on an Improved Attention U-Net for Stroke Lesion Segmentation [J].IEEE Access,2020,PP(99):1-1.
[13]GAO D,CHEN J Y,XIE Y.A-PSPNet:APSPNet image semantic segmentation model incorporating attention mechanism[J].Journal of Chinese Academy of Electronic Science,2020,15(6):518-523.
[14]KEMKER R,SALVAGGIO C,KANAN C.Algorithms for semantic segmentation of multispectral remote sensing imagery using deep learning [J].ISPRS Journal of Photogrammetry and Remote Sensing,2018,145(NOV.):60-77.
[15]LIANG Z Y,WANG X L.Semantic segmentation of multispectral remote sensing images based on band-position adaptive selection[J].Advances in Lasers and Optoelectronics,2023,60(14):157-167.
[16]LU Z,QI L,LI B Y.Rice yield classification by combining hyperspectral features with semantic segmentation[J].Computer and Digital Engineering,2023,51(9):1979-1984+2097.
[17]LIU B,YU X,YU A,et al.Deep Few-Shot Learning for Hyperspectral Image Classification [J].IEEE Transactions on Geoscience & Remote Sensing,2018,57(4):2290-2304.
[18]LAKE B M,ULLMAN T D,TENENBAUM J B,et al.Building Machines That Learn and Think Like People [J].Behavioral and Brain Sciences,2017,40:e253.
[19]HOCHREITER S,YOUNGER A S,CONWELL P R.Learning to learn using gradient descent[C]//Artificial Neural Networks-ICANN 2001:International Conference Vienna,Austria,August 21-25,2001 Proceedings 11.Springer Berlin Heidelberg,2001:87-94.
[20]JÜRGEN SCHMIDHUBER,ZHAO J,WIERING M.ShiftingInductive Bias with Success-Story Algorithm,Adaptive Levin Search,and Incremental Self-Improvement [J].Machine Learning,1997,28(1):105-130.
[21]GRAVES A,WAYNE G,DANIHELKA I.Neural Turing Machines [J].arXiv:1410.5401,2014.
[22]XIE S,TU Z.Holistically-Nested Edge Detection[C]// 2015 IEEE International Conference on Computer Vision(ICCV).IEEE,2016.
[23]DOSOVITSKIY A,BEYER L,KOLESNIKOV A,et al.AnImage is Worth 16x16 Words:Transformers for Image Recognition at Scale[C]// International Conference on Learning Representations.2021.
[24]SHIVERS S,ROBERTS D,MCFADDEN J,et al.Using ImagingSpectrometry to Study Changes in Crop Area in California's Central Valley during Drought [J].Remote Sensing,2018,10(10).
[25]LIU Y,YAO J,LU X,et al.RoadNet:Learning to Comprehensively Analyze Road Networks in Complex Urban Scenes From High-Resolution Remotely Sensed Images [J].IEEE Transactions on Geoscience and Remote Sensing,2019,57(4):2043-2056.
[26]FETTAKA S,THIBAULT J,GUPTA Y.A new algorithmusing front prediction and NSGA-II for solving two and three-objective optimization problems [J].Optimization & Engineering,2015,16(4):713-736.
[27]PASZKE,ADAM,et al.Pytorch:An imperative style,high-performance deep learning library [J].Advances in Neural Information Processing Systems,2019,32.
[28]ZHOU L,ZHANG C,WU M.D-LinkNet:LinkNet with pre-trained encoder and dilated convolution for high resolution satellite imagery road extraction[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops.2018:182-186.
[29]WU M,ZHANG C,LIU J,et al.Towards accurate high resolution satellite image semantic segmentation[J].Ieee Access,2019,7:55609-55619.
[30]YAN H,ZHANG C,YANG J,et al.Did-LinkNet:Polishing D-block with dense connection and iterative fusion for road extraction[C]//2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS.IEEE,2021:2186-2189.
[1] DU Yu, YU Zishu, PENG Xiaohui, XU Zhiwei. Padding Load:Load Reducing Cluster Resource Waste and Deep Learning Training Costs [J]. Computer Science, 2024, 51(9): 71-79.
[2] XU Jinlong, GUI Zhonghua, LI Jia'nan, LI Yingying, HAN Lin. FP8 Quantization and Inference Memory Optimization Based on MLIR [J]. Computer Science, 2024, 51(9): 112-120.
[3] SUN Yumo, LI Xinhang, ZHAO Wenjie, ZHU Li, LIANG Ya’nan. Driving Towards Intelligent Future:The Application of Deep Learning in Rail Transit Innovation [J]. Computer Science, 2024, 51(8): 1-10.
[4] KONG Lingchao, LIU Guozhu. Review of Outlier Detection Algorithms [J]. Computer Science, 2024, 51(8): 20-33.
[5] TANG Ruiqi, XIAO Ting, CHI Ziqiu, WANG Zhe. Few-shot Image Classification Based on Pseudo-label Dependence Enhancement and NoiseInterferenceReduction [J]. Computer Science, 2024, 51(8): 152-159.
[6] XIAO Xiao, BAI Zhengyao, LI Zekai, LIU Xuheng, DU Jiajin. Parallel Multi-scale with Attention Mechanism for Point Cloud Upsampling [J]. Computer Science, 2024, 51(8): 183-191.
[7] ZHANG Junsan, CHENG Ming, SHEN Xiuxuan, LIU Yuxue, WANG Leiquan. Diversified Label Matrix Based Medical Image Report Generation [J]. Computer Science, 2024, 51(8): 200-208.
[8] GUO Fangyuan, JI Genlin. Video Anomaly Detection Method Based on Dual Discriminators and Pseudo Video Generation [J]. Computer Science, 2024, 51(8): 217-223.
[9] CHEN Siyu, MA Hailong, ZHANG Jianhui. Encrypted Traffic Classification of CNN and BiGRU Based on Self-attention [J]. Computer Science, 2024, 51(8): 396-402.
[10] YANG Heng, LIU Qinrang, FAN Wang, PEI Xue, WEI Shuai, WANG Xuan. Study on Deep Learning Automatic Scheduling Optimization Based on Feature Importance [J]. Computer Science, 2024, 51(7): 22-28.
[11] LI Jiaying, LIANG Yudong, LI Shaoji, ZHANG Kunpeng, ZHANG Chao. Study on Algorithm of Depth Image Super-resolution Guided by High-frequency Information ofColor Images [J]. Computer Science, 2024, 51(7): 197-205.
[12] SHI Dianxi, GAO Yunqi, SONG Linna, LIU Zhe, ZHOU Chenlei, CHEN Ying. Deep-Init:Non Joint Initialization Method for Visual Inertial Odometry Based on Deep Learning [J]. Computer Science, 2024, 51(7): 327-336.
[13] FAN Yi, HU Tao, YI Peng. Host Anomaly Detection Framework Based on Multifaceted Information Fusion of SemanticFeatures for System Calls [J]. Computer Science, 2024, 51(7): 380-388.
[14] GAN Run, WEI Xianglin, WANG Chao, WANG Bin, WANG Min, FAN Jianhua. Backdoor Attack Method in Autoencoder End-to-End Communication System [J]. Computer Science, 2024, 51(7): 413-421.
[15] HUANG Haixin, CAI Mingqi, WANG Yuyao. Review of Point Cloud Semantic Segmentation Based on Graph Convolutional Neural Networks [J]. Computer Science, 2024, 51(6A): 230400196-7.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!