Computer Science ›› 2024, Vol. 51 ›› Issue (5): 134-142.doi: 10.11896/jsjkx.230200134

• Computer Graphics & Multimedia • Previous Articles     Next Articles

Study on Building Extraction from Remote Sensing Image Based on Multi-scale Attention

HE Xiaohui1, ZHOU Tao2, LI Panle2, CHANG Jing2, LI Jiamian2   

  1. 1 School of Earth Science and Technology,Zhengzhou University,Zhengzhou 450052,China
    2 School of Computer and Artificial Intelligence,Zhengzhou University,Zhengzhou 450001,China
  • Received:2023-02-19 Revised:2023-08-17 Online:2024-05-15 Published:2024-05-08
  • About author:HE Xiaohui,born in 1978,professor,Ph.D supervisor.Her main research interests include artificial intelligence,computer vision,remote sensing image processing and data mining.
  • Supported by:
    Henan Province Major Science and Technology SpecialProject--Research on Key Technologies for Constructing and Servicing the Yellow River Simulator for Supercomputing(201400210900).

Abstract: Building extraction from remote sensing images based on deep learning has the characteristics of wide coverage and high computational efficiency,and it plays an important role in urban construction,disaster prevention and other aspects.Most of the mainstream methods use multi-scale feature fusion to enable the neural network to learn more abundant semantic information.However,due to the complexity of multi-scale features and the interference of other ground objects,this kind of methods often lead to target missing and noise-intensive.To this end,this paper proposes a feature interpretation model MGA-ResNet50(MGAR) that combines attention mechanism.The core of the method is to use the multihead attention to process the hierarchical weighting of high-level semantic information,so as to extract the optimal feature combination with relatively better representation effect.Then use the gating structure to fuse the feature map of each dimension with the low-level semantic information of the corresponding encoder to compensate for the loss of local building details.Experimental results on public datasets such as Massachusetts Building and WHU Building show that the proposed algorithm can achieve higher F1 and IoU than the more advanced multi-scale feature fusion methods such as RAPNet,GAMNet and GSM.

Key words: Deep learning, Building extraction, Multi-scale feature, Multihead attention, Gating mechanism

CLC Number: 

  • TP391.4
[1]ZHANG Y,FEI X,WANG J,et al.Overview of building extraction methods based on high-resolution remote sensing images [J].Geomatics &Spatial Information Technology,2020,43(4):76-79.
[2]LONG J,SHELHAMER E,DARRELL T.Fully Convolutional Networks for Semantic Segmentation[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2015,39(4):640-651.
[3]ZHANG C,AN R,MA L.Improved U-Net remote sensingimage building change detection[J].Computer Engineering and Application,2021,57(3):239-246.
[4]HE Z,DING H,AN B.Cavity convolution E-Unet algorithm for building extraction from high-resolution remote sensing images [J].Journal of Geodesy and Geoinformation Science,2022,51(3):457-467.
[5]ZHANG C,GE Y,JIANG X.Building extraction from high-resolution remote sensing images based on sparse constraint SegNet [J].Journal of Xi'an University of Science and Technology,2020,40(3):441-448.
[6]WU L,HU X.Automatic building detection based on multi-scaleand multi-feature high spatial resolution remote sensing image [J].Remote Sensing of Land and Resources,2019,31(1):71-78.
[7]ZHANG Y,WANG X,ZHANG Z,et al.A remote sensingimage building extraction method based on boundary perception [J].Journal of Xi'an University of Electronic Science and Technology(Natural Science Edition),2022,49(1):236-244.
[8]LIU H,ZHANG C,GE Y,et al.Multi-scale feature fusion depth learning building extraction method [J].Geospatial Information,2022,20(2):97-100.
[9]ZHANG Y,YAN Q,DENG F.Multi-path RSU network method for building extraction from high-resolution remote sensingimage[J].Journal of Geodesy and Geoinformation Science,2022,51(1):135-144.
[10]LIU D,ZHANG H,CHENG D,et al.Building extraction me-thod based on attention mechanism [J].Remote Sensing Information,2021,36(4):119-124.
[11]ZHANG Y,CHENG C,YANG S,et al.Building extraction from remote sensing images based on dual attention mechanism model [J].Science of Surveying and Mapping,2022,47(4):129-136,174.
[12]LI H,LI Z,ZHANG D.Object-oriented building extraction at optimal scale [J].Remote Sensing Information,2022,37(3):72-76.
[13]CHEN K,GAO X,YAN M,et al.Pixel level building extraction of aerial image based on codec network [J].National Remote Sensing Bulletin,2020,24(9):1134-1142.
[14]HE Q,MENG Y,LI H.Multi-level code-decode network remote sensing image building segmentation [J].Application Research of Computers,2021,38(8):2510-2514.
[15]BIANCHINI M,SCARSELLI F.On the complexity of neuralnetwork classifiers:A comparison between shallow and deep architectures[J].IEEE Transactions on Neural Networks and Learning Systems,2014,25(8):1553-1565.
[16]RAGHU M,POOLE B,KLEINBERG J,et al.On the expressive power of deep neural networks[C]//Proceedings of the 34th International Conference on Machine Learning(Volume 70).Sydney:PMLR,2017:2847-2854.
[17]HE K,ZHANG X,REN S,et al.Deep residual learning forimage recognition[C]//IEEE Conference on Computer Vision and Pattern Recognition.Las Vegas:IEEE Press,2016:770-778.
[18]VASWANI A,SHAZEER N,PARMAR N,et al.Attention isall you need[C]//Proceedings of the 31st International Confe-rence on Neural Information Processing Systems.2017:6000-6010.
[19]LIN T Y,DOLLAR P,GIRSHICK R,et al.Feature Pyramid Networks for Object Detection[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition(CVPR).Honolulu:IEEE Press,2017:2117-2125.
[20]GU Y,YAN F.Building extraction based on different skeleton UNet++networks [J].Journal of University of Chinese Aca-demy of Sciences,2022,39(4):512-523.
[21]JI S,WEI S,LU M.Fully convolutional networks for multisource building extraction from an open aerial and satellite imagery data set[J].IEEE Transactions on Geoscience and Remote Sensing,2018,57(1):574-586.
[22]TIAN Q,ZHAO Y,LI Y,et al.Multiscale building extractionwith refined attention pyramid networks[J].IEEE Geoscience and Remote Sensing Letters,2021,19:1-5.
[23]ZHENG Z,ZHANG X,XIAO P,et al.Integrating gate and attention modules for high-resolution image semantic segmentation[J].IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing,2021,14:4530-4546.
[24]XU L,LI Y,XU J,et al.Gated spatial memory and centroid-aware network for building instance extraction[J].IEEE Tran-sactions on Geoscience and Remote Sensing,2021,60:1-14.
[1] BAO Kainan, ZHANG Junbo, SONG Li, LI Tianrui. ST-WaveMLP:Spatio-Temporal Global-aware Network for Traffic Flow Prediction [J]. Computer Science, 2024, 51(5): 27-34.
[2] ZHANG Jianliang, LI Yang, ZHU Qingshan, XUE Hongling, MA Junwei, ZHANG Lixia, BI Sheng. Substation Equipment Malfunction Alarm Algorithm Based on Dual-domain Sparse Transformer [J]. Computer Science, 2024, 51(5): 62-69.
[3] HE Shiyang, WANG Zhaohui, GONG Shengrong, ZHONG Shan. Cross-modal Information Filtering-based Networks for Visual Question Answering [J]. Computer Science, 2024, 51(5): 85-91.
[4] SONG Jianfeng, ZHANG Wenying, HAN Lu, HU Guozheng, MIAO Qiguang. Multi-stage Intelligent Color Restoration Algorithm for Black-and-White Movies [J]. Computer Science, 2024, 51(5): 92-99.
[5] BAI Xuefei, SHEN Wucheng, WANG Wenjian. Salient Object Detection Based on Feature Attention Purification [J]. Computer Science, 2024, 51(5): 125-133.
[6] XU Xuejie, WANG Baohui. Multi-label Patent Classification Based on Text and Historical Data [J]. Computer Science, 2024, 51(5): 172-178.
[7] LI Zichen, YI Xiuwen, CHEN Shun, ZHANG Junbo, LI Tianrui. Government Event Dispatch Approach Based on Deep Multi-view Network [J]. Computer Science, 2024, 51(5): 216-222.
[8] HONG Tijing, LIU Dengfeng, LIU Yian. Radar Active Jamming Recognition Based on Multiscale Fully Convolutional Neural Network and GRU [J]. Computer Science, 2024, 51(5): 306-312.
[9] SUN Jing, WANG Xiaoxia. Convolutional Neural Network Model Compression Method Based on Cloud Edge Collaborative Subclass Distillation [J]. Computer Science, 2024, 51(5): 313-320.
[10] CHEN Runhuan, DAI Hua, ZHENG Guineng, LI Hui , YANG Geng. Urban Electricity Load Forecasting Method Based on Discrepancy Compensation and Short-termSampling Contrastive Loss [J]. Computer Science, 2024, 51(4): 158-164.
[11] LIN Binwei, YU Zhiyong, HUANG Fangwan, GUO Xianwei. Data Completion and Prediction of Street Parking Spaces Based on Transformer [J]. Computer Science, 2024, 51(4): 165-173.
[12] XU Hao, LI Fengrun, LU Lu. Metal Surface Defect Detection Method Based on Dual-stream YOLOv4 [J]. Computer Science, 2024, 51(4): 209-216.
[13] SONG Hao, MAO Kuanmin, ZHU Zhou. Algorithm of Stereo Matching Based on GAANET [J]. Computer Science, 2024, 51(4): 229-235.
[14] XUE Jinqiang, WU Qin. Progressive Multi-stage Image Denoising Algorithm Combining Convolutional Neural Network and
Multi-layer Perceptron
[J]. Computer Science, 2024, 51(4): 243-253.
[15] CHEN Jinyin, LI Xiao, JIN Haibo, CHEN Ruoxi, ZHENG Haibin, LI Hu. CheatKD:Knowledge Distillation Backdoor Attack Method Based on Poisoned Neuronal Assimilation [J]. Computer Science, 2024, 51(3): 351-359.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!