结合局部、全局感知与语义流对齐的脑白质高信号分割方法

doi:10.11896/jsjkx.250700057

计算机科学 ›› 2026, Vol. 53 ›› Issue (4): 291-298.doi: 10.11896/jsjkx.250700057

• 计算机图形学&多媒体 • 上一篇下一篇

结合局部、全局感知与语义流对齐的脑白质高信号分割方法

张新峰¹, 郭依海¹, 刘晓民¹, 许忠贺¹, 李相生²

1 北京工业大学信息科学技术学院北京 100124
2 中国人民解放军空军特色医学中心影像科北京 100142

收稿日期:2025-07-10 修回日期:2025-11-13 出版日期:2026-04-15 发布日期:2026-04-08
通讯作者: 李相生(lxsheng500@163.com)
作者简介:(zxf@bjut.edu.cn)

White Matter High Signal Segmentation Method Combining Local and Global Perception and Semantic Flow Alignment

ZHANG Xinfeng¹, GUO Yihai¹, LIU Xiaomin¹, XU Zhonghe¹, LI Xiangsheng²

1 College of Information Science and Technology, Beijing University of Technology, Beijing 100124, China
2 Department of Radiology, Air Force Medical Center, PLA, Beijing 100142, China

Received:2025-07-10 Revised:2025-11-13 Published:2026-04-15 Online:2026-04-08
About author:ZHANG Xinfeng,born in 1974,Ph.D,associate professor.His main research interests include signal and information processing and machine learning.
LI Xiangsheng,born in 1975,Ph.D,professor.His main research interests include functional MRI imaging research and early diagnosis of lung cancer.

摘要/Abstract

摘要： 针对脑白质高信号目标小的特点,提出一种结合局部、全局感知与语义流对齐的脑白质信号分割方法PGF-Net。首先,提出局部感知注意力模块(Patch Aware Attention,PAA),通过划分局部小图像块进行特征选择的方法,加强局部特征提取能力;然后,提出结合局部和全局感知的注意力模块(Patch Global Aware Attention,PGAA),利用Transformer全局感知的特点建立长程依赖;最后,提出门控语义流对齐模块(Gated Flow Alignment Module GFAM),在解码部分预测语义流偏移场,引导解码器中的高层特征扩张,实现与编码器对应低层特征的精准对齐融合。实验结果表明,PGF-Net在自采数据集中,交并比(mIoU)达到0.876 9,Dice系数为0.842 3,豪斯多夫距离(HD)降至32.61,平均表面距离(ASD)仅为1.7,达到了最优效果;在两种小目标公开数据集上也达到最优效果,验证了其泛化性和鲁棒性。此方法在辅助医生诊断方面具有一定的应用前景。

关键词: 图像分割, 小目标, 局部感知, 全局感知, 语义流对齐

Abstract: A white matter hyperintensity segmentation method called PGF-Net is proposed,which combines local and global perception with semantic flow alignment,to address the characteristic of small targets in high signal white matter.Firstly,it proposes the PAA(Patch Aware Attention) module,which enhances the ability to extract local features by dividing local small image blocks for feature selection.Secondly,it proposes to combine local and global aware attention modules(PGAA) and utilizes the characteristics of Transformer global perception to establish long-range dependencies.Lastly,it proposes a gated flow alignment module(GFAM) to predict the semantic flow offset field in the decoding section.Guide the expansion of high-level features in the decoder to achieve precise alignment and fusion with the corresponding low-level features in the encoder.Experimental results show that the PGF-Net achieves optimal performance in a self collected dataset,with a cross union ratio(mIoU) of 0.876 9,a Dice coefficient of 0.842 3,a Hausdorff distance(HD) of 32.61,and an average surface distance(ASD) of only 1.7.The model also achieves optimal performance on two small target public datasets,verifying its generalization and robustness.This method has certain application prospects in assisting doctors in diagnosis in the future.

Key words: Image segmentation, Small target, Local perception, Global perception, Semantic flow alignment

中图分类号:

TP391

张新峰, 郭依海, 刘晓民, 许忠贺, 李相生. 结合局部、全局感知与语义流对齐的脑白质高信号分割方法[J]. 计算机科学, 2026, 53(4): 291-298. https://doi.org/10.11896/jsjkx.250700057

ZHANG Xinfeng, GUO Yihai, LIU Xiaomin, XU Zhonghe, LI Xiangsheng. White Matter High Signal Segmentation Method Combining Local and Global Perception and Semantic Flow Alignment[J]. Computer Science, 2026, 53(4): 291-298. https://doi.org/10.11896/jsjkx.250700057

参考文献

[1]CAO J,ZHONG W,XIA Y,et al.The association between vascular white matter hyperintensities and cognitive function:a longitudinal study based on community populations[J].Chinese Journal of Clinical Neuroscience,2025,33(2):200-209,220.
[2]WANG C,XU J,FU Q,et al.The relationship between Fazekas grading of white matter hyperintensities and cognitive and neurological impairment in patients[J].Clinical and Educational Journal of General Practice,2025,23(4):331-333,354.
[3]ZHAN J,QIU W,LAN H,et al.Study on the severity of highsignal intensity in white matter and the distribution of network disease syndrome types[J].Modern Chinese Doctor,2025,63(2):1-4.
[4]FARKHANI S,DEMNITZ N,BORAXBEKK C J,et al.End-to-end volumetric segmentation of white matter hyperintensities using deep learning[J].Computer Methods and Programs in Biomedicine,2024,245:108008.
[5]DOSOVITSKIY A,BEYER L,KOLESNIKOV A,et al.Animage is worth 16x16 words:Transformers for image recognition at scale[J].arXiv:2010.11929,2020.
[6]LEE A,WOO I,KANG D,et al.Fully automated segmentation on brain ischemic and white matter hyperintensities lesions using semantic segmentation networks with squeeze-and-excitation blocks in MRI[J].Informatics in Medicine Unlocked,2020,21:100440-100440.
[7]PARK G,HONG J,DUFFY B A,et al.White matter hyperintensities segmentation using the ensemble U-Net with multi-scale highlighting foregrounds[J].NeuroImage,2025,237:118140.
[8]ZHANG X F,JIANG Y F,GUO S J,et al.Brain tissue segmentation method combining multi-scale and attention mechanisms[J].Journal of Jilin University(Engineering and Technology Edition),2025,55(10):3352-3360.
[9]GHAFOORIAN M,KARSSEMEIJER N,HESKES T,et al.Location Sensitive Deep Convolutional Neural Networks for Segmentation of White Matter Hyperintensities[J].Scientific Reports.2017,7(1):5110.
[10]XU S,ZHENG S,XU W,et al.HCF-Net:Hierarchical Context Fusion Network for Infrared SmallObject Detection[C]//2024 IEEE International Conference on Multimedia and Expo(ICME).2024.
[11]VASWANIA,SHAZEER N,PARMAR N,et al. Attention is all you need[C]//Advances in Neural Information Processing Systems.2017.
[12]SHI B,GAI S,DARRELL T,et al.Refocusing is key to transfer learning[J].arXiv:2305.15542,2023.
[13]WANGQ,WU B,ZHU P,et al.ECA-Net:Efficient Channel Attention for Deep Convolutional Neural Networks[C]//2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR).IEEE,2020.
[14]WOO S,PARK J,LEE J,et al.Cbam:Convolutional block attention module[C]//Proceedings of the European Conference on Computer Vision(ECCV).2018:3-19.
[15]LIU Z,LIN Y,CAO Y,et al.Swin transformer:Hierarchical vision transformer using shifted windows[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2021:10012-10022.
[16]LI X,YOU A,ZHU Z,et al.Semantic flow for fast and accurate scene parsing[C]//Computer Vision-ECCV.2020:775-793.
[17]JADERNERG M,SIMONYAN K,ZISSERMAN A.Spatial trans-former networks[C]//Advances in Neural Information Processing Systems.2015.
[18]PORWAL P,PACHADE S,KAMBLE R,et al.Indian diabetic retinopathy image dataset(IDRiD):a database for diabetic retinopathy screening research[J].Data,2018,3(3):25.
[19]KUIJF H,BIESBROEK J M,DE BRESSER J,et al.Data of the White Matter Hyperintensity(WMH) Segmentation Challenge[J].IEEE Transactions on Medical Imaging,2019,38(11):2556-2568.
[20]HUTTRNLOCHE R,DANIEL P,GREGORY A,et al.Comparing images using theHausdorff distance[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,1993,15(9):850-863.
[21]ZHAO R,QIAN B,ZHANG X,et al.Rethinking dice loss for medical image segmentation[C]//2020 IEEE International Conference on Data Mining(ICDM).IEEE,2020:851-860.
[22]WANG Y,MA X,CHEN Z,et al.Symmetric cross entropy for robust learning with noisy labels[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2019:322-330.
[23]ISENSEE F,PETERSEN J,KLEIN A,et al.nnu-net:Self-adapting framework for u-net-based medical image segmentation[J].arXiv:1809.10486,2018.
[24]HUANG H,LIN L,TONG R,et al.3+:A full-scale connectedunet for medical image segmentation[C]//IEEE International Conference on Acoustics,Speech and Signal Processing(ICASSP).2020.
[25]LIY,JING B,LI Z,et al.Plug-and-play segment anything model improves nnUNet performance[J].Medical Physics,2025,52(2):899-912.
[26]LEI M,WU H,LYU X,et al.Condseg:A general medical image segmentation framework via contrast-driven feature enhancement[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2025:4571-4579.

相关文章 15

[1]	唐心亮, 潘晓润, 王建超, 苏鹤. 融合ByteTrack的EAP-YOLOv8无人机Marker点检测与追踪 Integrate ByteTrack’s EAP-YOLOv8 UAV Marker Point Detection and Tracking 计算机科学, 2026, 53(3): 266-276. https://doi.org/10.11896/jsjkx.241100115
[2]	范家斌, 王宝会, 陈继轩. 基于文本-图像多模态融合的变电所布局图纸图符检测方法 Method for Symbol Detection in Substation Layout Diagrams Based on Text-Image MultimodalFusion 计算机科学, 2026, 53(1): 206-215. https://doi.org/10.11896/jsjkx.250200090
[3]	薛静艳, 夏佳楠, 霍蕊莉, 刘杰, 周雪忠. 基于深度学习的OCT/OCTA视网膜图像分析方法综述 Review of Retinal Image Analysis Methods for OCT/OCTA Based on Deep Learning 计算机科学, 2026, 53(1): 128-140. https://doi.org/10.11896/jsjkx.241100047
[4]	沈涛, 张秀再, 许岱. 改进RT-DETR的遥感图像小目标检测算法 Improved RT-DETR Algorithm for Small Object Detection in Remote Sensing Images 计算机科学, 2025, 52(8): 214-221. https://doi.org/10.11896/jsjkx.241000019
[5]	黄红, 苏菡, 闵鹏. 融合多尺度特征的无人机图像中小目标检测算法 Small Target Detection Algorithm in UAV Images Integrating Multi-scale Features 计算机科学, 2025, 52(6A): 240700097-5. https://doi.org/10.11896/jsjkx.240700097
[6]	石辛诚, 王宝会, 于利韬, 杜辉. 基于三维CT片的下肢骨解剖结构分割算法的研究 Study on Segmentation Algorithm of Lower Limb Bone Anatomical Structure Based on 3D CTImages 计算机科学, 2025, 52(6A): 240500119-9. https://doi.org/10.11896/jsjkx.240500119
[7]	张鑫艳, 唐振超, 李一夫, 刘振宇. 基于多尺度注意力和不确定性损失的两阶段左心房疤痕分割 Two-stage Left Atrial Scar Segmentation Based on Multi-scale Attention and Uncertainty Loss 计算机科学, 2025, 52(6): 264-273. https://doi.org/10.11896/jsjkx.241200197
[8]	耿胜, 丁卫平, 鞠恒荣, 黄嘉爽, 姜舒, 王海鹏. FDiff-Fusion:基于模糊逻辑驱动的医学图像扩散融合网络分割模型 FDiff-Fusion:Medical Image Diffusion Fusion Network Segmentation Model Driven Based onFuzzy Logic 计算机科学, 2025, 52(6): 274-285. https://doi.org/10.11896/jsjkx.240600006
[9]	彭琳娜, 张红云, 苗夺谦. 基于边缘约束和改进Swin Unetr的复杂器官分割方法 Complex Organ Segmentation Based on Edge Constraints and Enhanced Swin Unetr 计算机科学, 2025, 52(4): 177-184. https://doi.org/10.11896/jsjkx.240600007
[10]	王宏强, 赵晖, 贾振红. 基于特征增强和群组混合注意力的棉花病害检测 Cotton Disease Detection Based on Feature Enhancement and Group Mix Attention 计算机科学, 2025, 52(11A): 250200043-7. https://doi.org/10.11896/jsjkx.250200043
[11]	张伟, 蔡宇帆, 叶林涛, 刘大志. 基于特征提取增强和金字塔结构的实时Transformer小目标检测模型 Real-time Transformer Small Target Detection Model Based on Feature Extraction Enhancement and Pyramid Structure 计算机科学, 2025, 52(11A): 250100139-11. https://doi.org/10.11896/jsjkx.250100139
[12]	钟延杰, 蹇木伟, 张昊然, 凌钰坤. 频域纹理先验与特征增强的医学图像分割模型 Medical Image Segmentation Model Based on Frequency Texture Prior and Frequency Feature Enhancement Fusion 计算机科学, 2025, 52(11A): 241200125-8. https://doi.org/10.11896/jsjkx.241200125
[13]	陈崇杨, 彭力, 杨杰龙. 基于特征增强与上下文融合的无人机小目标检测算法 UAV Small Object Detection Algorithm Based on Feature Enhancement and Context Fusion 计算机科学, 2025, 52(11): 131-140. https://doi.org/10.11896/jsjkx.241000017
[14]	张弘森, 吴蔚, 徐建, 吴飞, 季一木. 基于小目标特征增强RT-DETR的SAR图像舰船目标检测方法 Ship Detection Method for SAR Images Based on Small Target Feature Enhanced RT-DETR 计算机科学, 2025, 52(10): 151-158. https://doi.org/10.11896/jsjkx.250100097
[15]	杨舒琪, 韩俊玲, 康晓东, 杨靖怡, 郭洪洋, 李博. 面向3D肝脏CT图像分割的改进vnet模型 Improved vnet Model for 3D Liver CT Image Segmentation 计算机科学, 2024, 51(6A): 230400038-6. https://doi.org/10.11896/jsjkx.230400038

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed

结合局部、全局感知与语义流对齐的脑白质高信号分割方法

White Matter High Signal Segmentation Method Combining Local and Global Perception and Semantic Flow Alignment

PDF (PC)

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

Metrics

本文评价

推荐阅读 0