Computer Science ›› 2026, Vol. 53 ›› Issue (5): 218-227.doi: 10.11896/jsjkx.250700046

• Computer Graphics & Multimedia • Previous Articles     Next Articles

Frequency Driven Multi-scale Image Super-resolution Method

YANG Hongju, ZHANG Ziyang, LI Yao   

  1. School of Computer and Information Technology, Shanxi University, Taiyuan 030006, China
  • Received:2025-07-08 Revised:2025-09-26 Published:2026-05-08
  • About author:YANG Hongju,born in 1975,Ph.D,associate professor,master supervisor,is a member of CCF(No.10371M).Her main research interests include compu-ter vision and deep learning.
  • Supported by:
    National Natural Science Foundation of China(62376145),Natural Science Foundation of Shanxi Province,China(202303021211024) and Key Project of the National Natural Science Foundation of China(U24A20323).

Abstract: Single-image super-resolution is a critical technique aimed at reconstructing high-resolution images from single low-resolution inputs.It plays a pivotal role in various fields such as medical image enhancement,satellite remote sensing,security surveillance,and digital media content optimization.The core of SISR lies in restoring the lost high-frequency details to improve visual quality.However,this task faces numerous challenges due to its ill-posed inverse problem nature:traditional interpolation methods struggle to reconstruct complex details;convolutional neural network-based approaches can extract local features but fall short in global modeling;while Transformers excel at capturing long-range dependencies,they have limited capabilities for refining high-frequency information,leading to blurred edges and distorted textures in reconstructed images.To address these issues,this study proposes an innovative model that leverages wavelet transformation for multi-scale image decomposition,optimizing the extraction of high-frequency information.Three key modules are designed:the wavelet refinement module enhances the processing of high-frequency details;the shifted rectangular feature enhancement module captures global context;and the multi-scale wavelet fusion module integrates high-frequency priors with global features.This method significantly improves texture clarity and edge sharpness,balancing both local details and overall consistency.Experimental results demonstrate that the proposed model outperforms existing techniques on benchmark datasets such as Set5,Set14,and BSD100,achieving an average peak signal-to-noise ratio improvement of approximately 0.3 dB,along with superior subjective visual quality.This research not only effectively tackles the challenge of high-frequency information recovery but also provides new insights into the field of single-image super-resolution,holding significant academic and practical value.

Key words: Single-image super-resolution, Transformer, Convolutional neural networks, Wavelet transform, High-frequency information optimization, Multi-scale processing

CLC Number: 

  • TP183
[1]MUDUNURI S P,BISWAS S.Low resolution face recognition across variations in pose and illumination[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2015,38(5):1034-1040.
[2]WAN X F,HAN F,JIANG X Q,et al.Research on super-resolution restoration of video surveillance images[J].Laser Journal,2014,35(3):5-8.
[3]GREENSPAN H.Super-resolution in medical imaging[J].The Computer Journal,2009,52(1):43-63.
[4]MENG F,WU S,LI Y,et al.Single remote sensing image super-resolution via a generative adversarial network with stratified dense sampling and chain training[J].IEEE Transactions on Geoscience and Remote Sensing,2023,62:1-22.
[5]JIANG N,WANG L.Quantum image scaling using nearestneighbor interpolation[J].Quantum Information Processing,2015,14:1559- 1571.
[6]KIRKLAND E J.Advanced Computing in Electron Microscopy[M].Springer,2010:261-263.
[7]GAO S K,GRUEV V.Bilinear and bicubic interpolation me-thods for division of focal plane polarimeters[J].Optics Express,2011,19(27):26161- 26173.
[8]KIM JW,LEE J K,LEE K M.Accurate image super-resolutionusing very deep convolutional networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE,2016:1646- 1654.
[9]LEDIG C,THEIS L,HUSZÁR F,et al.Photo-realistic single image super-resolution using a generative adversarial network[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE,2017:4681- 4690.
[10]HAN K,WANG Y H,CHEN H T,et al.A survey on vision transformer[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2022,45(1):87-110.
[11]ZHANG D S.Wavelet transform[M].Cham:Springer,2019:35-44.
[12]SUNKAVALLI K,JOHNSON M K,MATUSIK W,et al.Multi-scale image harmonization[J].ACM Transactions on Graphics,2010,29(4):1-10.
[13]DONG C,LOY C C,HE K M,et al.Learning a deep convolutional network for image super-resolution[C]//Computer Vision-ECCV 2014:13th European Conference.Springer,2014:184-199.
[14]ZHANG Y L,LI K P,LI K,et al.Image super-resolution using very deep residual channel attention networks[C]//Proceedings of the European Conference on Computer Vision(ECCV).Cham:Springer,2018:286-301.
[15]HUI Z,GAO X B,YANG Y C,et al.Lightweight image super-resolution with information multi-distillation network[C]//Proceedings of the 27th ACM International Conference on Multimedia.Cham:Springer,2019:2024-2032.
[16]LIU J,TANG J,WU G S.Residualfeature distillation network for lightweight image super-resolution[C]//Proceedings of Computer Vision-ECCV 2020 Workshops.Cham:Springer,2020:41-55.
[17]ZHOU S C,ZHANG J W,ZUO W M,et al.Cross-scale internal graph neural network for image super-resolution[J].Advances in Neural Information Processing Systems,2020,33:3499-3509.
[18]NIU B,WEN W L,REN W Q,et al.Single image super-resolution via a holistic attention network[C]//Proceedings of Computer Vision-ECCV 2020:16th European Conference.Cham:Springer,2020:191-207.
[19]VASWANI A,SHAZEER N,PARMAR N,et al.Attention isall you need[C]//Advances in Neural Information Processing Systems.2017.
[20]LIANG J Y,CAO J Z,SUN G L,et al.SwinIR:Image restoration usingswin transformer[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.Piscataway,NJ:IEEE,2021:1833-1844.
[21]DONG X Y,BAO J M,CHEN D D,et al.Cswin transformer:a general vision transformer backbone with cross-shaped windows[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE,2022:12124-12134.
[22]CHEN X Y,WANG X T,ZHOU J T,et al.Activating more pixels in image super-resolution transformer[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE,2023:22367-22377.
[23]WOO S,PARK J,LEE J Y,et al.Cbam:Convolutional block attention module[C]//Proceedings of the European Conference on Computer Vision(ECCV).Cham:Springer,2018:3-19.
[24]LI A,ZHANG L,LIU Y,et al.Feature modulation transformer:Cross-refinement of global representation via high-frequency prior for image super-resolution[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.Piscataway,NJ:IEEE,2023:12514-12524.
[25]MALLAT S G.Multifrequency channel decompositions of images and wavelet models[J].IEEE Transactions on Acoustics,Speech,and Signal Processing,2002,37(12):2091-2110.
[26]DUAN L J,WU C L,EN Q,et al.Image Super-Resolution Algorithm of Deep Residual Network Based on Wavelet Domain[J].Journal of Software,2019,30(4):941-953.
[27]HUANG H B,HE R,SUN Z N,et al.Wavelet-srnet:Awavelet-based cnn for multi-scale face super resolution[C]//Procee-dings of the IEEE International Conference on Computer Vision.Piscataway,NJ:IEEE,2017:1689-1697.
[28]MA C Y,ZHU J W,LI Y J,et al.Single image super resolution via wavelet transform fusion and SRFeat network[J].Journal of Ambient Intelligence and Humanized Computing,2022,13(11):1-9.
[29]DHAREJO F A,DEEBA F,ZHOU Y C,et al.Twist-gan:towards wavelet transform and transferred gan for spatio-temporal single image super resolution[J].ACM Transactions on Intelligent Systems and Technology,2021,12(6):1-20.
[30]SHI W Z,CABALLERO J,HUSZÁR F,et al.Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE,2016:1874-1883.
[31]WANG W,SUN Y J,WANG X.Lightweight frequency and spatial feature fused multi-scale remote sensing sceneclassification network[J].Journal of Jilin University(Engineering and Technology Edition),2025,55(10):3361-3371.
[32]骆迪,张乾.基于多尺度注意力的图像超分辨研究[J/OL].重庆工商大学学报(自然科学版),1-14[2025-12-08].https://link.cnki.net/urlid/50.1155.N.20240926.0921.002.
[33]ZHONG M Y,JIANG L,LI C.Image super-resolution reconstruction algorithm based on multi-scale feature extraction residual network[J].Journal of Chongqing University of Posts and Telecommunications(Natural Science Edition),2024,36(1):76-85.
[34]FINDER S E,AMOYAL R,TREISTER E,et al.Wavelet convolutions for large receptive fields[C]//Proceedings of the European Conference on Computer Vision.Cham:Springer,2024:363-380.
[35]WANG W X,CHEN W,QIU Q B,et al.Crossformer++:A versatile vision transformer hinging on cross-scale attention[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2023,46(5):3123-3136.
[36]ZAMIR S W,ARORA A,KHAN S,et al.Restormer:Efficient transformer for high-resolution image restoration[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE,2022:5728-5739.
[37]LIM B,SON S,KIM H,et al.Enhanced deep residual networks for single image super-resolution[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops.Piscataway,NJ:IEEE,2017:136-144.
[38]LI W B,ZHOU K,QI L,et al.Lapar:Linearly-assembled pixel-adaptive regression network for single image super-resolution and beyond[J].Advances in Neural Information Processing Systems,2020,33:20343-20355.
[39]LUO X T,XIE Y,ZHANG Y L,et al.Latticenet:Towardslightweight image super-resolution with lattice block[C]//Proceedings of Computer Vision-ECCV 2020:16th European Conference.Cham:Springer,2020:272-289.
[40]HUI Z,GAO X B,YANG Y C,et al.Lightweight image super-resolution with information multi-distillation network[C]//Proceedings of the 27th ACM International Conference on Multimedia.New York:ACM,2019:2024-2032.
[41]AHN N,KANG B,SOHN K A.Fast,accurate,and lightweight super-resolution with cascading residual network[C]//Procee-dings of the European Conference on Computer Vision(ECCV).Cham:Springer,2018:252-268.
[42]SUN B,ZHANG Y L,JIANG S Y,et al.Hybrid pixel-unshuffled network for lightweight image super-resolution[C]//Proceedings of the AAAI Conference on Artificial Intelligence.Washington,DC:AAAI,2023:2375-2383.
[43]LU Z S,LI J C,LIU H,et al.Transformer for single image super-resolution[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE,2022:457-466.
[44]RAY A,KUMAR G,KOLEKAR M H.CFAT:Unleashing Triangular Windows for Image Super-Resolution[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2024:26120-26129.
[45]NI Z K,ZHANG Y,YANG W H,et al,Structural Similarity-Inspired Unfolding for Lightweight Image Super-Resolution[J].IEEE Transactions on Image Processing,2025,34:3861-3872.
[1] LI Tengjia, MA Chun’ai. Multi-scale Transformer Oil Price Prediction Framework with AEMD and Trend Cross-attention [J]. Computer Science, 2026, 53(5): 157-163.
[2] LI Zongmin, WANG Li, LI Yachuan, LIU Yujie, RONG Guangcai, LIU Weihan, MA Wenkang. High-accuracy Human Pose Estimation Combining Wavelet Analysis and Frequency-DomainAttention [J]. Computer Science, 2026, 53(5): 228-236.
[3] GUO Jingchen, YANG Kuiwu, DING Mengdi, WEI Jianghong. Survey of Adversarial Sample Attacks for Vision Transformer [J]. Computer Science, 2026, 53(5): 404-418.
[4] XIN Yichen, LI Shichong, CHEN Bin, CHENG Zhangtao, LI Ye, ZHOU Fan. Enhancing Temporal Knowledge Graph Reasoning Method with Graph Information Bottleneck and Transformer [J]. Computer Science, 2026, 53(4): 393-405.
[5] CHEN Han, XU Zefeng, JIANG Jiu, FAN Fan, ZHANG Junjian, HE Chu, WANG Wenwei. Large Language Model and Deep Network Based Cognitive Assessment Automatic Diagnosis [J]. Computer Science, 2026, 53(3): 41-51.
[6] LI Zequn, DING Fei. Fatigue Driving Detection Based on Dual-branch Fusion and Segmented Domain AdaptationTransfer Learning [J]. Computer Science, 2026, 53(3): 78-87.
[7] YU Ding, LI Zhangwei. Prediction Method of RNA Secondary Structure Based on Transformer Architecture [J]. Computer Science, 2026, 53(3): 375-382.
[8] LI Jiahao, JING Junchang, XU Qian, LIU Dong. GTKT:Knowledge Tracing Model Integrating Connectivism Learning and Multi-layer TemporalGraph Transformer [J]. Computer Science, 2026, 53(2): 78-88.
[9] PAN Jian, WANG Xuhao. Time Series Forecasting Model Integrating Multi-scale Features and Attention Mechanism [J]. Computer Science, 2026, 53(2): 180-186.
[10] LIU Chenhong, LI Fenglian, YANG Jia, WANG Suzhe, CHEN Guijun. Boundary-focused Multi-scale Feature Fusion Network for Stroke Lesion Segmentation [J]. Computer Science, 2026, 53(2): 264-272.
[11] WANG Cheng, JIN Cheng. KAN-based Unsupervised Multivariate Time Series Anomaly Detection Network [J]. Computer Science, 2026, 53(1): 89-96.
[12] DENG Jiayan, TIAN Shirui, LIU Xiangli, OUYANG Hongwei, JIAO Yunjia, DUAN Mingxing. Trajectory Prediction Method Based on Multi-stage Pedestrian Feature Mining [J]. Computer Science, 2025, 52(9): 241-248.
[13] HU Hailong, XU Xiangwei, LI Yaqian. Drug Combination Recommendation Model Based on Dynamic Disease Modeling [J]. Computer Science, 2025, 52(9): 96-105.
[14] DING Zhengze, NIE Rencan, LI Jintao, SU Huaping, XU Hang. MTFuse:An Infrared and Visible Image Fusion Network Based on Mamba and Transformer [J]. Computer Science, 2025, 52(8): 188-194.
[15] LIU Huayong, XU Minghui. Hash Image Retrieval Based on Mixed Attention and Polarization Asymmetric Loss [J]. Computer Science, 2025, 52(8): 204-213.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!