Computer Science ›› 2025, Vol. 52 ›› Issue (4): 231-239.doi: 10.11896/jsjkx.240700039
• Computer Graphics & Multimedia • Previous Articles Next Articles
SUN Tanghui1, ZHAO Gang1,2, GUO Meiqian1
CLC Number:
[1]BORSOS Z,MARINIER R,VINCENT D,et al.Audiolm:a language modeling approach to audio generation[J].IEEE/ACM Transactions on Audio,Speech,and Language Processing,2023,31:2523-2533. [2]WU J,GAUR Y,CHEN Z,et al.On decoder-only architecture for speech-to-text and large language model integration[C]//2023 IEEE Automatic Speech Recognition and Understanding Workshop(ASRU).IEEE,2023:1-8. [3]WU J,JIANG Y,LIU Q,et al.General object foundation model for images and videos at scale[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2024:3783-3795. [4]LI M,LI S,ZHANG X,et al.Univs:Unified and universal video segmentation with prompts as queries[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2024:3227-3238. [5]SCHICK T,DWIVEDI-YU J,DESSÌ R,et al.Toolformer:Language models can teach themselves to use tools[J].Advances in Neural Information Processing Systems,2024,36:68539-68551. [6]KASNECI E,SEßLER K,KÜCHEMANN S,et al.ChatGPT for good? On opportunities and challenges of large language models for education[J].Learning and Individual Differences,2023,103:102274. [7]RUSSAKOVSKY O,DENG J,SU H,et al.Imagenet large scale visual recognition challenge[J].International Journal of Computer Vision,2015,115:211-252. [8]KRIZHEVSKY A,HINTON G.Learning multiple layers of fea-tures from tiny images[J].Handbook of Systemic Autoimmune Diseases,2009,1(4):1-58. [9]LIN T Y,MAIRE M,BELONGIE S,et al.Microsoft coco:Common objects in context [C]//Computer Vision-ECCV 2014: 13th European Conference,Zurich,Switzerland,September 6-12,2014,Proceedings,Part V 13.Springer International Publishing,2014:740-755. [10]DESUKY A S,HUSSAIN S.An improved hybrid approach for handling classimbalance problem[J].Arabian Journal for Science and Engineering,2021,46:3853-3864. [11]HOYOS-OSORIO J,ALVAREZ-MEZA A,DAZA-SANTACO-LOMA G,et al.Relevant information undersampling to support imbalanced data classification[J].Neurocomputing,2021,436:136-146. [12]CHEN X,ZHOU Y,WU D,et al.Area:adaptive reweighting via effective area for long-tailed classification[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2023:19277-19287. [13]HUANG C,LI Y,LOY CC,et al.Learning deep representation for imbalanced classification [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:5375-5384. [14]KANG B,XIE S,ROHRBACH M,et al.Decoupling representation and classifier for long-tailed recognition[EB/OL].[2024-04-24].https://arxiv.org/abs/1910.09217. [15]ZHOU B,CUI Q,WEI X S,et al.Bbn: Bilateral-branch network with cumulative learning for long-tailed visual recognition[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:9719-9728. [16]KANG B,LI Y,XIE S,et al.Exploring balanced feature spaces for representation learning[C]//International Conference on Learning Representations.2021. [17]LI Y,HOU Q,ZHENG Z,et al.Large selective kernel network for remote sensing object detection[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2023:16794-16805. [18]MIENYE I D,SUN Y.Performance analysis of cost-sensitivelearning methods with application to imbalanced medical data[J].Informatics in Medicine Unlocked,2021,25:100690. [19]YEUNG M,SALA E,SCHÖNLIEBC B,et al.Unified focalloss:Generalising dice and cross entropy-based losses to handle class imbalanced medical image segmentation[J].Computerized Medical Imaging and Graphics,2022,95:102026. [20]HUYNH T,NIBALI A,HE Z.Semi-supervised learning formedical image classification using imbalanced training data[J].Computer Methods and Programs in Biomedicine,2022,216:106628. [21]KIM H E,COSA-LINAN A,SANTHANAMN,et al.Transfer learning for medical image classification:a literature review[J].BMC Medical Imaging,2022,22(1):69. [22]GALDRAN A,CARNEIRO G,GONZÁLEZ BALLESTER MA.Balanced-mixup for highly imbalanced medical image classification[C]//Medical Image Computing and Computer Assisted Intervention-MICCAI 2021:24th International Conference,Strasbourg,France,September 27-October 1,2021,Procee-dings,Part V 24.Springer International Publishing,2021:323-333. [23]DING H,HUANG N,CUI X.Leveraging GANs data augmentation for imbalanced medical image classification[J].Applied Soft Computing,2024,165:112050. [24]ALQUDAH A,ALQUDAH A M.Sliding window based deep ensemble system for breast cancer classification[J].Journal of Medical Engineering & Technology,2021,45(4):313-323. [25]GUO M H,LU C Z,LIU Z N,et al.Visual attention network[J].Computational Visual Media,2023,9(4):733-752. [26]LIN T Y,GOYAL P,GIRSHICK R,et al.Focal Loss for dense object detection[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:2980-2988. [27]CUI Y,JIA M,LIN T Y,et al.Class-balanced loss based on effective number of samples[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:9268-9277. [28]ZHANG M,SU H,WEN J.Classification of flower image based on attention mechanism and multi-Loss attention network[J].Computer Communications,2021,179:307-317. [29]CAO K,WEI C,GAIDON A,et al.Learning imbalanced data-sets with label-distribution-aware margin loss[J].Advances in Neural Information Processing Systems,2019,32:1-12. [30]ZHONG Z,CUI J,LIU S,et al.Improving calibration for long-tailed rerecognition[C]//Proeedings of the IEEE/CVF Confe-rence on Computer Vision and Pattern Recognition.2021:16489-16498. [31]ZHU L,WANG X,KE Z,et al.Biformer:Vision transformer with bi-level routing attention[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2023:10323-10333. [32]YANG L,ZHANG R Y,LI L,et al.Simam:A simple,parameter-free attention module for convolutional neural networks[C]//International Conference on Machine Learning.PMLR,2021:11863-11874. [33]SIMONYAN K,ZISSERMAN A.Very deep convolutional net-works for large-scale image recognition.[EB/OL].[2024-04-24].https://arxiv.org/abs/1409.1556. [34]HAN Z,WEI B,ZHENG Y,et al.Breast cancer multi-classification from histopathological images with structured deep learning model[J].Scientific reports,2017,7(1):4172. [35]HE K,ZHANG X,REN S,et al.Deep residual learning for image recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:770-778. |
[1] | ZHENG Xu, HUANG Xiangjie, YANG Yang. Reversible Facial Privacy Protection Method Based on “Invisible Masks” [J]. Computer Science, 2025, 52(5): 384-391. |
[2] | GAO Wei, WANG Lei, LI Jianan, LI Shuailong, HAN Lin. Operator Fusion Optimization for Deep Learning Compiler TVM [J]. Computer Science, 2025, 52(5): 58-66. |
[3] | MIAO Zhuang, CUI Haoran, ZHANG Qiyang, WANG Jiabao, LI Yang. Restoration of Atmospheric Turbulence-degraded Images Based on Contrastive Learning [J]. Computer Science, 2025, 52(5): 171-178. |
[4] | SUN Jinyong, WANG Xuechun, CAI Guoyong, SHANG Zhiliang. Open Set Recognition Based on Meta Class Incremental Learning [J]. Computer Science, 2025, 52(5): 187-198. |
[5] | LI Xiaolan, MA Yong. Study on Lightweight Flame Detection Algorithm with Progressive Adaptive Feature Fusion [J]. Computer Science, 2025, 52(4): 64-73. |
[6] | DENG Ceyu, LI Duantengchuan, HU Yiren, WANG Xiaoguang, LI Zhifei. Joint Inter-word and Inter-sentence Multi-relationship Modeling for Review-basedRecommendation Algorithm [J]. Computer Science, 2025, 52(4): 119-128. |
[7] | PENG Linna, ZHANG Hongyun, MIAO Duoqian. Complex Organ Segmentation Based on Edge Constraints and Enhanced Swin Unetr [J]. Computer Science, 2025, 52(4): 177-184. |
[8] | KONG Jialin, ZHANG Qi, WEI Jianze, LI Qi. Adaptive Contextual Learning Method Based on Iris Texture Perception [J]. Computer Science, 2025, 52(4): 185-193. |
[9] | ZHOU Yi, MAO Kuanmin. Research on Individual Identification of Cattle Based on YOLO-Unet Combined Network [J]. Computer Science, 2025, 52(4): 194-201. |
[10] | MENG Sijiang, WANG Hongxia, ZENG Qiang, ZHOU Yang. Multi-view and Multi-scale Fusion Attention Network for Document Image Forgery Localization [J]. Computer Science, 2025, 52(4): 327-335. |
[11] | WANG Yifei, ZHANG Shengjie, XUE Dizhan, QIAN Shengsheng. Self-supervised Backdoor Attack Defence Method Based on Poisoned Classifier [J]. Computer Science, 2025, 52(4): 336-342. |
[12] | WANG Tao, BAI Xuefei, WANG Wenjian. Selective Feature Fusion for 3D CT Image Segmentation of Renal Cancer Based on Edge Enhancement [J]. Computer Science, 2025, 52(3): 41-49. |
[13] | WANG Jie, WANG Chuangye, XIE Jiucheng, GAO Hao. Animatable Head Avatar Reconstruction Algorithm Based on Region Encoding [J]. Computer Science, 2025, 52(3): 50-57. |
[14] | ZHONG Yue, GU Jieming. 3D Reconstruction of Single-view Sketches Based on Attention Mechanism and Contrastive Loss [J]. Computer Science, 2025, 52(3): 77-85. |
[15] | WANG Yuan, HUO Peng, HAN Yi, CHEN Tun, WANG Xiang, WEN Hui. Survey on Deep Learning-based Meteorological Forecasting Models [J]. Computer Science, 2025, 52(3): 112-126. |
|