基于高效空间信道信息编码的轻量级图像超分辨率重构

Lightweight image super-resolution reconstruction based on efficient spatial-channel information encoding

在线阅读下载PDF

导出

摘要作为计算机视觉的基础任务,单幅图像超分辨率(Single Image Super-Resolution,SISR)长期以来一直是一个备受关注的研究课题。近期的研究表明,Transformer的成功不仅归功于其自注意力(Self-Attention,SA)机制,还体现在其宏观框架和先进组件的整体设计上。空间池化、位移、多层感知机(Multi-Layer Perception,MLP)、傅里叶变换和常数矩阵等方法,具有与SA机制相似的空间信息编码能力,能够替代并实现与其相当的效果。基于这一发现,本文的目标是利用Transformer中优越的宏观架构与高效的空间信息编码技术结合,改进复杂度较高的SA机制,以提升SISR性能。具体而言,本文重新审视了空间卷积的设计,旨在通过卷积调制技术实现更高效的空间特征编码,并通过动态调制方法表达特征。提出的高效空间信息编码(Efficient Spatial Information Encoding,ESIE)层,采用大核卷积和Hadamard积的方式,模仿查询与键之间的点积操作,并实现与SA机制中值表示再校准类似的效果。因此,ESIE层不仅能够捕捉长程依赖和自适应行为,还能够保持线性计算复杂度。另一方面,针对传统前馈网络(Feed-Forward Network,FFN)在处理空间信息时的次优表现,本文在提出的高效通道信息编码(Efficient Channel Information Encoding,ECIE)层中引入了空间感知和动态自适应机制。该方法有助于增强特征的多样性,并有效地调节层间的信息流动。实验结果表明,本文提出的高效空间-通道信息编码网络(Efficient Spatial-Channel Information Encoding,ESCIEN)在定量和定性评估上均优于现有模型。 As a fundamental task of computer vision,Single Image Super-Resolution(SISR)is a hot topic that has been intensively studied for a long time.Recent researches have shown that the success of Transformers comes from their macro-level framework and advanced components,not just their Self-Attention(SA)mechanism.Spatial pooling,shifting,MLP,Fourier transform,and constant matrix,which all have spatial information encoding capabilities similar to SA,can replace SA and achieve comparable results.Based on these findings,this work aims to combine efficient spatial information encoding technology with superior macro architectures in Transformers for SISR.To this end,the paper rethinks spatial convolution to achieve more efficient encoding of spatial features and realizes dynamic modulation by convolutional modulation techniques.The large-kernel convolution and Hadamard product are utilized in the proposed Efficient Spatial Information Encoding(ESIE)layer to imitate the matrix multiplication between query and key and recalibration of value representations in SA.Therefore,ESIE layer also achieve long-range correlations and self-adaptation behavior,similar to SA,but only requires linear computational complexity.In addition,to address the sub-optimality of vanilla Feed-Forward Networks(FFN),the paper introduces spatial awareness and locality in the proposed Efficient Channel Information Encoding(ECIE)layer.It can improve feature diversity and regulate information flow between layers.Experimental results show that the proposed Efficient Spatial-Channel Information Encoding Network(ESCIEN)outperforms other models both quantitatively and qualitatively.Codes and trained models will be made available if the paper is accepted.

作者莫开治滕奇志任超 MO Kaizhi;TENG Qizhi;REN Chao(College of Electronics and Information Engineering,Sichuan University,Chengdu 610065,China)

机构地区四川大学电子信息学院

出处《智能计算机与应用》 2025年第8期1-9,共9页 Intelligent Computer and Applications

基金国家自然科学基金(62271336,62171304)。

关键词图像超分辨率空间信息编码卷积调制技术大核卷积 image super resolution spatial information encoding convolutional modulation technology large kernel convolution

分类号 TN911.73 [电子电信—通信与信息系统]

作者简介莫开治(1998-),男,硕士研究生,主要研究方向:图像处理;任超(1988-),男,博士,副研究员,主要研究方向:图像处理,计算机视觉,人工智能,多媒体通信与信息系统;通信作者:滕奇志(1961-),女,博士,教授,主要研究方向:多维数字信号处理,模式识别,计算机应用。Email:qzteng@scu.edu.cn。

引文网络
相关文献

参考文献1

1Zeyu Ren,Shuihua Wang,Yudong Zhang.Weakly supervised machine learning[J].CAAI Transactions on Intelligence Technology,2023,8(3):549-580. 被引量：3

二级参考文献4

1周志华.Multi-Instance Learning from Supervised View[J].Journal of Computer Science & Technology,2006,21(5):800-809. 被引量：12
2WANG Wei,ZHOU Zhi-Hua.Crowdsourcing label quality: a theoretical analysis[J].Science China(Information Sciences),2015,58(11):109-120. 被引量：7
3zhi-hua zhou.A brief introduction to weakly supervised learning[J].National Science Review,2018,5(1):44-53. 被引量：118
4Jia Dengqiang,Luo Xinzhe,Ding Wangbin,Huang Liqin,Zhuang Xiahai.SeRN:A Two-Stage Framework of Registration for Semi-Supervised Learning for Medical Images[J].Journal of Shanghai Jiaotong university(Science),2022,27(2):176-189. 被引量：1

共引文献2

1Hengame Ahmadi Golilarz,Alireza Azadbar,Roohallah Alizadehsani,Juan Manuel Gorriz.GAN‐MD:A myocarditis detection using multi‐channel convolutional neural networks and generative adversarial network‐based data augmentation[J].CAAI Transactions on Intelligence Technology,2024,9(4):866-878. 被引量：1
2邓志鹏,何施茗,杨根,满君丰.基于深度学习的纹理表面缺陷检测方法综述[J].计算机集成制造系统,2025,31(3):721-745. 被引量：2

1姜永祺,单慧琳.基于ECSMNet的风力发电机表面缺陷检测研究[J].电子测量与仪器学报,2025,39(5):166-176.
2刘文星,单慧琳,王兴涛,刘洁茹,陈戈,单梦姣.基于阶梯式残差与坐标信息重组的SAR船舰检测方法[J].光学学报,2025,45(9):323-341.
3卜扬,屈霞,陈涛,武伟宁.基于RCSI-YOLOv5的轴承表面缺陷检测方法[J].陕西科技大学学报,2025,43(2):203-214.
4毛德乾,高珊珊,吕海霞,张彩明,周元峰.基于局部Transformer的多尺度图像去雾网络[J].计算机辅助设计与图形学学报,2025,37(6):1006-1019.
5石军,王天同,朱子琦,赵敏帆,王炳勋,安虹.基于深度学习的医学图像分割方法综述[J].中国图象图形学报,2025,30(6):2161-2186.
6曾哲.教学论与靶向问题情境创设:小学跨学科写作新路径——以《日新月异的生活》跨学科课程为例[J].福建基础教育研究,2025(6):46-50.
7王培崇,张天颖,李丽荣.一种改进的海鸥优化算法及其应用[J].武汉大学学报(工学版),2025,58(6):991-998.
8梁颂扬,杨泽鑫.融合信息增强和兴趣演化的个性化推荐[J].现代信息科技,2025,9(15):122-127.
9周云龙,陈德富,刘小湖,桑伊健,周晗昀.基于改进Transformer的端到端说话人确认模型[J].计算机应用,2025,45(S1):89-94.
10张梦婉,郑洁菲,叶童,马宁.近20年全球木质林产品研究热点与演进趋势研究[J].西部林业科学,2025,54(4):141-149.

智能计算机与应用

2025年第8期

浏览历史

内容加载中请稍等...

基于高效空间信道信息编码的轻量级图像超分辨率重构

参考文献1

二级参考文献4

共引文献2

相关作者

相关机构

相关主题

浏览历史