Enhanced Panoramic Image Generation with GAN and CLIP Models

Abstract: Panoramic images offer a 360-degree view and are essential in virtual reality (VR) and augmented reality (AR), where high-quality textures enhance realism. However, acquiring complete, high-quality panoramic textures is challenging. This paper introduces a method that uses generative adversarial networks (GANs) and the contrastive language-image pretraining (CLIP) model to restore and control texture in panoramic images. The GAN model captures complex structures and maintains consistency, while CLIP enables fine-grained texture control via semantic text-image associations. GAN inversion optimizes latent codes to recover precise texture details. The resulting low dynamic range (LDR) images are converted to high dynamic range (HDR) using the Blender engine for seamless texture blending. Experimental results demonstrate the effectiveness and flexibility of this method in panoramic texture restoration and generation.
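The GAN inversion step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the generator architecture, the loss, or the optimizer, so the example below assumes a toy linear "generator" and plain gradient descent on a squared reconstruction error, and it leaves out any CLIP-based term. All names (`generate`, `invert`, `reconstruction_error`, `W`) are hypothetical.

```python
def generate(weights, z):
    """Toy linear stand-in for a generator: latent code z -> flat list of pixel values."""
    return [sum(w * zi for w, zi in zip(row, z)) for row in weights]


def reconstruction_error(weights, z, target):
    """Sum of squared differences between the generated and target pixels."""
    return sum((p - t) ** 2 for p, t in zip(generate(weights, z), target))


def invert(weights, target, steps=200, lr=0.1):
    """Estimate the latent code for `target` by gradient descent on the reconstruction error."""
    latent_dim = len(weights[0])
    z = [0.0] * latent_dim  # start from the origin of latent space
    for _ in range(steps):
        residual = [p - t for p, t in zip(generate(weights, z), target)]
        # Gradient of sum(residual^2) with respect to z is 2 * W^T * residual.
        grad = [
            2.0 * sum(weights[i][j] * residual[i] for i in range(len(weights)))
            for j in range(latent_dim)
        ]
        z = [zj - lr * gj for zj, gj in zip(z, grad)]
    return z


# Hypothetical fixed generator weights (6 "pixels", 3 latent dimensions).
W = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
    [0.5, 0.5, 0.0],
    [0.0, 0.5, 0.5],
    [0.5, 0.0, 0.5],
]
z_true = [0.5, -1.0, 0.25]       # latent code used to synthesize the target
target = generate(W, z_true)     # the "image" to be inverted
z_hat = invert(W, target)        # recovered latent code
```

In a real setting the generator would be a deep non-linear network, so the gradient would come from automatic differentiation and the optimization would generally reach only an approximate latent code. A text-guided term, such as a CLIP similarity score, would be added to the loss rather than replacing the reconstruction term.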

Authors: Shilong Li, Qiang Zhao

Affiliation: School of Automation

Source: Journal of Beijing Institute of Technology (北京理工大学学报, English edition), 2025, Issue 1, pp. 91-101 (11 pages)

Keywords: panoramic images; environment texture; generative adversarial networks (GANs); contrastive language-image pretraining (CLIP) model; Blender engine; fine-grained control; texture generation

Classification number: TP391.41 [Automation and Computer Technology: Computer Application Technology]

Author biographies: Shilong Li (corresponding author) is currently pursuing a Ph.D. at Hangzhou Dianzi University under the supervision of Professor Qiang Zhao. The primary research focus is panoramic image processing. Email: 244060063@hdu.edu.cn. Qiang Zhao received the B.Eng. degree in software engineering and the Ph.D. degree in computer science and technology from Tianjin University, Tianjin, China, in 2009 and 2016, respectively. He is currently a Professor with the School of Communication Engineering, Hangzhou Dianzi University, Hangzhou, China. Before that, he was an Assistant Professor and Associate Professor with the Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China. His main research interests include image-based rendering, feature extraction, and panoramic image processing.
